Tag: firefox

November 18, 2024

JavaScript: Extracting Web Content You Cannot Copy

There are many times I need to copy “stuff” from a website that is structured in such a way that simply copy/pasting the table data is impossible. Screen prints work, but I usually want the table of data in Excel so I can add notations and such. In these cases, running JavaScript from the browser’s developers console lets you access the underlying text elements.

Right click on one of the text elements and select “Inspect”

Now copy the element’s XPath

Read the value — we don’t generally want just this one element … but the path down to the “tbody” tag looks like a reasonable place to find the values within the table.

/html/body/div[1]/div/div/div[2]/div[2]/div[2]/div/div[3]/div/div/div[3]/div/div/div/table/tbody/a[4]/td[2]/div/span[2]

Use JavaScript to grab all of the TD elements under the tbody:

// Define the XPath expression to select all <td> elements within the specific <tbody>
const xpathExpression = "/html/body/div[1]/div/div/div[2]/div[2]/div[2]/div/div[3]/div/div/div[3]/div/div/div/table/tbody//td";

// Use document.evaluate to get all matching <td> nodes
const nodesSnapshot = document.evaluate(xpathExpression, document, null, XPathResult.ORDERED_NODE_SNAPSHOT_TYPE, null);

// Log the number of nodes found (for debugging purposes)
console.log("Total <td> elements found:", nodesSnapshot.snapshotLength);

// Iterate over the nodes and log their text content
for (let i = 0; i < nodesSnapshot.snapshotLength; i++) {
    let node = nodesSnapshot.snapshotItem(i);
    if (node) {
        const textContent = node.textContent.trim();
        if (textContent) { // Only log non-empty content
            console.log(textContent);
        }
    }
}

Voila! I redacted some data below, but it’s just a list of values, one per line.

March 14, 2021

Firefox Session Store Backups

Writing it down this time … so I don’t have to figure it out next time Scott’s Firefox sessions poof away — Firefox stores the session (importantly the tabs that you’ve got opened) at ~/.mozilla/firefox/<funky guid looking thing> default/sessionstore-backups

February 5, 2021

Browser How-To: Using the Developer Console

The developer console will show client-side errors. You can also use it to interact with data on a web page (like the approaches I’ve published to exporting data from Teams). To display the developer console, use Ctrl+Shift+i

When you first display the console, you may want to clear the existing output – it’s difficult to correlate the errors to discrete actions you’ve taken on the website. Once the console is clear, perform the action again and watch for errors as you perform each individual operation.

Clearing console output on Firefox:

Clearing console output on Chrome:

February 5, 2021

Browser How-To: Hard Refresh

For both Firefox and Chrome, you can hold CTRL while pressing F5 and bypass cache when reloading a web page. This is particularly useful when the JavaScript for a page has been updated — rather than clearing your entire cache, you can hard refresh the page to get the updated JS file.