Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_0597 |
Symbol | |
ID | 5170916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | - |
Start bp | 593816 |
End bp | 594826 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640563105 |
Product | TRAP dicarboxylate transporter, DctP subunit |
Protein accession | YP_001244194 |
Protein GI | 148269734 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000135069 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAAAA AGCTTCTTTT GATGCTGATT TTAGGCGTGT TTTTAGTGTC GGCTGTGTTC GGTGCAAAGT ACACACTGAG ATTTGGACAC GTTCTTGCTC CAGGTGAACC GTACCATCAG GCGTTTCTGA AATGGGCCAA GGCAGTTGAG GAGAGAACGA ACGGCGATGT TAGAATTGAA GTCTTCCCGA GTTCACAGCT TGGTGTGGAA GAGGACATCA TCGAGCAGAT CAGAATGGGA GCGCCTGTGG GATGGAACAC GGACTCTGCA AGGCTTGGAA TGTACGTTAA AGACATCGGC GTCATGAACC TTGCATACTT CATCGATTTC ATGGGCGCAA AAACTCCTGA AGAGGCGATG GAGGTTCTGA AGAAGATCAA GCAATCTCCA ACAATGCAGA AGTGGTTGAA AGAGTTGGAA CAGAAATTCG GTATAAAGGT TCTCTCCTTC TACTGGGTGC AGGGTTACAG ACACTTTGTG ACAAACAAAC CCATCAGGGA ACCGGAAGAC CTGAATGGTT TGAGAATCAG AACTCCCGGT GCGCCTGCAT GGCAGGAATC CATAAGAGCT CTTGGTGCCA TCCCTGTTGC CGTCAACTTC GGAGAGATCT ACACAGCGGT CCAGACGAAA GCGGTCGATG GAGCAGAACT CACTTACGCG AATGTTTACA ACGGTGGTCT ATACGAAGTT TTGAAGTACA TGTCTGAAAC GGGACACTTC CTTCTCATCA ACTTCGAAAT CGTCAGCGCA GACTGGTTCA ACAGCCTGCC CAAGGAATAT CAGAAGATTA TTGAGGAAGA GATTGACAAA GCGGGAATCG AAGTTTCTCT CAAAATCATG AAAGAACTCG AAGAAGAATA CAAACAGAAG TGTATTGAAA AGGGTATGAC AGTGATACCA GCTTCTGAAA TCGACAAGGA AGCCTTTATG GAAAGGGCAA AACAGGCTTA CAAGAATCTC GGTCTTGAGG ATGCTCTCAA CCAGTTGATC AAGGAAGTGA AGGGAGAGTA A
|
Protein sequence | MGKKLLLMLI LGVFLVSAVF GAKYTLRFGH VLAPGEPYHQ AFLKWAKAVE ERTNGDVRIE VFPSSQLGVE EDIIEQIRMG APVGWNTDSA RLGMYVKDIG VMNLAYFIDF MGAKTPEEAM EVLKKIKQSP TMQKWLKELE QKFGIKVLSF YWVQGYRHFV TNKPIREPED LNGLRIRTPG APAWQESIRA LGAIPVAVNF GEIYTAVQTK AVDGAELTYA NVYNGGLYEV LKYMSETGHF LLINFEIVSA DWFNSLPKEY QKIIEEEIDK AGIEVSLKIM KELEEEYKQK CIEKGMTVIP ASEIDKEAFM ERAKQAYKNL GLEDALNQLI KEVKGE
|
| |