Gene Tpet_1677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_1677 
Symbol 
ID5171299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp1672949 
End bp1674916 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content45% 
IMG OID640564203 
Productextracellular solute-binding protein 
Protein accessionYP_001245258 
Protein GI148270798 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00224475 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAGGA GAGTTCTGTG GGGCCTTCTT GTAGTAATCT TCGCAGCCCA GATCCTTGCT 
ATTGGACTCA ACAAAATCGT TCCCGGTGAG TATTACAATC TCACCGACTA CGAACGTCTG
ACAGGCAAGA AGATCACGAA ATTCAACGAA GCACCGATGT TGAAAGAGAT GGTTGAGAAA
GGACTGCTTC CACCTGTGGA GGAAAGGCTT CCGAAGAATC CGGTTGTGGT GACACCTTAT
GAGGAGATAG GTCAATACGG TGGTACCTGG AGAAGAGTAT GGTTCGGCCT TCCGGATCAG
CCCAATGTCG ATAAGATCGC TGTCGAGAAA CTCGTGATGT TCGATAAGAC CGGTGGGGTA
ATACTTCCGA ATATCCTCGA AGAGTGGCAG GTTAGCAGTG ATGGTAAAAC GTTTGTCTTC
AAGATAAGAG AAGGGCTGAA GTGGTCTGAT GGTGTGCCTG TCACCACAGA GGACGTGAGA
TTTTGGTATG AAGACATTTT ACTGGATGAA AATCTGACCC CTACGATTCC TTCCTGGCTG
ATCGCTGGAG GTAAACCCTT AAAAGTAGAA ATCGTCGATA AGTGTACATT CAAAGTAAAT
TTCGAAGTCC CCTATCCTCT GTTTCTCTAT CAGCTAGCAT ACCGGGGACA GGGCGGTTAC
GTTTTCGTTG TCCCATCGCA CTATCTGAAA AACTTCCATC CAAAGTATGT CCCGCTTGAA
AAACTGACAC AAATGGCGAA GGAAGAAGGA TACGATTACT GGTGGCAACT TTTCGCTGCG
AAAGGTACCA ATACCAATGC GTGGATTACG AATCCTGAGC TTCCCGTACT CTATCCATGG
AAATTGAAGA AATTGACTGA TTCACAACTC GTCATCGAAA GGAACCCATA CTATTTCAAG
GTGGATCCTG AAGGGAATCA GCTTCCATAC ATAGATGAAA TAGTGTTCTA CAGGATTCAA
GACAAACAGA TGGCGCTCAT GAAAGCTATG GCTGGAGAAA TAGATATGCA AACCAGGCAC
TTTGGAACGG AACAGTTCAC TATATTACTT GAGAACAGGG AAAAAGGTGG CTATAGAGTT
TTGAGATGGG TTTGGGGTGT TGGCAGCATA GTAACGTTCT ATGTGAATCA AAATGTGAAA
GATCCCGTTC TTAGAGAACT CTTCCAGAAT CCAAAGTTCA GATACGCCCT TTCACTGGCG
ATAAACCGAG AAGAAATAGC TACCCTGGTC TTCCACAACC TTGGTGAGCC ACGTCAAGCA
TCACTGATCA CAGGTGTTGC TTTCTACGAT CCTGAATGGG AGAAAGCATA TGCGGAATAT
AACCCTGAGA AGGCGAACGC TCTCTTGGAT GAAATAGGCC TGACAAAGCG AGATGCCGAG
GGTTATAGAA TAAGATCGGA TGGCAAAAGG TTGGAAATAA TAATAGAGTA CTCCGTAACA
GACGCTGTTG TTGACGTACT GGAGATGGTA AAACAGTACT GGGAAAATCT GGGTATCAAG
GTGCTCCTGA AACCTGAGGA ACGATCGCTC TACATGACAA GGTGTGAAGC AGGAGAGCCT
GAAATAGGTG CGTGGTCATT CGACAGATGT GCAGCCGTAT TGAGCGATCC TGGAAGGTTA
CTGGGAACAG TGTGGGATGG CCCATGGGCA CCTCTTTATG CAAGGTGGTA CATTTCCGGT
GGAAAAGCTG GCGAGGAACC ACCAGAAGGC TCAGACATTA GAAGAATCTA CGAGCTTTGG
GACAAAGTAA AAGTAACCGT CGATGAAGAA GAAAGAGACA GACTTTTCAA GGAGCTCATC
AACATTCATA AGAAAAATAT CTTCTTCATA GGAACGGTGG GAGAAGTCCA GATACCTGTC
ATCGTGAAGG ACAATTTCAG AAATGTCCCT GATGGATTAA TCTTTGATCA TCCTCTCTTC
AGTCCAAAGA ATGCCCGACC GGAACAATTC TTCTTTGAAC TGAAATAA
 
Protein sequence
MFRRVLWGLL VVIFAAQILA IGLNKIVPGE YYNLTDYERL TGKKITKFNE APMLKEMVEK 
GLLPPVEERL PKNPVVVTPY EEIGQYGGTW RRVWFGLPDQ PNVDKIAVEK LVMFDKTGGV
ILPNILEEWQ VSSDGKTFVF KIREGLKWSD GVPVTTEDVR FWYEDILLDE NLTPTIPSWL
IAGGKPLKVE IVDKCTFKVN FEVPYPLFLY QLAYRGQGGY VFVVPSHYLK NFHPKYVPLE
KLTQMAKEEG YDYWWQLFAA KGTNTNAWIT NPELPVLYPW KLKKLTDSQL VIERNPYYFK
VDPEGNQLPY IDEIVFYRIQ DKQMALMKAM AGEIDMQTRH FGTEQFTILL ENREKGGYRV
LRWVWGVGSI VTFYVNQNVK DPVLRELFQN PKFRYALSLA INREEIATLV FHNLGEPRQA
SLITGVAFYD PEWEKAYAEY NPEKANALLD EIGLTKRDAE GYRIRSDGKR LEIIIEYSVT
DAVVDVLEMV KQYWENLGIK VLLKPEERSL YMTRCEAGEP EIGAWSFDRC AAVLSDPGRL
LGTVWDGPWA PLYARWYISG GKAGEEPPEG SDIRRIYELW DKVKVTVDEE ERDRLFKELI
NIHKKNIFFI GTVGEVQIPV IVKDNFRNVP DGLIFDHPLF SPKNARPEQF FFELK