Gene Tpet_0954 details


Gene Information

Locus tag: Tpet_0954
Symbol:
ID: 5171032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 982252
End bp: 983487
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 49%
IMG OID: 640563472
Product: extracellular solute-binding protein
Protein accession: YP_001244548
Protein GI: 148270088
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
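
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 982252
END_BP = 983487


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1236, matching "Gene Length"
print(protein_length(length_bp))  # 411, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.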


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGT ACTTTGTTCT GTTGCTAGCA GTTCTTCTGG TTGGTGGACT CTTCGCTGTG 
AAAATCACTA TGACATCTGG AGGGGTCGGA AAGGAACTCG AGGTACTGAA AAAGCAGCTG
GAGATGTTCC ACCAGCAGTA CCCAGATATC GAAGTGGAAA TCATTCCGAT GCCGGACAGT
TCAACTGAAA GGCACGATCT CTACGTCACG TACTTTGCCG CCGGAGAGAC GGATCCAGAC
GTTCTCATGC TCGATGTGAT ATGGCCTGCT GAGTTTGCTC CGTTCCTTGA AGATCTGACA
GCAGACAAAG ACTACTTCGA ACTCGGTGAA TTCCTACCCG GAACTGTGAT GTCTGTCACG
GTCAATGGAA GAATCGTTGC TGTTCCCTGG TTCACAGATG CAGGTCTCCT TTACTACAGA
AAAGACCTCC TCGAGAAATA CGGTTACGAT CACGCTCCGA GAACCTGGGA TGAACTCGTC
GAAATGGCAA AGAAGATCTC TCAGGCTGAA GGCATCCACG GATTCGTCTG GCAGGGTGCA
AGATACGAAG GCCTTGTCTG TGATTTCCTT GAATACCTCT GGTCTTTCGG TGGGGATGTG
CTCGATGAGA GTGGAAAAGT TGTGATCGAT TCTCCAGAAG CTGTTGCGGC TCTTCAGTTC
ATGGTCGATC TCATCTACAA GCACAAAGTC ACTCCTGAAG GAGTTACCAC CTACATGGAA
GAAGACGCAA GAAGAATCTT CCAGAACGGA GAAGCTGTTT TTATGAGGAA CTGGCCGTAC
GCCTGGTCCC TCGTGAACAG CGACGAATCC CCAATCAAAG GAAAGGTTGG AGTTGCTCCT
CTTCCAATGG GTCCTGGTGG AAGAAGAGCT GCCACACTCG GTGGGTGGGT CCTCGGTATA
AACAAATTCT CGTCACCTGA AGAAAAGGAA GCCGCAAAGA AGCTCATAAA GTTCCTCACA
AGTTACGACC AGCAGCTCTA CAAAGCGATC AACGCCGGAC AGAATCCAAC GAGAAAAGCC
GTTTACAAAG ATCCAAAACT CAAAGAAGCT GCTCCGTTCA TGGTTGAACT TCTCGGAGTT
TTCATCAACG CTCTTCCAAG ACCAAGGGTT GCGAACTACA CAGAAGTTTC CGATGTCATT
CAGAGGTACG TGCACGCTGC TCTGACAAGA CAGACAACAC CAGAAGACGC AATAAAGAAC
ATTGCAAAAG AGCTCAAATT CCTGCTTGGA CAGTAA
 
Protein sequence
MKKYFVLLLA VLLVGGLFAV KITMTSGGVG KELEVLKKQL EMFHQQYPDI EVEIIPMPDS 
STERHDLYVT YFAAGETDPD VLMLDVIWPA EFAPFLEDLT ADKDYFELGE FLPGTVMSVT
VNGRIVAVPW FTDAGLLYYR KDLLEKYGYD HAPRTWDELV EMAKKISQAE GIHGFVWQGA
RYEGLVCDFL EYLWSFGGDV LDESGKVVID SPEAVAALQF MVDLIYKHKV TPEGVTTYME
EDARRIFQNG EAVFMRNWPY AWSLVNSDES PIKGKVGVAP LPMGPGGRRA ATLGGWVLGI
NKFSSPEEKE AAKKLIKFLT SYDQQLYKAI NAGQNPTRKA VYKDPKLKEA APFMVELLGV
FINALPRPRV ANYTEVSDVI QRYVHAALTR QTTPEDAIKN IAKELKFLLG Q
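
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any calculation. The sketch below shows one way a GC content figure like the one in the Gene Information section might be computed from the gene sequence. It is a minimal illustration, not the database's own method: `normalize_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record, as formatted on the page.
first_line = "ATGAAGAAGT ACTTTGTTCT GTTGCTAGCA GTTCTTCTGG TTGGTGGACT CTTCGCTGTG"

seq = normalize_sequence(first_line)
print(len(seq))                       # number of bases after removing the block spacing
print(f"{gc_fraction(seq) * 100:.0f}%")  # GC content of this 60-base fragment only
```

Passing the full gene sequence through `normalize_sequence` in the same way would give the GC fraction for the whole gene rather than for this fragment.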