Gene Tpet_1534 details


Gene Information

Locus tag: Tpet_1534
Symbol:
ID: 5170727
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 1522251
End bp: 1523543
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 52%
IMG OID: 640564061
Product: extracellular solute-binding protein
Protein accession: YP_001245118
Protein GI: 148270658
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
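
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the gene record above.
start_bp = 1522251
end_bp = 1523543
gene_length_bp = 1293
protein_length_aa = 430


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1293
print(expected_protein_length(gene_length_bp))  # 430
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.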


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 1.45285e-09
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAAATC TTTTCAAGGG GGTGTTCAAG GTGAGAAAGT GGTTGTTTTT CATGGTTCTT 
CTGATCGTTG CGGGTCTCAT GTTCGGAAAG GTGAACTTCG CATCCACACA GATGACGCCC
GCTGCTGAGA GGGAGTTCAT GCTCAACAAA CTCGTGGAAT TCTCGAAGAA GACCGGTATC
GGTGTGGAGT TTCTCAACTT CGAGTATCCA CAGCTCTACA GCAGGCTCCA GGCGGAGATC
AGAGCCGGTA AAAATACGCT GAACCTGATT GCAGACCTCC AGGGAAACCT CTACATAATG
GCTTCCGAAG GATTTCTGAG CGATCTCAAG GATCTCAAAT TCGAAGGAAA AACCTTCATC
GAGACGCTTG AGAAGTTCGC TTATGTGAAA GGTGAAAAGG TGTTCATTCC CTGGCTCCAG
GCAACTTACG TGATGGCCGT TAACAAAAAG GCGTTTGACT ACCTGCCGCG CGGTCTTTCG
AAAGAAGACG TCATCAGGGG GACGGAGAAG TGGACTTACG ACGCTCTGCT CGAGTGGGCA
AAGAACATCT ATGAGAAGAC GAAACAACCC CTTCTTGGCT TCCCGATCGG ACCGAAGGGA
CTCTGGCACA GGTTCCTCCA CGGCTACATC TATCCATCCT TCACGGGAGC GCAGGCTCTG
AAGTTCGACA GTGTGAGGGC CGTTGAAATG TGGAACTATC TGAAGGAGCT CTTCAAATAC
GTACATCCGG CAAGCTCCAC CTGGGACGGG ATGGCCGATC CTCTCCTGAG AGAAGAAGTC
TGGATCGCCT GGGATCACAC TGCAAGACTC AAACCCGCGA TCGTTGAAAA GCCTAACGAT
TTCGTTGTTG TACCGGTCCC AAGAGGGCCG ATGGGTAGAG GGTACATCAT AGTGCTTGTG
GGTCTTGCCA TACCGAAGGG AGCGGATTTC GAGGAACCCG CGAAAGTGAT AGACTTCCTC
ACTTCTCCGG AGATGCAGGT TGAAATCCTC AAGAACGTCG GTTTCTTCCC TGTGGTTCAG
GAGGCTGTCG GTGCCGTGCC AGAAGGTGCC CTCAGGGTGC TCGCGGAAGG TGTGATAAAT
CAGTCTGCCA CGAAGGACTC CGTCGTTTCC TTCATACCGA GTCTTGGATC AAAGAGCGGA
GAGTTCACCG AAACCTACAG GATGGCCTTC ACGAGGATCG TCTTCCAAGG TGAAGACCCA
GCGAAGGTAG TGAAGGAACT CGGTGAGCGA ATCAGACAGC TGTTCAAAGA ATCCGGAGCG
GAACTTCCAG AACCCGACGC GAGCCTCTTC TGA
 
Protein sequence
MKNLFKGVFK VRKWLFFMVL LIVAGLMFGK VNFASTQMTP AAEREFMLNK LVEFSKKTGI 
GVEFLNFEYP QLYSRLQAEI RAGKNTLNLI ADLQGNLYIM ASEGFLSDLK DLKFEGKTFI
ETLEKFAYVK GEKVFIPWLQ ATYVMAVNKK AFDYLPRGLS KEDVIRGTEK WTYDALLEWA
KNIYEKTKQP LLGFPIGPKG LWHRFLHGYI YPSFTGAQAL KFDSVRAVEM WNYLKELFKY
VHPASSTWDG MADPLLREEV WIAWDHTARL KPAIVEKPND FVVVPVPRGP MGRGYIIVLV
GLAIPKGADF EEPAKVIDFL TSPEMQVEIL KNVGFFPVVQ EAVGAVPEGA LRVLAEGVIN
QSATKDSVVS FIPSLGSKSG EFTETYRMAF TRIVFQGEDP AKVVKELGER IRQLFKESGA
ELPEPDASLF
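
The gene sequence starts with TTG, yet the protein sequence starts with M. Under translation table 11, TTG is one of several alternative start codons, and an initiating codon is read as methionine. The sketch below illustrates this on the first and last 10-base groups of the gene sequence. It is a hand-written helper, not a library routine; the `translate` function, the compact codon table, and the start-codon set are assumptions written out here for illustration.

```python
from itertools import product

# Standard amino-acid assignments, codons ordered with bases T, C, A, G.
# Table 11 uses the same assignments but allows extra start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, is_gene_start=True):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiating codon is read as methionine regardless of its usual meaning.
        if i == 0 and is_gene_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence.
print(translate("TTGAAAAATC TTTTCAAGGG GGTGTTCAAG"))  # MKNLFKGVFK

# Last line of the gene sequence; it is in frame and ends with the stop codon TGA.
print(translate("GAACTTCCAG AACCCGACGC GAGCCTCTTC TGA", is_gene_start=False))  # ELPEPDASLF
```

Both results match the corresponding ends of the protein sequence listed above.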