Gene Tpet_0485 details


Gene Information

Locus tag: Tpet_0485
Symbol:
ID: 5171252
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 478691
End bp: 480550
Gene Length: 1860 bp
Protein Length: 619 aa
Translation table: 11
GC content: 40%
IMG OID: 640562994
Product: extracellular solute-binding protein
Protein accession: YP_001244085
Protein GI: 148269625
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
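
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(478691, 480550)
print(length)                           # 1860
print(expected_protein_length(length))  # 619
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.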


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGGA AACTTGTTTG GTTATCGTTT TTATTACTGA TATTCACGCT TGCTTTTTCT 
CAAGTAGCCA ATGTGCCAAG GTCGGACTTA CTAATTGTTC AACACTCTCA CGGTAGAGTC
TCTGATCCTG CTAATTGCAA TATTTTCACG TCATCCTGGA GATATCCGGC CAGAGGGCTT
CATCAGTTGA TCGTTAGACC TCTTTGGATG GTTGATCCTG CGAAAATGGA AATTATCAAT
GTTCTTGCAG AAACTAGCCC TATTTACAAC GATGATTTCA CGGAAATGAC TGTTAAACTT
CGCAAAGGGA TATATTGGAG TGATGGAGTG GAATTCACTG CCGATGATGT TGTCTTCGGA
GTGAAATTAA CGATTCAAAA TGAAGGTATG TCTAACCACG TTCAACTCAA AGAATGGGTC
AAAGATGTCG AGGCAAAAGA TAAATACACT GTCGTTTTTA AGCTAAACAA ATCCAATCCT
AGATTCCATT ATTATTTTGT CGACAGATGG GGTTGCTGGA GACCATTCCC CAAACATATA
TTTGAGAAAG TAGAAGATCC CGTTAAATTC AATTTCTATC CACCTGTTGG AACGGGTCCT
TATGTTTTAA AATCCTATGA TCCAAATGGA TATTGGTTTT TGTATGAGAG AAGAGAAGAT
TGGGAAAGAA CACCAGATGG TATACTTTAT GGAATGCCAC AACCTCGCTA TGTGCTCTTC
CAGGCTTATG ATTCTCCTAG CCAAATGATT CTTGCCATGA GACAACACCA ACTGGACGTC
ACATATACAT TTTCTTTGGA AATGGTTAAA TCTTTTTTGA AAATACCTAC TGTTAGAGTC
TTCAGAAAAG ATTTCCCTTG GGGTGAAACA CTAGAACCAA CAGTTACAGG CATTACTCTG
AATACAATGC GCTATCCTTA CAACATTCGT GATGTAAGAT GGGCATTGGT ACTTGCTATA
AACATACTGG AAGTTGCAGA TCTAGTTGCT GATGGGGCTG TTAGATTTGC CCCATTACAT
GTGCCACCAG AACCTGTCTG TGAAAAGTAT TACTATACAG CCTTGAAAGA TTTCTTAGAG
AACTTCACAT TGCAAATTGA CGATCAGACG GTATTCAAAC CTTTTGATAC CACACTACCC
GATAAGCTGG CAGAAATGGC AAGGAAGAAG GGTTATAAAG TCTCCAAAGG TCAACTAGAA
GAGTTGTTTG GCATCGGTTG GTGGAGATAT GCTCCTGACG TAGCAGAAAA ATTACTGAAA
AAACATGGTT TCAAGAGAAA TGAACAGGGA CAGTGGTTGT TACCCAACGG AACCCCATGG
AAGATGGAAA TAATTGTTAA CCCAGACGCA AACAGGCCAG ACAATAGGGT ACCTGCAGCA
ATTGCGCAAC AGTGGAAAAA ATTTGGAATC CAAATTGAAA TAAGACCAAC TTCTGATTCT
ACTGTGCACG CTTATGGAGA ATTTGATGCG TGCAGTGCCT GGCCAGCAGT AGAAACATGG
GGAGGAGTAG CAGATATTTA TAGAACATTA TCTCCTTTCG CGAGCAGATA TCAGCGACCA
ATAGGCGAAT TCAACGCTGG TCATGCTTCC AGATGGTCTG ACCCTAGAAT GGATGAAATA
TTAGAAAAAA TGAAAAAAAC ATCTCCATTT GATCCAGAAA CAATAGAGTT AGGTAAAGAA
GGATTAAAAT TGTTAATTGA AGAAATGCCA AGCATTCCCG CTTTCCAACT GACATGGTTT
GTGATTTACG ATGAATACTA CTGGACAAAC TGGTCAACCG TGGAGAATAT CTACGTACAT
CCTGTCCATA CTTGGCCTAA TTTCGGTTTT GAATTGCCGT ATCTGAAAAG AACAAAATAA
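
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of how the listing could be cleaned and its GC percentage computed, for comparison with the GC content field. The helper names are hypothetical, and only the first row of the listing is used in the example.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked listing and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence listing, copied as printed.
first_row = "ATGAAAAGGA AACTTGTTTG GTTATCGTTT TTATTACTGA TATTCACGCT TGCTTTTTCT"
row = clean_sequence(first_row)
print(len(row))                    # 60
print(round(gc_percent(row), 1))   # GC percentage of this row only
```

Passing the full listing through `clean_sequence` should give a string whose length can be compared with the Gene Length field.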
 
Protein sequence
MKRKLVWLSF LLLIFTLAFS QVANVPRSDL LIVQHSHGRV SDPANCNIFT SSWRYPARGL 
HQLIVRPLWM VDPAKMEIIN VLAETSPIYN DDFTEMTVKL RKGIYWSDGV EFTADDVVFG
VKLTIQNEGM SNHVQLKEWV KDVEAKDKYT VVFKLNKSNP RFHYYFVDRW GCWRPFPKHI
FEKVEDPVKF NFYPPVGTGP YVLKSYDPNG YWFLYERRED WERTPDGILY GMPQPRYVLF
QAYDSPSQMI LAMRQHQLDV TYTFSLEMVK SFLKIPTVRV FRKDFPWGET LEPTVTGITL
NTMRYPYNIR DVRWALVLAI NILEVADLVA DGAVRFAPLH VPPEPVCEKY YYTALKDFLE
NFTLQIDDQT VFKPFDTTLP DKLAEMARKK GYKVSKGQLE ELFGIGWWRY APDVAEKLLK
KHGFKRNEQG QWLLPNGTPW KMEIIVNPDA NRPDNRVPAA IAQQWKKFGI QIEIRPTSDS
TVHAYGEFDA CSAWPAVETW GGVADIYRTL SPFASRYQRP IGEFNAGHAS RWSDPRMDEI
LEKMKKTSPF DPETIELGKE GLKLLIEEMP SIPAFQLTWF VIYDEYYWTN WSTVENIYVH
PVHTWPNFGF ELPYLKRTK
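
The protein sequence can be related to the gene sequence by codon-wise translation. The sketch below builds the codon table from the NCBI amino acid string shared by translation tables 1 and 11 (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The function name and the `to_stop` parameter are illustrative choices, not an existing API.

```python
BASES = "TCAG"
# Amino acid string for NCBI translation tables 1 and 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA sequence in frame 1; unknown codons become 'X'."""
    seq = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        residues.append(aa)
    return "".join(residues)


# First seven codons of the gene sequence above.
print(translate("ATGAAAAGGA AACTTGTTTG G"))  # MKRKLVW
```

The output matches the first seven residues of the protein sequence, and translating the last four codons of the gene (`AGAACAAAATAA`) gives `RTK`, the final three residues followed by the stop codon.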