Gene Tpet_0503 details


Gene Information

Locus tag: Tpet_0503
Symbol:
ID: 5171360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 498637
End bp: 499968
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 48%
IMG OID: 640563010
Product: extracellular solute-binding protein
Protein accession: YP_001244101
Protein GI: 148269641
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2182] Maltose-binding periplasmic proteins/domains
TIGRFAM ID:
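
The coordinate and length fields in this table can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated on this page. The function names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 498637
end_bp = 499968
reported_gene_length = 1332      # bp
reported_protein_length = 443    # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by a CDS of gene_len bp, assuming the final
    codon is a stop codon that is not counted as a residue."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions both comparisons agree with the reported values: 1332 bp corresponds to 444 codons, of which the last is the stop codon.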


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000478191
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAGC TTCTACTTGT GATCATCTTG ACCATAGGTC TTTTGATGTT CGCAGAAGAG 
ATCACCATCA CCGCGTGGAC CGTGGGACCC GATAATCCAT CTTACTACAG GTTTGACAAT
CTGAAAACAG CTGCAGAGAG GCTGAACAAG ATTCTGAAGG ACCTTGGAGT CGATCTCACC
GTCAAAGTCG ATGGCTACTT TGATACGACG GATTGGAGTT CTTTCAAGCA GAAGGTTGTC
TTTGGAATCA AGTCTGGTCA GGTAGTGGAT ATCATCTGTT CGGGACACGA CGACATAGGA
GCCTGGGCGA AAGCGGGTTA CATCATTCCT CTAGACGACT ATGTGAAGAA GTACTGGGAC
GAGGAGTACT ATGACTTCAT TCCATCGCTC TGGGAGGCTA CCAAGTACAA AGGAAAGATC
TACGGAATCC CACAGGACAC AGAAGCAAGA CCTTTCTACA TCAACAAGCA GGTCTTGAAA
AAACTCGGCT GGTCGGATGA AGAAATCAAC TCTCTGCCAG ACAGGATCGC CAGGGGAGAG
TTCACCTTTG AAGACTTCAT TGAAGTGGCA AAAGAAGCCG TTGAAAAGGG TCTTGTGGAG
TGGGGACTCT ACCACAGACC GAAAGCAGGT ATCGACTACT ACCAGCTCAT GATCAGCATG
GGTATCGACT TCTACGATGA GGAAAAGGCT GTGTTCGTTT ACAACGTAAA AGAGATGAAA
GACTACTTCA AGCTCCTCTA CGACCTTGCG AACACCTACA AGATTCTTCC AAAGAACATG
ATAGGAACAC CGTGGACCTC TGTTCACAAG GATGTCACGA GCGGAAAGGT CCTGGCATGG
ATGGGTGGTA CATGGAACTG GGCTGAATGG AAGAAAGATT ACGGAAAGAC AGAGGAAGAA
CTGCAGGACA TGTTCATCCT CGCTCCTGTT CCGAAGTTCA AAGGCAGAGG AAGGCCGAAC
ACCCTGTCCC ATCCAATCGT TTACATGATC CCGAGCACGA GTAAATACCC AGACATTGCA
TTCTTGCTCA TAACCTTGGC TTCTGCACCG GAACTCAACA TGAGACACGC TGTTGAGAGC
GGGCACCTGC CGATCAGATG GCAGCAAACG GTGCTTCCTG AGTACACCAA GGAATTTATC
ATGGTGGAAG GAACGAAGCT TTTGCCGTAC TCCGGTTTCA TTCCAAACGA TGACATGTTC
AACCTCTACA CCCAGATCAT CTTCGAAGGA ATGCAGATGG CCGAAAGCGG TATGGATCCA
GAGAAGGTGG CAGAAGACGT TGCAAAAAGG CTGAAGTCCC AGCTGAAAGA TAGGGTTGTG
ATCGTGGAGT GA
 
Protein sequence
MRKLLLVIIL TIGLLMFAEE ITITAWTVGP DNPSYYRFDN LKTAAERLNK ILKDLGVDLT 
VKVDGYFDTT DWSSFKQKVV FGIKSGQVVD IICSGHDDIG AWAKAGYIIP LDDYVKKYWD
EEYYDFIPSL WEATKYKGKI YGIPQDTEAR PFYINKQVLK KLGWSDEEIN SLPDRIARGE
FTFEDFIEVA KEAVEKGLVE WGLYHRPKAG IDYYQLMISM GIDFYDEEKA VFVYNVKEMK
DYFKLLYDLA NTYKILPKNM IGTPWTSVHK DVTSGKVLAW MGGTWNWAEW KKDYGKTEEE
LQDMFILAPV PKFKGRGRPN TLSHPIVYMI PSTSKYPDIA FLLITLASAP ELNMRHAVES
GHLPIRWQQT VLPEYTKEFI MVEGTKLLPY SGFIPNDDMF NLYTQIIFEG MQMAESGMDP
EKVAEDVAKR LKSQLKDRVV IVE
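
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the method used to produce this record. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in its permitted start codons, so a standard codon table is used here. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last blocks of the gene sequence are translated.

```python
# Standard codon table built in TCAG order; the amino-acid assignments
# are the same for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
first_block = "ATGAGAAAGC TTCTACTTGT GATCATCTTG"
# Final 12 bases of the gene sequence, including the TGA stop codon.
last_block = "ATCGTGGAGT GA"

print(translate(first_block))
print(translate(last_block))
```

The two outputs correspond to the first ten residues (MRKLLLVIIL) and the last three residues (IVE) of the protein sequence listed above.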