Gene Tpet_0636 details


Gene Information

Locus tag: Tpet_0636
Symbol:
ID: 5170374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 639529
End bp: 640776
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 43%
IMG OID: 640563143
Product: extracellular solute-binding protein
Protein accession: YP_001244232
Protein GI: 148269772
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
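
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes a single 3-bp stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 639529
end_bp = 640776
gene_length_bp = 1248
protein_length_aa = 415

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one 3-bp stop codon that is not translated.
coding_codons = (gene_length_bp - 3) // 3

print("span matches gene length:", span_bp == gene_length_bp)
print("codons match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions both comparisons hold for this record.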


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000210277
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGAAGT TACTGGTATT TCTGGTAGTT CTTGTTTTAG CTCTTCCACT CATAGCCAAG 
ATTCAAATTA CGTTCATGAC GCCACTCTCC GGTGCTGATG GAGCGTATAT GGATCAGATC
ATTCAGAAGT TCAACGAAAC ACATCCTGAT ATTGAGATTG TTCATCTTGT CGTAGGAAGT
TCTCTGGAAT ACAAACAAAA GCTTGCCACA GGTATTTCCA CGAAATCTGC TCCCCAGGTT
CTGTTTATTA GAAAACATGA CATGCCGCTG TTTCTTGATC ACTTCAGAAC CTTCACAAAA
GAAGAACTCC AACAGTGGGG TATCGATATC GATGATATTT ATCCCTCTGT CCTTGAAGGA
CTTGTAACAA AAGACGGTAA GTATTATGGA ATACCAATTG ACGTCTGGAT TTTCTACATG
GCTTACAGGA AAGACAATTT CAAAAAAGCT GGTCTTGATC CAGACCTTCC ATTGAAGGAA
GGGCCACTCA ACAGCGAACA GTTTGTAAAC GTTCTGAGAG CTCTCAGAAA AGTCACACCA
GAAGGTTCAT TCCCATGGTG TGAGTCTCCA AGCTGGGATT GGGAATTTGT ACATTTGCTG
TGGCAGTTTG GTGGAGATAT TCTGACACCT GACTTCAAGC GTCCTGCATT CAAAGAAGCT
GGTATAAAAG TTCTCAAATT CCTCCAGGAA CTTCAAAAAG AAGGATTGTA TCCTGATCAA
CCTATCGATG CAGGGCCAAC CTTTGAGTCT GGAGCAGGTT CTATCTTGAT AACAGGTATC
TGGACAATCA ATCCATGGCT TGATCTGCTT GGAAATGACT TTGGTTACGC ACCAGCTCCT
CAGCTTGGAA CAACAAAATC CGTGTTTGGT GGTTCACATG TGATCGCAAT TCCAAAGGTC
ATGGTGGAAG ATGAAAAGAC CTTCAACGCC GTGATGACTT GGGTTAAGTA TCTGTGGGAT
CACGCAATCG AATGGTATGC GGCTGGTCAG ACACCCGCCA GGAAATCCAT AGCTGAGAGC
GAAGAATTTA AAGAAAAGTT CCCACATCTG TACGTCGCTG CTCAACAGGT ATCTTATGTT
AAAACCTTCC AGATGTTCCC GTACATAGCT GAGATCCTTG CCGAGATAGT GCCATACATT
GAAGAAGTGC TTATCAATAA GAGCATGACG CCTGAGGAAG CAATGGAGGA AGCCGAAATG
GTTGCTCAGG AAATAATTGA TGATTACTGG GCAACAGTTG GAGAATGA
 
Protein sequence
MRKLLVFLVV LVLALPLIAK IQITFMTPLS GADGAYMDQI IQKFNETHPD IEIVHLVVGS 
SLEYKQKLAT GISTKSAPQV LFIRKHDMPL FLDHFRTFTK EELQQWGIDI DDIYPSVLEG
LVTKDGKYYG IPIDVWIFYM AYRKDNFKKA GLDPDLPLKE GPLNSEQFVN VLRALRKVTP
EGSFPWCESP SWDWEFVHLL WQFGGDILTP DFKRPAFKEA GIKVLKFLQE LQKEGLYPDQ
PIDAGPTFES GAGSILITGI WTINPWLDLL GNDFGYAPAP QLGTTKSVFG GSHVIAIPKV
MVEDEKTFNA VMTWVKYLWD HAIEWYAAGQ TPARKSIAES EEFKEKFPHL YVAAQQVSYV
KTFQMFPYIA EILAEIVPYI EEVLINKSMT PEEAMEEAEM VAQEIIDDYW ATVGE
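
Both sequences above are printed in blocks of 10 letters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup together with a simple GC-percentage helper; the function names `clean_sequence` and `gc_percent` are hypothetical, and only the final line of the gene sequence is used here as an illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-letter blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# Illustration only: the last line of the gene sequence above, not the full gene.
last_line = "GTTGCTCAGG AAATAATTGA TGATTACTGG GCAACAGTTG GAGAATGA"
fragment = clean_sequence(last_line)

print(len(fragment))              # number of bases in this fragment
print(fragment.endswith("TGA"))   # whether the fragment ends in a TGA stop codon
print(round(gc_percent(fragment), 1))
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would give a value to compare with the GC content field in the record.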