Gene Tpet_0322 details

Gene Information

Locus tag: Tpet_0322
Symbol:
ID: 5171102
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 302914
End bp: 304173
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 52%
IMG OID: 640562825
Product: extracellular solute-binding protein
Protein accession: YP_001243927
Protein GI: 148269467
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
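
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the gene record above.
start_bp = 302914
end_bp = 304173

# Assumption: coordinates are 1-based and inclusive,
# so both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is
# not counted as an amino acid in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1260 419
```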


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGAAAT TTCTCGTTGT TTTCATGGTG ATTCTCAGTG TTTTTGCCCT CGCGAAGGTG 
AAAGTCACGT TCTGGCACGC CATGGGCGGG GGACACGGCA AAACTCTCCA AGAGATAGTG
AACACCTTCA ACGAACTTCA CCCGGATATC GAGGTCGAAG CGGTCTACGT TGGAAACTAC
GGTGCACTCT CTCAGAAGCT CCTCGCAGCA GCGCAGGCAG GAGAACTTCC CACGATCGCA
CAGTCTTATT CCAACTGGAC GGCAAAACTC ATCCAGAGCG GTGTCGTGCA GCCTCTGAAC
GAGTTCGTGA ACGATCCGAA GATAGGGCTG ACCAGGGAAG AGTGGGAAGA TGTCTTCAAA
CCTCTGAGAG ACAACTGTAT GTGGGGAGAC ACCGTCTACG CGGTTCCGTT CAACAAGAGT
CTCTACATAC TCTACTACAA CGCGGACGCC TTTGCAATGT ACGGTGTGGA TGTGCCCAAA
ACGATCGATG AACTCTACGA AGCCGCGAGA ATCATGACGG AAGATCTTGA CGGAGATGGG
AAGATCGACC AGTACGGTTT TGGCTTCAGG ACGACCGTTG ACTTCTTCCA GATACTCCTC
ATCCTCCGCG GTGGTTCCAT CCTGAAACAG GTCGACGGCA AATGGGTTTC CAACATCGAC
AGCCAGGAAA CAAGAGAGGT CCTTGCCTTC GTGAAGAAGA TGGTGGACGA TGGTATCGCG
TACTTCCAAG GTGGATACCT CAACGATATC TTCGGTCAGC AGAAGATCAT GATGTACATC
GACACAATAG CGGGAAGGCC CTACGTGGAA AGCTCCACAA AGGGGAAGTT CACCTGGAGC
TGGGCTCCAG TTCCCACCTG GGTGACGAAC AAGGTGCCGT TCGCCGGAAC AGACATCATC
ATGTTCAACA CGGCAAGCGA TGAGGAAAAA CGTGCCGCCT GGGAGTTCAT GAAGTACCTC
ATCTCTCCTG AGGTGACTGC TTACTGGGCG ATCAACACGG GTTACATTCC TGTGAGAAGA
AGCGCCCTCG AAACGTCGAT CTGGAAGGAA GCGGCTAAAT CCGATCCTCT GATCGAAATA
CCTCTGAAGC AGATAGACAA CGCCGTGTTC GATCCACAGA TCGGTGTATG GTACGAGATC
AGAACGGTGG TTGGAAACAT GTTCTCCGAT TTCATCAACG GAAAGGTAGA CATGGAAACT
GCGATAAAGA CGGCGGATCA GAAGATAAGG GAGTATCTCA AGGAAGAGTA CGGCGAGTGA
 
Protein sequence
MKKFLVVFMV ILSVFALAKV KVTFWHAMGG GHGKTLQEIV NTFNELHPDI EVEAVYVGNY 
GALSQKLLAA AQAGELPTIA QSYSNWTAKL IQSGVVQPLN EFVNDPKIGL TREEWEDVFK
PLRDNCMWGD TVYAVPFNKS LYILYYNADA FAMYGVDVPK TIDELYEAAR IMTEDLDGDG
KIDQYGFGFR TTVDFFQILL ILRGGSILKQ VDGKWVSNID SQETREVLAF VKKMVDDGIA
YFQGGYLNDI FGQQKIMMYI DTIAGRPYVE SSTKGKFTWS WAPVPTWVTN KVPFAGTDII
MFNTASDEEK RAAWEFMKYL ISPEVTAYWA INTGYIPVRR SALETSIWKE AAKSDPLIEI
PLKQIDNAVF DPQIGVWYEI RTVVGNMFSD FINGKVDMET AIKTADQKIR EYLKEEYGE
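
The protein sequence can be reproduced from the gene sequence using translation table 11, the bacterial, archaeal and plant plastid code. The following is a minimal sketch rather than the database's own method: the helper name `translate` is hypothetical, the codon table is built from the standard NCBI amino-acid ordering, and an initiator codon such as GTG is assumed to be read as methionine at the first position.

```python
from itertools import product

# Amino acids in NCBI order for bases T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # Assumption: an initiator codon at position 0 is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGAAGAAAT TTCTCGTTGT TTTCATGGTG"))  # prints: MKKFLVVFMV
```

Passing the full gene sequence in the same way should yield the protein sequence listed above, with the terminal TGA codon ending translation.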