Gene Tpet_0118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_0118 
Symbol 
ID5171235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp111909 
End bp113171 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content47% 
IMG OID640562619 
Productextracellular solute-binding protein 
Protein accessionYP_001243723 
Protein GI148269263 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAGT TTTTGGTCAT TCTCATGGTA GTTCTTCTCG CAGTTCTGGC ACTGTCCAAA 
ACCAAGATAG TCTTCTGGAC CATGTCGTTG AAACCGACCT TCACAGATTT CATTCAGGGA
ATCATCGACA GGTATGAAGA GTTGAACCCG GATGTTGAAA TCGTCTGGGA AGATGTTCCA
TGGGACGTTC TCCAGCAGAA GCTTCTTGCG GCCTTCTCTT CTGGAAATCC ACCCGATGTT
GTGAACCTGA ACGCTCAGTG GACCATCGAA TTCGCTCAGA AGAAAGTCCT GTTCCCCTTG
AACGATTTGC TACCCGAAGA AGTTATCAAC CAGTACTTCG ACAACATGAT CAAAGGACTC
ACTTGGAAAG ACGGAATTTA TGGAATTCCC TGGTACACAG CTGTGGACGT GATATTCTAC
AACAAAGAGA TCTTCGAAAA AGCTGGACTG GATCCGAAGT ATCCACCTCG AACCTGGGAT
GAAATACTCC TCTACTCAGT TTTGATCAAG GAAAAAACGG GAAAATACGG TGCTCTTCCT
ACGATCTTCC AAGATCCCTC TGCGATCTTC AACTGGGACG GATTGAATCT CTACACGGTG
GATGAAAACA ACAGAATAAA AGAAGTGCTC TTCGACAGAC CGGAATACGC TCACACTCTC
AACAAATGGG CCACTCTCTA CAAACAGAAG TACATCCCGA GTGAAATCGT CCAGGGTGGA
GAATGGACGA GAGCAACAGA ACTCTATCAG GCTGGAGAAC TCGCCATGTT GATCACTGGT
GTTCAGTTCG CGGACAGAGT GAAATGGAAC GCTCCGGAAA TATACGAAAA ATCCGATGTT
GCTCCTATTC CAGCTCCAAA ACCGGGTGTG AGAATGAGTG GATGGTACTC AACTCTGAAC
GTAGTCAGAG GATCCAAGAA TCCTAAGGAA GCCGCTAAAT TCGCAGCGTT CGTTGCAAAC
CTCGAGAACC AGATCGCATT CTGTAAGCTC GTGACCATAT TCCCGACTCT CAAAGCAGCG
GTGAACGATC CGTGGTTCTC AAAAGACGAT GGAACGCTCG CTGCCAAAGC CAGGATCATG
GGAGCCAAGT ATCTTGAGAA CATCACGTTC TACAACGATG ACATACCATT CAGAAAAGAA
GCGTTCGACA GACTGAAGGA TGCCATTATT CAGGTGTTCC TTGGACAGAA AGATCCCGAA
ACGGCGCTCA AAGAGACCGC GAAGTACTGG AGATATCTCA TTCAGACTCA GCAATCGAAA
TAA
 
Protein sequence
MRKFLVILMV VLLAVLALSK TKIVFWTMSL KPTFTDFIQG IIDRYEELNP DVEIVWEDVP 
WDVLQQKLLA AFSSGNPPDV VNLNAQWTIE FAQKKVLFPL NDLLPEEVIN QYFDNMIKGL
TWKDGIYGIP WYTAVDVIFY NKEIFEKAGL DPKYPPRTWD EILLYSVLIK EKTGKYGALP
TIFQDPSAIF NWDGLNLYTV DENNRIKEVL FDRPEYAHTL NKWATLYKQK YIPSEIVQGG
EWTRATELYQ AGELAMLITG VQFADRVKWN APEIYEKSDV APIPAPKPGV RMSGWYSTLN
VVRGSKNPKE AAKFAAFVAN LENQIAFCKL VTIFPTLKAA VNDPWFSKDD GTLAAKARIM
GAKYLENITF YNDDIPFRKE AFDRLKDAII QVFLGQKDPE TALKETAKYW RYLIQTQQSK