Gene Tmel_1820 details

Gene Information

Locus tag: Tmel_1820
Symbol:
ID: 5297466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermosipho melanesiensis BI429
Kingdom: Bacteria
Replicon accession: NC_009616
Strand:
Start bp: 1804569
End bp: 1805804
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 35%
IMG OID: 640770088
Product: extracellular solute-binding protein
Protein accession: YP_001307040
Protein GI: 150021686
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
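
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported 1236 bp) and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1804569, 1805804)
print(gene_len, protein_len)  # 1236 411
```

Both values agree with the Gene Length and Protein Length fields of this record.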


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.148067
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGAT TAGTTGTTTT AGGCATTTTA TTGTCACTTT TTGTATTATC GCTTGGAATT 
ACTACAATTA CTATGACATC CGGTGGTGTG GGAAAAGAAC TTGAAGTTTT ATACGCACAG
CTAAAAGAGT TTATGAAGGA AAATCCTGAT ATAGTAGTTA CTGTTATTCC AATGCCAGAT
TCTTCTACAG AAAGACATGA TTTGTATGTT ACGTATTTGG CTGCAGGAGA AAGTGATCCT
GATGTGTTAA TGTTAGACGT AATTTGGCCT CCAGAATTTG CTCCATTTTT AGAGGATTTA
ACAGATGATT ATCAATACTT TGAACTTGAT AAGTTTTTAC CAGGTACAGT TAAATCTGTT
ACAGTAATGG GAAGAATTGT TGCAGTTCCA TGGTTTACTG ATGCAGGCCT TTTGTATTAC
AGAAAAGATT TACTTGAAAA ATATGGATAC AAAAAACCAC CTGAAACATG GGATGAATTG
GTTGAGATGG CAAAAAAGAT TTCACAAGCC GAAGGTATAG AAGGTTTTGT TTGGCAAGGT
GCAAGATACG AAGGATTAGT ATGTGATTTT ATGGAATATT TATGGTCTTT TGGAACAGAT
GTTTTAAATG AAAATGGAAA TGTTGTAGTC AATAATCCTA AAGCGGTAGA AGCATTGCAA
TTTATGGTAG ATTTAATTTA CAAAAATAGA ATTTCTCCTG AAGGTGTAAC AACGTACATG
GAAGAAGATG CAAGAAGAAT TTTCCAAAGT GGTAATGCAG TATTTATGAG AAATTGGCCA
TATGCATGGT CGCTTGCAAA TTCCGATGAT TCACCTATAA AAGGAAAAGT GGGTATTGCA
CCACTCCCAA AAGGTCCTGG TGGTAAACAC GCAGCAACAT TGGGTGGATG GAATTTAGGA
ATTAATGCAA ATTCTTCACC AAAAGAAAAA GAAGCTGCAA AGAAATTAAT TAAGTTCCTT
ACAAGTCACA ATCAACAATT GTATAAGGCA ATTAATGCAG GGCAAAATCC AACAAGAATG
GCCGTATATG AAGAGCCAAA ATTGAAGGAA GTTAATCCAT TTATGGTCGA GTTGTTTGAT
GTATTCATTA ATGCTCTTCC AAGACCAAGG GCAGTAAATT ACGCGGAAAT TTCTGATGCA
ATTCAAAGAC ATATTCACGC GGCCTTGACA AGACAGGTTA CACCTGAACA AGCTATAAAA
GATTTAGAGA AGGAGTTAAA AATGCTTATA AAATAA
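
The GC content field of this record (35%) can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is supplied as text in the same layout (blocks of 10 bases separated by spaces, possibly over several lines). The helper name `gc_fraction` is hypothetical, and the example call uses only the first two blocks of the sequence rather than the full gene.

```python
def gc_fraction(seq_text):
    """Return the fraction of G and C bases in a whitespace-formatted sequence."""
    seq = "".join(seq_text.split()).upper()  # drop spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence: 4 G/C bases out of 20.
print(gc_fraction("ATGAAAAGAT TAGTTGTTTT"))  # 0.2
```

Passing the full gene sequence as one string gives a value that can be compared with the reported GC content.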
 
Protein sequence
MKRLVVLGIL LSLFVLSLGI TTITMTSGGV GKELEVLYAQ LKEFMKENPD IVVTVIPMPD 
SSTERHDLYV TYLAAGESDP DVLMLDVIWP PEFAPFLEDL TDDYQYFELD KFLPGTVKSV
TVMGRIVAVP WFTDAGLLYY RKDLLEKYGY KKPPETWDEL VEMAKKISQA EGIEGFVWQG
ARYEGLVCDF MEYLWSFGTD VLNENGNVVV NNPKAVEALQ FMVDLIYKNR ISPEGVTTYM
EEDARRIFQS GNAVFMRNWP YAWSLANSDD SPIKGKVGIA PLPKGPGGKH AATLGGWNLG
INANSSPKEK EAAKKLIKFL TSHNQQLYKA INAGQNPTRM AVYEEPKLKE VNPFMVELFD
VFINALPRPR AVNYAEISDA IQRHIHAALT RQVTPEQAIK DLEKELKMLI K
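
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds the codon table from the compact NCBI-style amino acid string, whose assignments for translation table 11 are the same as in the standard table; the two tables differ in which codons may act as alternative starts, which this sketch does not model. The names `CODON_TABLE` and `translate` are hypothetical, and the input is assumed to be in frame.

```python
from itertools import product

# Codons enumerated in TCAG order, matched to the compact amino acid string
# ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGAAAAGAT TAGTTGTTTT AGGCATTTTA"))  # MKRLVVLGIL
```

The last line of the gene sequence translates the same way to DLEKELKMLIK, ending at the TAA stop codon, which matches the end of the protein sequence above.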