Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmel_1820 |
Symbol | |
ID | 5297466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermosipho melanesiensis BI429 |
Kingdom | Bacteria |
Replicon accession | NC_009616 |
Strand | - |
Start bp | 1804569 |
End bp | 1805804 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640770088 |
Product | extracellular solute-binding protein |
Protein accession | YP_001307040 |
Protein GI | 150021686 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.148067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAT TAGTTGTTTT AGGCATTTTA TTGTCACTTT TTGTATTATC GCTTGGAATT ACTACAATTA CTATGACATC CGGTGGTGTG GGAAAAGAAC TTGAAGTTTT ATACGCACAG CTAAAAGAGT TTATGAAGGA AAATCCTGAT ATAGTAGTTA CTGTTATTCC AATGCCAGAT TCTTCTACAG AAAGACATGA TTTGTATGTT ACGTATTTGG CTGCAGGAGA AAGTGATCCT GATGTGTTAA TGTTAGACGT AATTTGGCCT CCAGAATTTG CTCCATTTTT AGAGGATTTA ACAGATGATT ATCAATACTT TGAACTTGAT AAGTTTTTAC CAGGTACAGT TAAATCTGTT ACAGTAATGG GAAGAATTGT TGCAGTTCCA TGGTTTACTG ATGCAGGCCT TTTGTATTAC AGAAAAGATT TACTTGAAAA ATATGGATAC AAAAAACCAC CTGAAACATG GGATGAATTG GTTGAGATGG CAAAAAAGAT TTCACAAGCC GAAGGTATAG AAGGTTTTGT TTGGCAAGGT GCAAGATACG AAGGATTAGT ATGTGATTTT ATGGAATATT TATGGTCTTT TGGAACAGAT GTTTTAAATG AAAATGGAAA TGTTGTAGTC AATAATCCTA AAGCGGTAGA AGCATTGCAA TTTATGGTAG ATTTAATTTA CAAAAATAGA ATTTCTCCTG AAGGTGTAAC AACGTACATG GAAGAAGATG CAAGAAGAAT TTTCCAAAGT GGTAATGCAG TATTTATGAG AAATTGGCCA TATGCATGGT CGCTTGCAAA TTCCGATGAT TCACCTATAA AAGGAAAAGT GGGTATTGCA CCACTCCCAA AAGGTCCTGG TGGTAAACAC GCAGCAACAT TGGGTGGATG GAATTTAGGA ATTAATGCAA ATTCTTCACC AAAAGAAAAA GAAGCTGCAA AGAAATTAAT TAAGTTCCTT ACAAGTCACA ATCAACAATT GTATAAGGCA ATTAATGCAG GGCAAAATCC AACAAGAATG GCCGTATATG AAGAGCCAAA ATTGAAGGAA GTTAATCCAT TTATGGTCGA GTTGTTTGAT GTATTCATTA ATGCTCTTCC AAGACCAAGG GCAGTAAATT ACGCGGAAAT TTCTGATGCA ATTCAAAGAC ATATTCACGC GGCCTTGACA AGACAGGTTA CACCTGAACA AGCTATAAAA GATTTAGAGA AGGAGTTAAA AATGCTTATA AAATAA
|
Protein sequence | MKRLVVLGIL LSLFVLSLGI TTITMTSGGV GKELEVLYAQ LKEFMKENPD IVVTVIPMPD SSTERHDLYV TYLAAGESDP DVLMLDVIWP PEFAPFLEDL TDDYQYFELD KFLPGTVKSV TVMGRIVAVP WFTDAGLLYY RKDLLEKYGY KKPPETWDEL VEMAKKISQA EGIEGFVWQG ARYEGLVCDF MEYLWSFGTD VLNENGNVVV NNPKAVEALQ FMVDLIYKNR ISPEGVTTYM EEDARRIFQS GNAVFMRNWP YAWSLANSDD SPIKGKVGIA PLPKGPGGKH AATLGGWNLG INANSSPKEK EAAKKLIKFL TSHNQQLYKA INAGQNPTRM AVYEEPKLKE VNPFMVELFD VFINALPRPR AVNYAEISDA IQRHIHAALT RQVTPEQAIK DLEKELKMLI K
|
| |