Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmel_0596 |
Symbol | |
ID | 5296780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermosipho melanesiensis BI429 |
Kingdom | Bacteria |
Replicon accession | NC_009616 |
Strand | - |
Start bp | 628161 |
End bp | 629333 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640768858 |
Product | extracellular solute-binding protein |
Protein accession | YP_001305848 |
Protein GI | 150020494 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00439649 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAC TATTGGTATC TCTTTTAATG ATTATTATGG TTTTTAGTGT TTTTGCTACA AAGATTACTA TTTGGACATC AGAGGCACAA GCACCTATTT TAACAAAACT TGCTGATGAA TTCAAGTCTA TTTACGGTAT TGATGTAGAA GTTGTTCAAG TTAATTTTAA TGATATAAAA TCCAAGTTTT TAACTGCGGC ACCAGCTGGT GAAGGTGCAG ATATTATAGT TGGTGCACAC GACTGGGTAG GAGAATTAGC AGCAAATGGT CTTTTAGAAC CAATTCCTGT TCTTCCTGAA AAAGACAAAT ACTTACCAAC TCCACTTAAA GCATTTACTT ACAACGGAAA GTTATACGGT ATACCGTATG CATTTGATGG ACCAGCTTTG ATTTACAACA AAGATTATGT TGAAGAACCA CCAAAGACAT TTGATGAATT AATTGGACTT GCTAAACAAA TTCAAGATGA ATATGAAGGA GAAGTAAGAG GATTAGTTTA TGACTTTAAA AACTTCTACT TCAGCAGCTA TGCAATTTTT GGTTTTGGAG GATACGTTTT TGGTGAAAAA GATGGAAAGA TTAATGTAAG AGATATTGGA CTTGCAAATG AAGGAGCAGT AAAAGGGTTA ACTTTGATTA AAAAATTAGT TGATGAAGGT CTTCTTGAAT CTGGAGATAA CTATAATGTT ATGGATAATA TGTTCAAAGA TGGTCAAGCT GCTATGATAA TTAATGGGCC ATGGGCAGTG CCTGGATGGA AAGAAGCAGG TATCGATTTT GGGGTTGCAC CAATTCCAGA ACTTGAACCA GGCGTAAAAC CAAAACCATT CTTTGGTGCA CAGGGATTTA TGGTAAATGC AAAATCTCCA AACAAACTTT ATGCAATTGA ATTTTTGACC AAGTTTATTG CTACAAAGGA TGTAATGTAC AGAATATATC TTGCCGATCC AAGAGTTCCA TCTAGGAAAG ACTTATTACC AATGGTTGAT GAGGTAACAG CTGCATTTGC AGATTTTCTT GGAAATTATG GACTTCCAAT GCCAAATGTT CCAGAAATGG CCGGTGTATG GGGGGCAATG GGCGATGCAC TTTCTAAAGT TCTTGACCAA GGAGTTCCAG TCGAACAAGC TTTAAAAGAA GCTGTAAATA CAATTTTAGC AGGAATTAAG TAA
|
Protein sequence | MKKLLVSLLM IIMVFSVFAT KITIWTSEAQ APILTKLADE FKSIYGIDVE VVQVNFNDIK SKFLTAAPAG EGADIIVGAH DWVGELAANG LLEPIPVLPE KDKYLPTPLK AFTYNGKLYG IPYAFDGPAL IYNKDYVEEP PKTFDELIGL AKQIQDEYEG EVRGLVYDFK NFYFSSYAIF GFGGYVFGEK DGKINVRDIG LANEGAVKGL TLIKKLVDEG LLESGDNYNV MDNMFKDGQA AMIINGPWAV PGWKEAGIDF GVAPIPELEP GVKPKPFFGA QGFMVNAKSP NKLYAIEFLT KFIATKDVMY RIYLADPRVP SRKDLLPMVD EVTAAFADFL GNYGLPMPNV PEMAGVWGAM GDALSKVLDQ GVPVEQALKE AVNTILAGIK
|
| |