Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0849 |
Symbol | |
ID | 7407424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 942006 |
End bp | 943682 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643715227 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002572737 |
Protein GI | 222528855 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.206456 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAATTTT TAAGAAAAAT TTCGATTGTG GTAGCGCTTG TTTTCATTAT TTCCGCGGTG CTGGGTGGGA TTGTACCTGT ATCTTCCCAG AAGGTTGAAG GTGCATCAAA AAAGGTTGTA ACTTTTACAA TGTTTAGTGC AGATGCGACA GTACAGTATC ACCCAGATAT TTTCAGTACT GCTATTGGGC AAGAGATTAC AAAAAGGACA GGCGTAAGAT TGAAAATCGA ACACTTTGTA GGAATGGACC AGGCAACAAA GATATCACTT ATGCTTGCAT CTGGTGATTT ACCAGACTTG GTTTATGGCA GTGGTGAGCA CAAACAATTT ATTCAGAACA AAGCTTTAGT TCCGCTTGAT AACTACATTC AAAAATATGG TCAGTGGACA AAGAAGGCGT ACTCTCAGGC AGATTTGAGG AAACTTCGCC AAGCTGATGG ACATATTTAT TTCTTGAGCT ACACAAGAGG TGAAGTGTCA CCAAGTGCAA GTGGTGAAGG TTTATATGTA ATGATTGACA TGTTACAAAA GAATAACTGG CCAAGATTAA AGTACTGGGA AGATTTGATG CCAATGTTAA GAAATTATGT TAAGAAATAT CCAAAGTACA AAGGTATGCC TGTAATAGGC ATGTCAGCAA TTACAGAAGG CGCAAGATTC TATGTAATTC AAGATCCTGC AACAGGTTTG AATGGTCTTA TAGCAGATAC TGTACAAGTT GATCCGAAAA CATATAAAGC AAGCTATGAC CCTGCAGGAA TAGGAATGTA CAAAGCTTAC AAGGCACTAA ATGCTCTGTG GAATGAAGGT TTGTTTGATA AAGAAGCATT TGTTCAGACA TATGACCAAT GGGCTGCGAA AGTTGCTCAA GGTAGAGTTG TGACAAGCTG GGGAAGGTCA TGGCACTTTA ACACAGCATT CAATACACTA CGAAAGAATG GCGAAGATGA TAGAATCCTT GTTCCATTTG GCATTGTATT TAAAGGGGTT AAAAAATCAA GATATGTGAT GCTTCAGTCA ATTGGAACAA GAGATGGCAT AAGCATTACA AAGAAGTGTA AAGATCCTGT AAGAGCATTC CAGTTCTTAG ACCAAATGCT CAATCCAGAT ATTCAGAAAC TTATGTTCTG GGGTATTAAA GGAAGAGATT ATCTTGTTGA CAATAAAGGT AAGATGTACA GAACACAAGC TATGATTGAC AAAGCAAGAG ACCCTGTTTA CCAGAAACAA GAAGGGCTTG GCTACTGGAA CATCTGGCCA AGATGGCAGC TCAAGCTTCC AGATGGAAAT TATGTAAAAC CTGAACTTGA TCCAGATATT GCATATATGC AATGGGCACC AGCACAAAAG AAAGTGCTTG AAGCATACAA AGCAAAAACA TTTGTTGAAC CACCATTTGC TGATGAACCT GAATGTCCAC CTTGGGGATA TGCATGGGAG ATCAACGTTC CACCTGAAAA GCAAAAAGAA ATCCAGGTTC CACTCAACAT TGCTAACGAC CTTGCAAGGA AGTATATACC AATGCTTATA ATGGCTCCAA AGGGCAAGTA TGATGAGGTA TGGAACAAAT ACAAAGCAGA GGTTAGATCA AAGATTAACA CAAAACCAAT TGAAGAGTTC TATACACAGG AAATGAGACA GAGAATGGCA GATTGGTACG GGATTAAAGT TAAGTAA
|
Protein sequence | MKFLRKISIV VALVFIISAV LGGIVPVSSQ KVEGASKKVV TFTMFSADAT VQYHPDIFST AIGQEITKRT GVRLKIEHFV GMDQATKISL MLASGDLPDL VYGSGEHKQF IQNKALVPLD NYIQKYGQWT KKAYSQADLR KLRQADGHIY FLSYTRGEVS PSASGEGLYV MIDMLQKNNW PRLKYWEDLM PMLRNYVKKY PKYKGMPVIG MSAITEGARF YVIQDPATGL NGLIADTVQV DPKTYKASYD PAGIGMYKAY KALNALWNEG LFDKEAFVQT YDQWAAKVAQ GRVVTSWGRS WHFNTAFNTL RKNGEDDRIL VPFGIVFKGV KKSRYVMLQS IGTRDGISIT KKCKDPVRAF QFLDQMLNPD IQKLMFWGIK GRDYLVDNKG KMYRTQAMID KARDPVYQKQ EGLGYWNIWP RWQLKLPDGN YVKPELDPDI AYMQWAPAQK KVLEAYKAKT FVEPPFADEP ECPPWGYAWE INVPPEKQKE IQVPLNIAND LARKYIPMLI MAPKGKYDEV WNKYKAEVRS KINTKPIEEF YTQEMRQRMA DWYGIKVK
|
| |