Gene Information |
Locus tag | Athe_0598 |
Symbol | |
ID | 7406939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 674788 |
End bp | 676128 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643714981 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002572497 |
Protein GI | 222528615 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
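The coordinates and lengths listed in this record are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based inclusive coordinates (the convention under which the reported numbers agree) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(674788, 676128)
print(length)                  # 1341
print(protein_length(length))  # 446
```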
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.764573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGACT CATTCAATAA AAGGCGTTTA TTATCAATAA GTTTTTTAGT ACCACTTGTA ACTACTCTGG TGCTAATAAT TGTACTCATT TTGAATACGC AAAAGACAAT AGAAGAAAGT GTAACTATTG AAAATGAAGA GTTAGAAGTT AATACGAAAA TAAGATTTTT AAGTCCATGG GGTGGGAGTG ACCCTTATGC TGAAACACTC TCGTTTGTAC TTCAAAAGTT TCAGGAAGAG AATCCTGGTG TAACTATTGT AAACGAATCT CTGTTTGGTG ATGATTTTTT GATAAAACTT CAGACAGATT TTGCATCTGG AAACCCTCCT GATGTTTTTG GTCTTTTTCC AGGTTCTGTT AGAGATTTAC TAATTAAAAG AAAACAAATA GCTGAATTAA CAAATATATT AAAAAAAGAT GTAAAGTGGT ATCAGAGTTT TTATTCTAAC ATGTGGAAAT ATGTGACTTT CAATGGTAAA ATTTATGGCG TTCCGCTTGA AACAATAGTT GAGTGCCTAT TTGTTAACAA GGATATATTT GAAAAATACA ATTTAAAAGT TCCTCAAACG TTAGATGATT TGATAAGTGT CTCAAAAATA CTAAAGAGCA AAGGAATAAT TCCCATTGCT TTCAATGCGC AGCCAGAAGG AACATATATA TACCAAAACA TTATTGTTTC TATAGGGACA AAGTATGAAG TTGAAAACCC TATCAAAAAC GGTGAATTTT CCTTACCTTA CATAAAAGCT CTTGACTATT TAAAAGTTCT ATACAAGGCA GGTGCATTTC CAGCAAACTA CTATTCACTT ACAAGTAAAC AGCGAAATGA TTTGTTTTTG ACAAAAAAAG CTGCAATGAT AGTTCAGGGT TCATGGTTCA TACCAAAGTG TGATCCAAAA ACAGTTGACA TATACATTTT TCCTCAGGCT AATGAGAAAG GCAAAAAACA TTTAATTTAC GGTCTTGGTG CAGGAACGTT TTATGTCAGC AGTCAAGCAT GGCAAGACAT AGAAAAAAGA AACAGTGCAA TAAAACTTTT AAAATTTTTG TCTTCTGAAA AGATTGCAAG AATATTTGTG GAAAGAACGG GATTAATTTC AAATGTAAAG ATAAAGAATC CGCCAAATGT CAAAAATTCT CTGCGTTCAA AGGTGGAAGG GCTTATAAAA GAAGCTGATG TGCTTGTTGC ACCGCCTGAC CATTTTGTTG ATAGAATGGT TTGGGAAGAG GTAATAACAA AAAATATTCC TTATTATCTA CAAGGAACCA TTTCTTCAAA GTTATTTTGG GCAAGAGCAG TCAAGGCGTG GAAAGAGAAT ATGGAGAAAT TAGGTGAATG A
|
Protein sequence | MNDSFNKRRL LSISFLVPLV TTLVLIIVLI LNTQKTIEES VTIENEELEV NTKIRFLSPW GGSDPYAETL SFVLQKFQEE NPGVTIVNES LFGDDFLIKL QTDFASGNPP DVFGLFPGSV RDLLIKRKQI AELTNILKKD VKWYQSFYSN MWKYVTFNGK IYGVPLETIV ECLFVNKDIF EKYNLKVPQT LDDLISVSKI LKSKGIIPIA FNAQPEGTYI YQNIIVSIGT KYEVENPIKN GEFSLPYIKA LDYLKVLYKA GAFPANYYSL TSKQRNDLFL TKKAAMIVQG SWFIPKCDPK TVDIYIFPQA NEKGKKHLIY GLGAGTFYVS SQAWQDIEKR NSAIKLLKFL SSEKIARIFV ERTGLISNVK IKNPPNVKNS LRSKVEGLIK EADVLVAPPD HFVDRMVWEE VITKNIPYYL QGTISSKLFW ARAVKAWKEN MEKLGE
|
| |
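The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of working with them follows: it strips the spacing, computes GC content as a percentage, and translates the nucleotide sequence. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard code; the table's alternative start codons are not handled, which does not matter here because the gene begins with ATG. All names in the block are hypothetical helpers.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: codons enumerated over T, C, A, G,
# with the first base varying slowest and the third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGAATGACT CATTCAATAA AAGGCGTTTA"))  # MNDSFNKRRL
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the listed protein sequence and GC content.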