Gene Information |
Locus tag | Athe_0610 |
Symbol | |
ID | 7406951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 691938 |
End bp | 693851 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643714992 |
Product | glycoside hydrolase starch-binding |
Protein accession | YP_002572508 |
Protein GI | 222528626 |
COG category | |
COG ID | |
TIGRFAM ID | |
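The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


# Values taken from the record above (Athe_0610).
start_bp, end_bp = 691938, 693851
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1914
print(protein_length_aa(length))  # 637
```

Both results match the Gene Length (1914 bp) and Protein Length (637 aa) fields of this record.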
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATGC TATTAAAAAG GTCTCTTGCT CTGCTGGTAA GTATTGTCCT TGTATTTTCG TTGTTCTTAA GTGTTTTTCC ACAGCAAGCA AGAGCACAAG ATACCATAAA AATTGTAGGT AACTGGCAGG ATGCTGGTAA CTGGAATTTT GACAGCTCTA ATATTGTTTT GAGTGAAACC TCAACTCCGG GACTTTATTA TGGTGAGTAT ACTTTTAAAA CTGGTGGTTC TTATGAGTTC AAAGCAGTTA TAAATGGTAG TATATGGTGT ACAGGCGCTC CAAAAGTAGC AGATAATAAT ACTAACATAC CTCTTAATGT TACAGATGGT CAGACAGTTA AGTTTTGGTT CTTCAAAAAT TCTCAACTTG TAATTGACAG CACTCATTTT CCAAATGGGC CTGATAGTTT AGTAGGGCAA AATTCTTTTA AATTTGTAGG AGTAAATAAT GAATGGAACC CTAATGATGG TAAGTATCAA TTTATAAGAG TCCCAGATGC AACATATACC TATTTATATG ATAATAGTTA CAATGATTTT TCGATACCAT ATGGATTTAA GATTATAATA GGTGGTTTTG GGAACCTATC ATGGGCATGG AATGGTGCAA AAGAAGGTTC AGTTGTTAAA TTTAAAGAGG GTGGAGATAA TATTGACCTG AAAGAATTTA AAGAATCTAA TGATAATCTT CTAAAAACAA AATTTTTCCT TGATATCCTA AACGGCTGGC TCTTTACAGA AAAAGATTTG ACAAATATCC AACCAGTAGA TTTTGCAAAT AACGGCTCTG TAATTGGTGG AACTTCAGTT CATCTTACAT GGACACCATA TACACCAAAT GCAAATCCAC TTTCAGCCAA GCTTTACTAC AAAATAATTG ATGTGAGCAA TGATAGCGAG TTGGTTAGTC TCACCGATTA CAGCACAATC CACAAAATAG ATATCCCAAG AGAGTGGATA GGAAAGACAA TTAAAATTAT TGCAAATGCC AAGATAGGTG AAATAACAGG ACCTAATGTT GAGTTTACAT TAAATATTGT TGACTTGCCA CAAAACTTAA TAGTTGAAGC AGCTAACTAT ATAACATACA ACTCTATAGA TGAGACATTT TTAAATTTGG CTCAAGGTGA TACAGTTCAA AACATAACTT CTAACTTCGC AGTTGTTACA TCATATGTTT ACAATGTGGT GTACGAGGGA AATAGTTATC CACTGACATT TAATATTGAC TGGGAGAGCA GTAATTCATC TGTATTAACA ATAAATGGTA GTACAGTTGT GGTTACAAGG CCAACACAAG GTGACCTTCA GGTTGAGTTG AGAGCAAGAG CAAGATTTGG TGCTATTTCA GCAGATGGAC AAAAAATATT TACTCTTACA GTAAAGAAAT TCCTATTAGG AGTAGATGGT GGAATACCTG TTACATTTAA CGTAACAGTT CCAGATTATA CTCCTGATAA TGACAATATT TACATTGCTG GAGATTTTAA GACTGATAAG CTTCCAAAAT GGGACCCTGT AGGTATCAAA TTGATAAAGG TCGGAGATAA AAAATATAGT ATAACAATGT ATCTTCCACC GAATGTAACA ATTGAATATA AATATACCCG CGGAAGTTGG TCAAAAGTAG AAAAGGATGC ATTTGGTAAT GAAATATCAA ATAGAGTTCT GCAGATAAAT AATCAGGCTG TGACAAAAAA TGATGTAGTT GAAGCATTTG CGGACCTTGG AGCTGTCAAG CAAGGTCTTC CAACGGTGAA TTTGGTCATC AATGTTCCTC AACAAACTGT AATTGATACA 
GGTGATGGAG GAAGGGTAAT AAAAGCAGAA GGTGGTGCTA TTTCAGTTCC AAATACTAAG AACACTGTTT TGATAGATAT TCGGAGTGCA TTAAATGCTA TTATATTAAA GTAA
|
Protein sequence | MRMLLKRSLA LLVSIVLVFS LFLSVFPQQA RAQDTIKIVG NWQDAGNWNF DSSNIVLSET STPGLYYGEY TFKTGGSYEF KAVINGSIWC TGAPKVADNN TNIPLNVTDG QTVKFWFFKN SQLVIDSTHF PNGPDSLVGQ NSFKFVGVNN EWNPNDGKYQ FIRVPDATYT YLYDNSYNDF SIPYGFKIII GGFGNLSWAW NGAKEGSVVK FKEGGDNIDL KEFKESNDNL LKTKFFLDIL NGWLFTEKDL TNIQPVDFAN NGSVIGGTSV HLTWTPYTPN ANPLSAKLYY KIIDVSNDSE LVSLTDYSTI HKIDIPREWI GKTIKIIANA KIGEITGPNV EFTLNIVDLP QNLIVEAANY ITYNSIDETF LNLAQGDTVQ NITSNFAVVT SYVYNVVYEG NSYPLTFNID WESSNSSVLT INGSTVVVTR PTQGDLQVEL RARARFGAIS ADGQKIFTLT VKKFLLGVDG GIPVTFNVTV PDYTPDNDNI YIAGDFKTDK LPKWDPVGIK LIKVGDKKYS ITMYLPPNVT IEYKYTRGSW SKVEKDAFGN EISNRVLQIN NQAVTKNDVV EAFADLGAVK QGLPTVNLVI NVPQQTVIDT GDGGRVIKAE GGAISVPNTK NTVLIDIRSA LNAIILK