Gene Information |
Locus tag | Athe_0187 |
Symbol | |
ID | 7407178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 235084 |
End bp | 236250 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714589 |
Product | glycoside hydrolase family 39 |
Protein accession | YP_002572112 |
Protein GI | 222528230 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3664] Beta-xylosidase |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAA ATATTTATCC AGAAATTAAG TTAGGCAAGA TAAATAAATT CTGGACAAAG TGTGTAGGCA GCTGTCATGC AGCAACTGCC CTGAGGGAGG ATTGGAGAAA GCAGCTGAAA AAGTGTAGGC AAGAACTTGG GTTTAAATAT GTCAGATTTC ATGGCTGGCT TAACGATGAT ATGAGTGTTT GCTTCAGAAA TGATGAGGGC AAACTTTGGT TTTCATTTTT CAACATTGAT TCTATTATTG ACTTTTTATT GGATATTGGA ATGAAACCTT TTATAGAATT GAGCTTTATG CCTGAAGCTT TAGCATCTGG TACAAAGACA GTTTTTCATT ATAAGGGCAA TATAACTCCG CCAAAATCCT ATGAAGAGTG GGGACAACTT GTTAAAGAAC TTACAAGACA TCTTGTAAAA AGATATGGTA AAAATGAGGT AAGAGAGTGG TTTTTTGAGG TATGGAACGA ACCGAATTTG AAGGACTTTT TCTGGGCAGG GACAATGGAG GAGTATTTTG AGCTTTATAA GCATGCAGCG TTTGCAATCA AGATAGTTGA CTCTGAACTG AAAGTAGGAG GACCTGCATC AGCAGTAGAT GCGTGGATTT TGGAGCTGAA AGAGTATTGT CAGAAAAATG GAGTACCAAT AGATTTTATA ACAACACACC AGTATCCAAC AGATTTAGCG TTTAGCACAA GTTCTAATAT GGAAGAAGCC ATGGCAAAAG CAAAAAGAGG TGAGCTTGCT GAAAGAGTAA AAAAGGCTTT GAGTGAAGCA GAACCTTTGC CTTTGTATTA TACTGAGTGG AATAACTCGC CGAGTCCGCG AGACCCGTAC CACGATATTC CATATGATGC AGCATTTATT GTGAAGACTA TAATAGACAT TATTGATTTG CCGCTTGGTT GTTACTCATA CTGGACATTC ACAGATATAT TTGAAGAGTG TGGGCTAAGT TCATTGCCTT TCCATGGAGG GTTTGGACTT TTAAATATTC ACGGTATACC AAAGCCGTCA TATAGAGCAT TTCAAATATT GAATAAATTA GATGGAGAAA AGGTAGAACT TAAAGTTGAA GAAAAAAGTC CAACTGTAGA CTGCATAGCA GCTATAAATG GCAATGAATT AGTTCTTGTT ATTTCAAACC ATAATGTTCC GCCTTAG
Protein sequence | MKINIYPEIK LGKINKFWTK CVGSCHAATA LREDWRKQLK KCRQELGFKY VRFHGWLNDD MSVCFRNDEG KLWFSFFNID SIIDFLLDIG MKPFIELSFM PEALASGTKT VFHYKGNITP PKSYEEWGQL VKELTRHLVK RYGKNEVREW FFEVWNEPNL KDFFWAGTME EYFELYKHAA FAIKIVDSEL KVGGPASAVD AWILELKEYC QKNGVPIDFI TTHQYPTDLA FSTSSNMEEA MAKAKRGELA ERVKKALSEA EPLPLYYTEW NNSPSPRDPY HDIPYDAAFI VKTIIDIIDL PLGCYSYWTF TDIFEECGLS SLPFHGGFGL LNIHGIPKPS YRAFQILNKL DGEKVELKVE EKSPTVDCIA AINGNELVLV ISNHNVPP