Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1104 |
Symbol | |
ID | 7409661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 1196600 |
End bp | 1197916 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643715470 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_002572978 |
Protein GI | 222529096 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCAAAA TAGCTATCAT AGGTGCAGGA AGTGGAGTTT TCACAAGGAA CTTGGTAAGA GACATTTTGT CATATCCAGA GCTAAGAAAC TCTACAATAG CGCTTATGGA CATTGACAGT ATAAGGCTTG AATTTATGAG AAAAGCTCTG CAAAAGCTCA TTGAACAGGA AAAGTATCCT ACTAAACTTG AAGCTACAAC TGATAGAAAA GAGGCTTTGA AAGGTGCAAA ATATGTTATT GTCACAATAC AGATTGGAGG TTTAAAACCT TTCGAGTATG ACATTTACAT TCCTCTAAAA TATGGTGTAA AACAGGCAGT TGGTGATACA ATAGGTCCGG GTGGAGTTTT CAGGGCTTTG CGAACAATAC TGGTTTTACT TGACATTGCA AAGGACATGG AAGAGCTGTG TCCTGACGCA CTTTTGCTCA ATTATGTAAA TCCAATGGCA ATGAATTGCT GGGCGTTAAA TAAGGCTACG AATATAAAAA ATGTAGGGCT TTGTCACAGT GTTCAAGGAA CTGCTGAATT TTTAGCAAAA ATTATTGGGG CAAAAATGGA AGAAATTTCA TACTTATGCG CAGGTATAAA CCATATGGCA TGGTTTTTAA AATTTGAGTG GAATGGGAAA GATGCATATC CTCTTATAAG GGAAAAAGCA AGTGACCCCG AAATCTATAC ACAGGATGTT ACAAAATTCG AAATACTAAA ACATTTTGGA TATTATGTAA CAGAGTCAAG TTTTCACATG TCTGAATATG TTCCCTATTT TAGAAAGAGC GACGATTGGA TAAATAAAAT CCATAGAACA CATTCATGGC ACAAAGAACA TTACAATGGT ATGTATCTGC ACTGCTGTTT AGATGCTGCG AAAACTTTGC TTGAAGACCT GAGGAAAATG GCAGAGGCAG ACTACATCGA CCCCAAGAGA AGTAACGAAT ACTGTGCAAC TATCATCCAT TCCATAGAGA CAAATACTCC AGCTGTGATA AATGGTAATG TTGAAAACAA AGGTTTAATA ACAAATCTAC CTGAAGGATG TTGCGTTGAA GTACCATGTT TGGTTGACAA AAATGGTATT CAGCCAACTT ATGTTGGAAA TCTACCACCA CAGCTTGCAG CTTTGAACAG AACAAACATA AACGTTCAAG AGCTAACTGT TCTTGCTGCT TTGACAGGTG ATAGGGAAGC AGTTTATCAT GCAATTATGA TGGACCCTCT CACAAGTGCT GTTTTGGATT TAGACCAGAT ACGCCAGATG GTAGATGAGA TGTTTGAAGC TGAAAAAGAA TGGCTGCCAG AAAAGTTTTA TAGGTAA
|
Protein sequence | MLKIAIIGAG SGVFTRNLVR DILSYPELRN STIALMDIDS IRLEFMRKAL QKLIEQEKYP TKLEATTDRK EALKGAKYVI VTIQIGGLKP FEYDIYIPLK YGVKQAVGDT IGPGGVFRAL RTILVLLDIA KDMEELCPDA LLLNYVNPMA MNCWALNKAT NIKNVGLCHS VQGTAEFLAK IIGAKMEEIS YLCAGINHMA WFLKFEWNGK DAYPLIREKA SDPEIYTQDV TKFEILKHFG YYVTESSFHM SEYVPYFRKS DDWINKIHRT HSWHKEHYNG MYLHCCLDAA KTLLEDLRKM AEADYIDPKR SNEYCATIIH SIETNTPAVI NGNVENKGLI TNLPEGCCVE VPCLVDKNGI QPTYVGNLPP QLAALNRTNI NVQELTVLAA LTGDREAVYH AIMMDPLTSA VLDLDQIRQM VDEMFEAEKE WLPEKFYR
|
| |