Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2028 |
Symbol | |
ID | 7408240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2139338 |
End bp | 2140729 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643716394 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_002573878 |
Protein GI | 222529996 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAATACCA CATCAGAACA ACAGAAACAA ATCAATATAG CATACATTGG TGGGGGCTCA CGCGGCTGGG CTTGGAGGTT GATGACTGAT TTAGCTTTAG AGAAGGATTT GAGTGGTACG GTGAGACTTT ACGACATTGA TTTTGAAGCT GCTAAGACAA ATGAAGTGAT AGGCAATAAG CTTTCAAACA AGCCTGAGGT AGTAGGGAAA TGGCAATATG TTGCTGTAAA AAGTTTGGAT GAAGCATTAT ATGGTGCAGA TTTTGTTATC ATTTCAATTT TGCCTGGGAC ATTTGAAGAG ATGTATTCGG ATGTCCATGC TCCTGAGAAA TATGGTATAT ACCAGTCTGT AGGTGACACA ACAGGACCAG GTGGTCTTAT CAGAGGACTT AGGACTGTTC CTATGTATGT TGAATTTGCA GAAGCTATAA AGAGAAATTG TCCAGATGCT TGGGTTATTA ATTACACAAA TCCAATGGCT ATATGCTTAA AAGCTCTATA TGAGGTATTT CCTAAAATAA AAGCTTTTGG CTGCTGTCAT GAAGTTTTTG GTACACAGAA GCTTTTGACA GAGGTTGTAA AAGAATTTTT GGGAGAAGAA AGAGAAATCT CAAGAAGAGA AATTAAGGTG AATGTTCTTG GTATAAATCA TTTTACGTGG TTTGACAAAG CATCATACAA AACTCATGAT TTATTCCCTC TTTACAAAGA ATTTGTGAAC AAGTACTATG AAGAAGGTTT TGAAAAAACA AAAGGATTGT GGGAAAAAGA TTATTTTGCT TCTGCAAATA GAGTAAAGTT TGACTTATTT AAACGCTTTG GACTTATCGC TGCAGCAGGT GATAGACACT TGGCTGAATT TGTTCCTTAC ATTTATCTTA CAGACAAAGA GACAGTTTAT AAATGGAAAT TTAACTTGAC GCCTGTGGAA TGGAGAATTA AGCACAGAGA AGAGTTGATT AAACTTAGCA AAGAATACGC ATCAGACCAA AAAGAAGTTC CATTGAATCC TTCAGGTGAA GAAGGAGTTA TGCAAATGAA AGCTATTTTA GGTTTAGATA CGCTGGTGAC AAACGTGAAT TTGCCAAATA TGGGTCAAAT ACCAAACCTT CCAATAGGAG CTATAGTTGA GACGAATGCA GTATTTACTC ATGATGATGT CAGACCTGTT TATGCAGGAA AGCTGCCTTC AGATTTGGCA AGTATAATGA TTAGGCACAT CAGCAATCAG GAACTTATAG TTAAAGCAGC TTTAGAAAAG GATTTGAACC TTGCAAAAAG AGCATTCTTA AATGACCCTG CAGTTGAAAG ACTACCGCAG AGCAAAGCTG AGCAGCTTTT TGATGAGATG ATAAATAACA CTAAAAAGTA TTTGGCATAT TTAGGCTTGT AA
|
Protein sequence | MNTTSEQQKQ INIAYIGGGS RGWAWRLMTD LALEKDLSGT VRLYDIDFEA AKTNEVIGNK LSNKPEVVGK WQYVAVKSLD EALYGADFVI ISILPGTFEE MYSDVHAPEK YGIYQSVGDT TGPGGLIRGL RTVPMYVEFA EAIKRNCPDA WVINYTNPMA ICLKALYEVF PKIKAFGCCH EVFGTQKLLT EVVKEFLGEE REISRREIKV NVLGINHFTW FDKASYKTHD LFPLYKEFVN KYYEEGFEKT KGLWEKDYFA SANRVKFDLF KRFGLIAAAG DRHLAEFVPY IYLTDKETVY KWKFNLTPVE WRIKHREELI KLSKEYASDQ KEVPLNPSGE EGVMQMKAIL GLDTLVTNVN LPNMGQIPNL PIGAIVETNA VFTHDDVRPV YAGKLPSDLA SIMIRHISNQ ELIVKAALEK DLNLAKRAFL NDPAVERLPQ SKAEQLFDEM INNTKKYLAY LGL
|
| |