Gene Information |
Locus tag | Athe_0228 |
Symbol | |
ID | 7407219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 276624 |
End bp | 278459 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714628 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_002572151 |
Protein GI | 222528269 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | [TIGR01577] oligosaccharide amylase |
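The coordinates and lengths reported above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the convention that reproduces the reported gene length) and assumes the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(276624, 278459)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1836 611
```

Both values match the Gene Length (1836 bp) and Protein Length (611 aa) fields of the record.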
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.159353 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAGC CACATATTAT AGAAGCTATA ATTGGGAACA CAAAGGTTTT AGGGCAGCTT GATTCAAATG GCATATTGCA AAGGTTTTAT TGGCCTGCAG TAGATTATTA TCAGCAGCTA AAACTCTTTT TGGCAGCTGT TTTTTTGGAT GGGCTTGTAT TTTTCGAGGA TGAAAATTTC AAGATAAAAA GTGGATTTGT GGATGACTTT GTGTACTTTT TTGAATATAA AATTGCAGAC AAGACAATTT TTCAGCTTGA CTTTGTTGAC TTTGAAACAG ATAGCTTGGT TCGTTTATGG GAAACTGGCT TCGAAGACTT CTATGTCTTT TTAGAACCCA TGATAAATTC TTCAAGTCTT TTTAATGCTG CAAAGGTTGA TAAGGAAAAT GAAATAGTCT ATGCATATTT TAAAGGGACA TATATAGGTC TTGCTTTTGA GAATAAGATA AAAAGCTTTA CAGTTAAAAA CGGAATTGAT GATGCAAACG ATAATCAACT GGAAGGCTGG AATGAAGCTA CAAATCCTCA GATTGCCGTA AAACTTAAAA ATACAGGAAA GGTTGTATGT TTTCTTGCTT TTGGGAACTC AAAAGATGAA ATCTATCAAA AGCTTTCTTA TTTAAAGCAA AAAGGGTATG ACGAAGTTTA CAGGCAAAAC AAAGCCTTTT GGGAAAAAAA ATTCTCAAAA GTAAAGCTCA TTTGCACACA AGACCCAAAA GATATGCAGC TTCAGAAAAG AAGTGCATAT GTATTTTATG TACTGCAGAA CTCCAAAACA GGTGGAATTT TAGCTGCATC AGAGGTTGAC GAGAAGTTTT TCCACTGTGG CGGGTATGGT TTTGTCTGGG GAAGAGACGC TGCGTTTATA GTATCTGCAA TGGATGAGCT TGGGCTCTCA AGGGAGGTTG AAAAATTTTT TGGATTCAAA TTTTCTTGTC AGGAAAAGGA AGGATTCTGG GACCAGAGAT ATTACACAGA TGGCAGCTTA GCTCCAAGTT GGGGAATTCA GATTGATGAG ACAGCTTCTG TTGTGTGGGG ATTTTTAGAA CATTGCGAGA AGCAAAATTC TCTTCATTTG ATTGATTTGC ATAAAGAACA GCTCAAAAAA GCACTGCTGT TTTTGATAGC TGCTGTGGAT AGCGAAAAGG GAGTTATCTT TAGAAGCTTT GACCTGTGGG AAGAAAGAGA AGGAATTCAT CTTTACTCAA ATGCAAGCAT ATATGCAGCG CTAAAGAAAG CCAAAAAATA TTTTCCTGAG CTTGAAAGTG AAATTGAAAA GAAGCTAAAG GCAATAAAAA ATCAGATGGC AACAAGATTT TACAGTCCTA AACTTTCCCG GTATGTAAGG TCAACAGATG TTAGAATTCC ACATGAGGAA TTTTTAAAGC TTCCTGAAGA GAACAGGTAC ATGCAAAAAG ATGAGAGATA TGAGATAACC TATTATTTCA AAAAGCAAGA TGAAGTTGTT GACATTTCAA TGCTTGGCAT TTATTATCCT TTTGAAATGG TAGATAGCAG CGATAAGGCT TTCAAAGCAA CCATTTTGGC TATTGAAAGG GAGTGTCAAA ATTCAATTGT CGGGGGCTAC AAGAGATACT CTGATGACAG ATACATTGGT GGAAATCCAT GGATACTGAC AACACTCTGG CTTGCAATTT ACTACAAAAA AACAGGGCAG ATTGACAGGG CAGAAAAACT TTTTGAGTGG GCAAAAGCGC ACAGTTTGCC AAACGGACTT TTTCCAGAGC AGGTTGACAG AATAACAGGA AAGCCTGCAT GGGTTGTTCC TTTAGCATGG 
TCTCATGCAA TGTATGTGCT GTATCTTTAT GAATAA
|
Protein sequence | MRKPHIIEAI IGNTKVLGQL DSNGILQRFY WPAVDYYQQL KLFLAAVFLD GLVFFEDENF KIKSGFVDDF VYFFEYKIAD KTIFQLDFVD FETDSLVRLW ETGFEDFYVF LEPMINSSSL FNAAKVDKEN EIVYAYFKGT YIGLAFENKI KSFTVKNGID DANDNQLEGW NEATNPQIAV KLKNTGKVVC FLAFGNSKDE IYQKLSYLKQ KGYDEVYRQN KAFWEKKFSK VKLICTQDPK DMQLQKRSAY VFYVLQNSKT GGILAASEVD EKFFHCGGYG FVWGRDAAFI VSAMDELGLS REVEKFFGFK FSCQEKEGFW DQRYYTDGSL APSWGIQIDE TASVVWGFLE HCEKQNSLHL IDLHKEQLKK ALLFLIAAVD SEKGVIFRSF DLWEEREGIH LYSNASIYAA LKKAKKYFPE LESEIEKKLK AIKNQMATRF YSPKLSRYVR STDVRIPHEE FLKLPEENRY MQKDERYEIT YYFKKQDEVV DISMLGIYYP FEMVDSSDKA FKATILAIER ECQNSIVGGY KRYSDDRYIG GNPWILTTLW LAIYYKKTGQ IDRAEKLFEW AKAHSLPNGL FPEQVDRITG KPAWVVPLAW SHAMYVLYLY E
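The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons, which this sketch does not handle; the gene here starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first 30 bases and the last 9 bases of the gene sequence are used as inputs, copied from the record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); assumed valid for table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_30_bases = "ATGAGAAAGC CACATATTAT AGAAGCTATA"
last_9_bases = "TATGAATAA"
print(translate(first_30_bases))  # MRKPHIIEAI
print(translate(last_9_bases))    # YE
```

The two outputs agree with the first ten residues and the last two residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.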