Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0165 |
Symbol | |
ID | 7407156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 203747 |
End bp | 205420 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714567 |
Product | alpha amylase catalytic region |
Protein accession | YP_002572090 |
Protein GI | 222528208 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000167663 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTTGC ACAAAAAGTG GTGGAAGGAA GCTGTTGTAT ATCAAATCTA TCCTCGAAGT TTTTATGACT CAAACGGTGA TGGAATTGGA GATTTGCCAG GAATAATAGA AAAACTTGAT TATTTACAAG AACTTGGAGT GGATGTTATT TGGCTAAATC CTATTTACAA ATCCCCAAAT GCAGACAATG GTTATGACAT CAGCGATTAC TATGATATCA TGGACGAGTT TGGTACAATG GAAGACTTTG ATAGGCTCTT GAATGAAGCT CACAAAAGAG GAATTAAAAT TGTAATGGAC CTTGTTGTGA ACCACACATC TGATGAACAC AAATGGTTTT TGGAGTCCAG AAAATCAAAA GACAATCCTT ACAGAGATTT TTATTTTTGG CGGCCTGGTA AAAATGGAGG GCCTCCTAAT AACTGGACAT CCTTTTTCAG CGGCCCTGCT TGGGAATACG ATGAGCTGAC AGGAGAATAT TACCTTCACC TTTTTGCTGT AAAGCAACCA GACCTCAACT GGAACAATCC TCAGGTTCGT CAAGAAATTT ACAAAATGAT GAAGTGGTGG CTTGACAAAG GCATAGATGG TTTTAGAATG GATGTAATAA ACCTCATTTC AAAAGTTGAA GGACTGCCAG ATGACCCAGA TGAAAATCAA GGAGGTTTGA TTGGTTTTAA ATACTATGCC AACGGTCCAA GGGTGCATGA ATATCTTCAG GAAATGAACA GAGAAGTACT TTCTAAATAC GATATCATGA CAGTGGGAGA GACTCCATTT GTAACTCCTG AGATTGCAAA GCTGTATGTA GAGTATGACA GAAATGAGCT TAATATGCTC TTTCATTTTG AGCACATGGA TATGGACTGC AGCGGCAGCA AGTGGAATAT AAAACCTTGG AAACTCACTG ATTTGAAAAA GATAATGTAC AAGTGGTATT TGGCTTTAAA AGACAAGGGC TGGAATTCGC TTTATCTTAA CAACCATGAC CAGCCAAGGA TGGTCTCACG CTTTGGAAAC GACAAAGAGT ACAGGGTTGA GTCTGCAAAG CTTTTAGCAA CCCTGCTTCA CACATGGCAG GGAACTCCTT ATATCTACCA AGGTGAAGAG ATTGGTATGA CAAACTGCAA GTTTGAAAGT ATTGATGAGT TCAGGGACAT TGAAACACTC AACTGGTACA AAGAGATGAA AAAGCTTGGA AAATCAGATG AAGAGCTTTT AGAAATCTTA AACAAAAGAA GCAGAGACCA TGCACGAACA CCTATGCAGT GGGATGATTC TGAAAATGCA GGATTTACAA AAGGTACCCC ATGGATAAAG GTAAATCCAA ATTATAAGGA AATAAATGTT AAAAAAGCTT TAGAAGATAA AAATTCAGTG TTTTATTACT ATAAGAAATT GATTGAACTT AGAAAAAAGC ATCCTGTCAT TGTATATGGG GATGTTCAAA TGCTTTATGA GGACGATGAA AAGATCTTTG CATATACAAG AAGCTATGAA GATGAAAGGC TTCTTGTTGT AATGAACTTT TCTGAAGAAG AAAGCGAGTT TTTAGCACCA AATGAAATAT TTACCCAAAA ACCTGAACTT CTGATAAGTA ACTATGAAAT TGACGACAAT ATCCAAGAAA AAATTGTTTT AAAACCTTAT GAATCGAGGG TATACAAAAT ATAA
|
Protein sequence | MDLHKKWWKE AVVYQIYPRS FYDSNGDGIG DLPGIIEKLD YLQELGVDVI WLNPIYKSPN ADNGYDISDY YDIMDEFGTM EDFDRLLNEA HKRGIKIVMD LVVNHTSDEH KWFLESRKSK DNPYRDFYFW RPGKNGGPPN NWTSFFSGPA WEYDELTGEY YLHLFAVKQP DLNWNNPQVR QEIYKMMKWW LDKGIDGFRM DVINLISKVE GLPDDPDENQ GGLIGFKYYA NGPRVHEYLQ EMNREVLSKY DIMTVGETPF VTPEIAKLYV EYDRNELNML FHFEHMDMDC SGSKWNIKPW KLTDLKKIMY KWYLALKDKG WNSLYLNNHD QPRMVSRFGN DKEYRVESAK LLATLLHTWQ GTPYIYQGEE IGMTNCKFES IDEFRDIETL NWYKEMKKLG KSDEELLEIL NKRSRDHART PMQWDDSENA GFTKGTPWIK VNPNYKEINV KKALEDKNSV FYYYKKLIEL RKKHPVIVYG DVQMLYEDDE KIFAYTRSYE DERLLVVMNF SEEESEFLAP NEIFTQKPEL LISNYEIDDN IQEKIVLKPY ESRVYKI
|
| |