Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0594 |
Symbol | |
ID | 7406935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 668619 |
End bp | 670886 |
Gene Length | 2268 bp |
Protein Length | 755 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643714977 |
Product | Cellulase |
Protein accession | YP_002572493 |
Protein GI | 222528611 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAAAA TTATTTTAAA GTTTTGTGCA CTCATGATGG TAGTGATTTT GATTGTTTCC ATTTTACAAA TATTACCTGT ATTTGCCCAG AGCATACTGT ATGAAAAGGA AAAATATCCA CATCTTCTTG GCAATCAGGT AGTTAAAAAA CCATCGGTTG CCGGCAGACT GCAGATTATT GAAAAGGACG GAAAAAAGTA TTTAGCTGAC CAGAAAGGAG AAATAATTCA GCTTCGTGGT ATGAGTACAC ATGGACTTCA GTGGTATGGT GATATTATAA ACAAAAATGC ATTTAAAGCT CTTTCAAAAG ATTGGGAGTG CAACGTTATA AGGCTTGCGA TGTATGTGGG TGAAGGCGGA TATGCTTCAA ACCCAAGTAT TAAAGAAAAA GTTATAGAAG GGATTAAGCT TGCTATTGAG AATGACATGT ATGTAATTGT TGACTGGCAT GTATTAAATC CCGGTGACCC GAACGCAGAA ATTTATAAAG GGGCAAAAGA CTTTTTCAAA GAGATAGCTA CAAGTTTTCC CAATGACTAT CACATAATAT ATGAACTTTG CAATGAACCA AATCCAAATG AACCGGGAGT AGAAAATAGC TTGGATGGCT GGAAAAAAGT AAAGGCTTAT GCACAGCCCA TCATAAAAAT GCTCAGAAGT TTGGGGAATC AGAACATTAT AATTGTAGGT TCGCCAAACT GGAGTCAGAG ACCTGACTTT GCAATTCAAG ACCCTATAAA TGATAAGAAT GTTATGTATT CAGTTCATTT TTACTCTGGA ACTCACAAAG TTGATGGATA TGTTTTTGAA AACATGAAAA ATGCGTTTGA AAATGGCGTG CCAATTTTCG TGAGTGAATG GGGAACAAGT TTGGCAAGCG GTGATGGTGG ACCGTATCTT GATGAAGCAG ATAAGTGGCT TGAATATTTA AATTCAAACT ATATTAGCTG GGTGAACTGG TCGCTGTCAA ACAAAAATGA GACATCAGCT GCTTTTGTTC CATATATAAA TGGTATGCAT GATGCCACAC CACTTGACCC TGGTGATGAT AAGGTGTGGG ACATAGAAGA GCTTAGTATT TCTGGAGAGT ATGTGAGGGC AAGGATAAAA GGAATTGCTT ATCAGCCAAT TAAGAGAGAT AACAAAATAA AAGAAGGAGA AAATGCACCT TTAGGCGAAA AAGTCTTACC ATCCACGTTT GAAGATGACA CTCGTCAGGG CTGGGATTGG GATGGACCAT CTGGTGTGAA AGGTCCTATT ACTATCGAAA GTGCGAATGG TTCAAAAGCG CTATCTTTTA ATGTTGAGTA TCCAGAGAAA AAACCACAAG ATGGCTGGGC AACAGCTGCA AGGCTTATAC TTAAAGACAT AAATGTAGAA AGGGGAAATA ATAAATATTT GGCTTTTGAT TTTTATTTGA AACCAGATAG GGCTTCAAAA GGTATGATTC AGATATTTTT AGCTTTTTCA CCACCTTCCT TAGGTTACTG GGCTCAGGTA CAAGACAGTT TTAATATTGA CCTTGCAAAA CTGTCAAGTG CAAAAAAGAT AGAAGACAGA ATTTATAAGT TCAATGTATT TTTTGACTTA GACAAGATAC AAGATAATAA AGTACTGAGT CCAGACACAC TCTTGAGAGA TATAATAGTA GTCATAGCAG ATGGCAATAG CGATTTTAAG GGGAAAATGT ATATAGATAA TGTTAGATTT ACCAATATCC TTTTTGAGGA TATCAATTTT GAAAATAGCC TTTATGATGT TATAGACAAG CTTTATTCTA AAGGAATCAT AAAAGGAATT TCAGTATTTA AGTACTTGCC AGATAAAAAC ATTACAAGGG CTGAATTTGC TGCACTTTGT GTCAGGGCAC TGAACCTGAA AATTGAAAAA TACGATGGTA GATTTTCTGA TGTGAAAAGC GGCAACTGGT ATTCAGATGT AGTTTATACG GCGTATAAAA ACAAATTGTT TGAAATAAAA GAGAATAAAT TCTTTCCTGA AAATATTTTA AAAAGAGAAG AAGCAGTAGC TTTGGCAATT GAAGTGTATA AAAGATTGAC TGGTAAGATA GAAGTTAATA CAGACGATGT TCCAATTGCT GATGAAAAAC TTATAAATCC TCAATACAGA GAAAGCGTGA AGTTAGCAAT TAAGCTCGGT ATTGTTGACC TGTATTCAGA CGGAACATTT GAACCAAATA AGAGCGTTTC AAGAGGGGAG GTGGCAACAA TTCTCTATAA TCTCTTGAAC TTAGCAGGCA AGCTATGA
|
Protein sequence | MRKIILKFCA LMMVVILIVS ILQILPVFAQ SILYEKEKYP HLLGNQVVKK PSVAGRLQII EKDGKKYLAD QKGEIIQLRG MSTHGLQWYG DIINKNAFKA LSKDWECNVI RLAMYVGEGG YASNPSIKEK VIEGIKLAIE NDMYVIVDWH VLNPGDPNAE IYKGAKDFFK EIATSFPNDY HIIYELCNEP NPNEPGVENS LDGWKKVKAY AQPIIKMLRS LGNQNIIIVG SPNWSQRPDF AIQDPINDKN VMYSVHFYSG THKVDGYVFE NMKNAFENGV PIFVSEWGTS LASGDGGPYL DEADKWLEYL NSNYISWVNW SLSNKNETSA AFVPYINGMH DATPLDPGDD KVWDIEELSI SGEYVRARIK GIAYQPIKRD NKIKEGENAP LGEKVLPSTF EDDTRQGWDW DGPSGVKGPI TIESANGSKA LSFNVEYPEK KPQDGWATAA RLILKDINVE RGNNKYLAFD FYLKPDRASK GMIQIFLAFS PPSLGYWAQV QDSFNIDLAK LSSAKKIEDR IYKFNVFFDL DKIQDNKVLS PDTLLRDIIV VIADGNSDFK GKMYIDNVRF TNILFEDINF ENSLYDVIDK LYSKGIIKGI SVFKYLPDKN ITRAEFAALC VRALNLKIEK YDGRFSDVKS GNWYSDVVYT AYKNKLFEIK ENKFFPENIL KREEAVALAI EVYKRLTGKI EVNTDDVPIA DEKLINPQYR ESVKLAIKLG IVDLYSDGTF EPNKSVSRGE VATILYNLLN LAGKL
|
| |