Gene Athe_0594 details


Gene Information

Locus tag: Athe_0594
Symbol:
ID: 7406935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 668619
End bp: 670886
Gene Length: 2268 bp
Protein Length: 755 aa
Translation table: 11
GC content: 36%
IMG OID: 643714977
Product: Cellulase
Protein accession: YP_002572493
Protein GI: 222528611
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
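
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(668619, 670886)
print(length_bp, protein_length(length_bp))  # prints "2268 755"
```

Under those assumptions, the values reproduce the 2268 bp gene length and 755 aa protein length listed in the record.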


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGAAAA TTATTTTAAA GTTTTGTGCA CTCATGATGG TAGTGATTTT GATTGTTTCC 
ATTTTACAAA TATTACCTGT ATTTGCCCAG AGCATACTGT ATGAAAAGGA AAAATATCCA
CATCTTCTTG GCAATCAGGT AGTTAAAAAA CCATCGGTTG CCGGCAGACT GCAGATTATT
GAAAAGGACG GAAAAAAGTA TTTAGCTGAC CAGAAAGGAG AAATAATTCA GCTTCGTGGT
ATGAGTACAC ATGGACTTCA GTGGTATGGT GATATTATAA ACAAAAATGC ATTTAAAGCT
CTTTCAAAAG ATTGGGAGTG CAACGTTATA AGGCTTGCGA TGTATGTGGG TGAAGGCGGA
TATGCTTCAA ACCCAAGTAT TAAAGAAAAA GTTATAGAAG GGATTAAGCT TGCTATTGAG
AATGACATGT ATGTAATTGT TGACTGGCAT GTATTAAATC CCGGTGACCC GAACGCAGAA
ATTTATAAAG GGGCAAAAGA CTTTTTCAAA GAGATAGCTA CAAGTTTTCC CAATGACTAT
CACATAATAT ATGAACTTTG CAATGAACCA AATCCAAATG AACCGGGAGT AGAAAATAGC
TTGGATGGCT GGAAAAAAGT AAAGGCTTAT GCACAGCCCA TCATAAAAAT GCTCAGAAGT
TTGGGGAATC AGAACATTAT AATTGTAGGT TCGCCAAACT GGAGTCAGAG ACCTGACTTT
GCAATTCAAG ACCCTATAAA TGATAAGAAT GTTATGTATT CAGTTCATTT TTACTCTGGA
ACTCACAAAG TTGATGGATA TGTTTTTGAA AACATGAAAA ATGCGTTTGA AAATGGCGTG
CCAATTTTCG TGAGTGAATG GGGAACAAGT TTGGCAAGCG GTGATGGTGG ACCGTATCTT
GATGAAGCAG ATAAGTGGCT TGAATATTTA AATTCAAACT ATATTAGCTG GGTGAACTGG
TCGCTGTCAA ACAAAAATGA GACATCAGCT GCTTTTGTTC CATATATAAA TGGTATGCAT
GATGCCACAC CACTTGACCC TGGTGATGAT AAGGTGTGGG ACATAGAAGA GCTTAGTATT
TCTGGAGAGT ATGTGAGGGC AAGGATAAAA GGAATTGCTT ATCAGCCAAT TAAGAGAGAT
AACAAAATAA AAGAAGGAGA AAATGCACCT TTAGGCGAAA AAGTCTTACC ATCCACGTTT
GAAGATGACA CTCGTCAGGG CTGGGATTGG GATGGACCAT CTGGTGTGAA AGGTCCTATT
ACTATCGAAA GTGCGAATGG TTCAAAAGCG CTATCTTTTA ATGTTGAGTA TCCAGAGAAA
AAACCACAAG ATGGCTGGGC AACAGCTGCA AGGCTTATAC TTAAAGACAT AAATGTAGAA
AGGGGAAATA ATAAATATTT GGCTTTTGAT TTTTATTTGA AACCAGATAG GGCTTCAAAA
GGTATGATTC AGATATTTTT AGCTTTTTCA CCACCTTCCT TAGGTTACTG GGCTCAGGTA
CAAGACAGTT TTAATATTGA CCTTGCAAAA CTGTCAAGTG CAAAAAAGAT AGAAGACAGA
ATTTATAAGT TCAATGTATT TTTTGACTTA GACAAGATAC AAGATAATAA AGTACTGAGT
CCAGACACAC TCTTGAGAGA TATAATAGTA GTCATAGCAG ATGGCAATAG CGATTTTAAG
GGGAAAATGT ATATAGATAA TGTTAGATTT ACCAATATCC TTTTTGAGGA TATCAATTTT
GAAAATAGCC TTTATGATGT TATAGACAAG CTTTATTCTA AAGGAATCAT AAAAGGAATT
TCAGTATTTA AGTACTTGCC AGATAAAAAC ATTACAAGGG CTGAATTTGC TGCACTTTGT
GTCAGGGCAC TGAACCTGAA AATTGAAAAA TACGATGGTA GATTTTCTGA TGTGAAAAGC
GGCAACTGGT ATTCAGATGT AGTTTATACG GCGTATAAAA ACAAATTGTT TGAAATAAAA
GAGAATAAAT TCTTTCCTGA AAATATTTTA AAAAGAGAAG AAGCAGTAGC TTTGGCAATT
GAAGTGTATA AAAGATTGAC TGGTAAGATA GAAGTTAATA CAGACGATGT TCCAATTGCT
GATGAAAAAC TTATAAATCC TCAATACAGA GAAAGCGTGA AGTTAGCAAT TAAGCTCGGT
ATTGTTGACC TGTATTCAGA CGGAACATTT GAACCAAATA AGAGCGTTTC AAGAGGGGAG
GTGGCAACAA TTCTCTATAA TCTCTTGAAC TTAGCAGGCA AGCTATGA
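
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation such as the GC content reported in the record. The sketch below shows one way to do that; `clean_sequence` and `gc_content` are hypothetical helper names, and the sample input is only the first line of the sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, as formatted in the record.
first_line = "ATGAGGAAAA TTATTTTAAA GTTTTGTGCA CTCATGATGG TAGTGATTTT GATTGTTTCC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_content(seq):.0%}")
```

Running the same functions on all lines of the gene sequence, joined together, gives a value that can be compared with the GC content field.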
 
Protein sequence
MRKIILKFCA LMMVVILIVS ILQILPVFAQ SILYEKEKYP HLLGNQVVKK PSVAGRLQII 
EKDGKKYLAD QKGEIIQLRG MSTHGLQWYG DIINKNAFKA LSKDWECNVI RLAMYVGEGG
YASNPSIKEK VIEGIKLAIE NDMYVIVDWH VLNPGDPNAE IYKGAKDFFK EIATSFPNDY
HIIYELCNEP NPNEPGVENS LDGWKKVKAY AQPIIKMLRS LGNQNIIIVG SPNWSQRPDF
AIQDPINDKN VMYSVHFYSG THKVDGYVFE NMKNAFENGV PIFVSEWGTS LASGDGGPYL
DEADKWLEYL NSNYISWVNW SLSNKNETSA AFVPYINGMH DATPLDPGDD KVWDIEELSI
SGEYVRARIK GIAYQPIKRD NKIKEGENAP LGEKVLPSTF EDDTRQGWDW DGPSGVKGPI
TIESANGSKA LSFNVEYPEK KPQDGWATAA RLILKDINVE RGNNKYLAFD FYLKPDRASK
GMIQIFLAFS PPSLGYWAQV QDSFNIDLAK LSSAKKIEDR IYKFNVFFDL DKIQDNKVLS
PDTLLRDIIV VIADGNSDFK GKMYIDNVRF TNILFEDINF ENSLYDVIDK LYSKGIIKGI
SVFKYLPDKN ITRAEFAALC VRALNLKIEK YDGRFSDVKS GNWYSDVVYT AYKNKLFEIK
ENKFFPENIL KREEAVALAI EVYKRLTGKI EVNTDDVPIA DEKLINPQYR ESVKLAIKLG
IVDLYSDGTF EPNKSVSRGE VATILYNLLN LAGKL
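
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates this on the first and last codons of the gene. It builds the codon table by hand and assumes that table 11 assigns the same amino acids to codons as the standard code (differing only in permitted start codons); `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

# Codon table in TCAG order; assumed identical to the standard code
# for amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last nine bases of the gene sequence.
print(translate("ATGAGGAAAA TTATTTTAAA GTTTTGTGCA"))  # prints "MRKIILKFCA"
print(translate("AAGCTATGA"))  # prints "KL"
```

The first call reproduces the opening block of the protein sequence, and the second shows the final two residues followed by the TGA stop codon.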