Gene Mbar_A0214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A0214 
Symbol 
ID3626305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp250923 
End bp251996 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content46% 
IMG OID637699105 
Productcellulase 
Protein accessionYP_303778 
Protein GI73667763 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.880444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.715084 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAAG GCGGAAACCT TAAAAAAATA AAATCTCTGC TTGAAAAATT CACCAATGCT 
CATGGGATCT CAGGCTTTGA GGACGACATC CGAAAACTCC TTGAAAAGGA ACTTGAACCC
TATGTTGATA CCATGCGCAA AGATTGCATG GGAAACCTAA TAGCTCTCAA AAAAGGAAAA
GGCCCTTCCA TAATGCTGGC TGCCCATATG GATGAAATCG GGCTTATGGT CAGGTATATT
GATGATAATG GCTTCCTCAG GTTTGTCGGG ATCGGAGGAT GGTTTGACCA GACCCTTCTT
AACCAGAGAG TTGTACTTCA CGGCAAAAAA GGTCCAATTC CCGGAGTCAT CGGGTCCAAG
CCTCCTCATG TAATGAAAGA GGATGACAGG AAAAAGCCCG TGAAGCTGGA CGATATGTTC
ATCGATATCG GAGCAAAAGA CAGGGAAGAT GCTGAGAACC TTGGAATTGA GATAGGTACG
GCAGTTTCTA TTGACCGGGA CTTTGTGCCT CTGGCAAACG GAAAGATAAC TTCAAAAGCC
CTTGACAACC GTGCAGGCGT TGTTATCCTT ATTGAGGTTA TGAAACGGCT TTCCAAACAT
AAAGTTGGAG CAAATGTCTA TGCCGTAGGC ACTGTCCAGG AAGAGGTAGG GTTAAAAGGA
GCAAGAACCT CTGCCTTTGG GGTTTCTCCA GACCTTGCGC TTGCCCTTGA CACAACTATT
CCTGGAGACC ATCCGGGCAT TACTAAAACC GATTCTTGCC TGGAAATCGG GAAAGGCCCT
GTAATTACAT TAGCCGATGC GTCCGGAAGA GGCCTTATAG CTCACCCACA GGTTATTAAG
TGGCTTAAAG AAACTGCTAC TGAAAATAAG ATACCTTACC AGCTTGGCGT TGGTTCGGGA
GGCACAACCG ATGCAACCTC AATACACCTT ACAAAAGAAG GTATCCCTAC AGGTACAGTC
AGCATAGCCA CACGATACAT CCATTCACCT GTTGAAGTCC TGGATGTGGC AGATATTGAC
GCGTGCGTTT CCCTTATTGT GAAAGCAATA GAAAACGTAG GCAAATATTT CTGA
 
Protein sequence
MEKGGNLKKI KSLLEKFTNA HGISGFEDDI RKLLEKELEP YVDTMRKDCM GNLIALKKGK 
GPSIMLAAHM DEIGLMVRYI DDNGFLRFVG IGGWFDQTLL NQRVVLHGKK GPIPGVIGSK
PPHVMKEDDR KKPVKLDDMF IDIGAKDRED AENLGIEIGT AVSIDRDFVP LANGKITSKA
LDNRAGVVIL IEVMKRLSKH KVGANVYAVG TVQEEVGLKG ARTSAFGVSP DLALALDTTI
PGDHPGITKT DSCLEIGKGP VITLADASGR GLIAHPQVIK WLKETATENK IPYQLGVGSG
GTTDATSIHL TKEGIPTGTV SIATRYIHSP VEVLDVADID ACVSLIVKAI ENVGKYF