Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbar_A0214 |
Symbol | |
ID | 3626305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | + |
Start bp | 250923 |
End bp | 251996 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637699105 |
Product | cellulase |
Protein accession | YP_303778 |
Protein GI | 73667763 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.880444 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.715084 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAAG GCGGAAACCT TAAAAAAATA AAATCTCTGC TTGAAAAATT CACCAATGCT CATGGGATCT CAGGCTTTGA GGACGACATC CGAAAACTCC TTGAAAAGGA ACTTGAACCC TATGTTGATA CCATGCGCAA AGATTGCATG GGAAACCTAA TAGCTCTCAA AAAAGGAAAA GGCCCTTCCA TAATGCTGGC TGCCCATATG GATGAAATCG GGCTTATGGT CAGGTATATT GATGATAATG GCTTCCTCAG GTTTGTCGGG ATCGGAGGAT GGTTTGACCA GACCCTTCTT AACCAGAGAG TTGTACTTCA CGGCAAAAAA GGTCCAATTC CCGGAGTCAT CGGGTCCAAG CCTCCTCATG TAATGAAAGA GGATGACAGG AAAAAGCCCG TGAAGCTGGA CGATATGTTC ATCGATATCG GAGCAAAAGA CAGGGAAGAT GCTGAGAACC TTGGAATTGA GATAGGTACG GCAGTTTCTA TTGACCGGGA CTTTGTGCCT CTGGCAAACG GAAAGATAAC TTCAAAAGCC CTTGACAACC GTGCAGGCGT TGTTATCCTT ATTGAGGTTA TGAAACGGCT TTCCAAACAT AAAGTTGGAG CAAATGTCTA TGCCGTAGGC ACTGTCCAGG AAGAGGTAGG GTTAAAAGGA GCAAGAACCT CTGCCTTTGG GGTTTCTCCA GACCTTGCGC TTGCCCTTGA CACAACTATT CCTGGAGACC ATCCGGGCAT TACTAAAACC GATTCTTGCC TGGAAATCGG GAAAGGCCCT GTAATTACAT TAGCCGATGC GTCCGGAAGA GGCCTTATAG CTCACCCACA GGTTATTAAG TGGCTTAAAG AAACTGCTAC TGAAAATAAG ATACCTTACC AGCTTGGCGT TGGTTCGGGA GGCACAACCG ATGCAACCTC AATACACCTT ACAAAAGAAG GTATCCCTAC AGGTACAGTC AGCATAGCCA CACGATACAT CCATTCACCT GTTGAAGTCC TGGATGTGGC AGATATTGAC GCGTGCGTTT CCCTTATTGT GAAAGCAATA GAAAACGTAG GCAAATATTT CTGA
|
Protein sequence | MEKGGNLKKI KSLLEKFTNA HGISGFEDDI RKLLEKELEP YVDTMRKDCM GNLIALKKGK GPSIMLAAHM DEIGLMVRYI DDNGFLRFVG IGGWFDQTLL NQRVVLHGKK GPIPGVIGSK PPHVMKEDDR KKPVKLDDMF IDIGAKDRED AENLGIEIGT AVSIDRDFVP LANGKITSKA LDNRAGVVIL IEVMKRLSKH KVGANVYAVG TVQEEVGLKG ARTSAFGVSP DLALALDTTI PGDHPGITKT DSCLEIGKGP VITLADASGR GLIAHPQVIK WLKETATENK IPYQLGVGSG GTTDATSIHL TKEGIPTGTV SIATRYIHSP VEVLDVADID ACVSLIVKAI ENVGKYF
|
| |