Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MmarC5_0468 |
Symbol | |
ID | 4929122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcus maripaludis C5 |
Kingdom | Archaea |
Replicon accession | NC_009135 |
Strand | - |
Start bp | 425637 |
End bp | 426689 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640165971 |
Product | cellulase |
Protein accession | YP_001096997 |
Protein GI | 134045511 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0307085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACAC TCGATTATTT AAAGATTCTC GCAACAGAAA AAGGAATTTC TGGAAGAGAA GATAAAGTAA GAGAATACAT GGAAAAAGAA CTTGAAAAAT ACTGTGACAG CATTGAAACA GATAAATTTG GAAATTTAAT TGCAAAAAAA GGATCAACTG GTCCAAAAAT TATGATTGCA TCACACATGG ATGAAATCGG ACTTATGGTT AAGTTCATCG ATGACAAAGG ATTTTTAAAA TTTACAAAAA TTGGTGGAAT TAACGACCAG ATGCTTTTAA ACCAGAAAGT CATCGTTCAC AGCAACGAAG GCGATATTGT TGGTGTTTTA GGTTCAAAAC CACCTCACAA AATGAAAGAA AGCGAAAGAA ACAAGTTAAT TTCTGCTGAA CACATGTTTA TCGACATCGG TGCAAAAAAT AAAGAAGATG CAGAAAAAAT GGGTGTTGAA ATCGGTACTG CGATTTCATT CAAGTCTGAA TTCGACAACC TCGGTGGAAA CGTAGTTTCA TGTAAATCAT TTGACAACAG GGCAGGCTGT GCAGTTGTTT TGAAAACCAT GGAATTACTT AAAGATATGG ATTTAAAATG TCAGGTTTAC GCTGTTGGAA CCGTTCAGGA AGAAGTTGGA TTAAAAGGTG CAAAAACATC TGCATTTGGA ATTAATCCCG ATATTGCATT CGCACTTGAT GTTACAATCT GTGGAGACCA CCCGGGAATT AAATTAGAAG ATGCACCAGT GGAACTTGGA AAAGGCCCTG TTGCTACAAT CGTAGATGCA TCTGGAAGAG GAATTATTAC ACACCCAACA GTTTTAAGAA TGGTTAGAGA TGTTGCAAAA GCAGATGAAA TTCCTGTCCA ATATGAAGTT GGAGAAGGAG GAACTACTGA TGCAACTGCA ATCCACTTAA CAAGAGATGG AATTCCAACA GGAGTTATAT CCGTTCCTTC AAGATACATC CACACACCAG TTGAAGTTAT TGATACAGAA GACCTTGAAA AAACAACCGA ACTCGTTGTT GCGTGCATTA AAAAAGTACA CGAATACTTT TAA
|
Protein sequence | MSTLDYLKIL ATEKGISGRE DKVREYMEKE LEKYCDSIET DKFGNLIAKK GSTGPKIMIA SHMDEIGLMV KFIDDKGFLK FTKIGGINDQ MLLNQKVIVH SNEGDIVGVL GSKPPHKMKE SERNKLISAE HMFIDIGAKN KEDAEKMGVE IGTAISFKSE FDNLGGNVVS CKSFDNRAGC AVVLKTMELL KDMDLKCQVY AVGTVQEEVG LKGAKTSAFG INPDIAFALD VTICGDHPGI KLEDAPVELG KGPVATIVDA SGRGIITHPT VLRMVRDVAK ADEIPVQYEV GEGGTTDATA IHLTRDGIPT GVISVPSRYI HTPVEVIDTE DLEKTTELVV ACIKKVHEYF
|
| |