Gene MmarC5_0468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmarC5_0468 
Symbol 
ID4929122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcus maripaludis C5 
KingdomArchaea 
Replicon accessionNC_009135 
Strand
Start bp425637 
End bp426689 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content37% 
IMG OID640165971 
Productcellulase 
Protein accessionYP_001096997 
Protein GI134045511 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0307085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACAC TCGATTATTT AAAGATTCTC GCAACAGAAA AAGGAATTTC TGGAAGAGAA 
GATAAAGTAA GAGAATACAT GGAAAAAGAA CTTGAAAAAT ACTGTGACAG CATTGAAACA
GATAAATTTG GAAATTTAAT TGCAAAAAAA GGATCAACTG GTCCAAAAAT TATGATTGCA
TCACACATGG ATGAAATCGG ACTTATGGTT AAGTTCATCG ATGACAAAGG ATTTTTAAAA
TTTACAAAAA TTGGTGGAAT TAACGACCAG ATGCTTTTAA ACCAGAAAGT CATCGTTCAC
AGCAACGAAG GCGATATTGT TGGTGTTTTA GGTTCAAAAC CACCTCACAA AATGAAAGAA
AGCGAAAGAA ACAAGTTAAT TTCTGCTGAA CACATGTTTA TCGACATCGG TGCAAAAAAT
AAAGAAGATG CAGAAAAAAT GGGTGTTGAA ATCGGTACTG CGATTTCATT CAAGTCTGAA
TTCGACAACC TCGGTGGAAA CGTAGTTTCA TGTAAATCAT TTGACAACAG GGCAGGCTGT
GCAGTTGTTT TGAAAACCAT GGAATTACTT AAAGATATGG ATTTAAAATG TCAGGTTTAC
GCTGTTGGAA CCGTTCAGGA AGAAGTTGGA TTAAAAGGTG CAAAAACATC TGCATTTGGA
ATTAATCCCG ATATTGCATT CGCACTTGAT GTTACAATCT GTGGAGACCA CCCGGGAATT
AAATTAGAAG ATGCACCAGT GGAACTTGGA AAAGGCCCTG TTGCTACAAT CGTAGATGCA
TCTGGAAGAG GAATTATTAC ACACCCAACA GTTTTAAGAA TGGTTAGAGA TGTTGCAAAA
GCAGATGAAA TTCCTGTCCA ATATGAAGTT GGAGAAGGAG GAACTACTGA TGCAACTGCA
ATCCACTTAA CAAGAGATGG AATTCCAACA GGAGTTATAT CCGTTCCTTC AAGATACATC
CACACACCAG TTGAAGTTAT TGATACAGAA GACCTTGAAA AAACAACCGA ACTCGTTGTT
GCGTGCATTA AAAAAGTACA CGAATACTTT TAA
 
Protein sequence
MSTLDYLKIL ATEKGISGRE DKVREYMEKE LEKYCDSIET DKFGNLIAKK GSTGPKIMIA 
SHMDEIGLMV KFIDDKGFLK FTKIGGINDQ MLLNQKVIVH SNEGDIVGVL GSKPPHKMKE
SERNKLISAE HMFIDIGAKN KEDAEKMGVE IGTAISFKSE FDNLGGNVVS CKSFDNRAGC
AVVLKTMELL KDMDLKCQVY AVGTVQEEVG LKGAKTSAFG INPDIAFALD VTICGDHPGI
KLEDAPVELG KGPVATIVDA SGRGIITHPT VLRMVRDVAK ADEIPVQYEV GEGGTTDATA
IHLTRDGIPT GVISVPSRYI HTPVEVIDTE DLEKTTELVV ACIKKVHEYF