Gene Mboo_1201 details


Gene Information

Locus tag: Mboo_1201
Symbol:
ID: 5411349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1215556
End bp: 1216593
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 52%
IMG OID: 640868427
Product: peptidase M42 family protein
Protein accession: YP_001404362
Protein GI: 154150744
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
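
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS length counts one terminal stop codon that is not translated. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 1215556
end_bp = 1216593

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that yields no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1038 345
```

Under these assumptions the result agrees with the listed Gene Length (1038 bp) and Protein Length (345 aa).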


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.214071
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.0232682
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTAAGG AATTATTACG GAAATTATCC AATGCCCACG GGGTATCGGG AAGTGAAGGC 
AGTGTCTTTG CCTTAATAAA AAAAGAACTC AAAGGCTGTG TTGATGAGAT CACCGAGGAT
CCTATGGGAA ATCTCATTGC AGTCAGGCAT GGTAACAAGT CAAAGGTGAT GCTTGCCGCC
CATATGGACG AGATCGGGCT CATGGTCAAG TATATCGATG ACAAGGGCTT TCTCCGGTTC
ATCACTCTTG GCGGGTGGTA CGGGCCAACA CTCTATAACC AGCGCGTGAT CGTCCATGGG
ACAAAAGGTG ACCGGATCGG TGTCATCGGT GGCAAGCCCC CGCACATGAT GGACGAAGAT
GAACGTAAGA AAGGGGTGAA AACCGACGAC ATGTTCATCG ATATCGGGGC AAAAAACAAG
GACGATGTTG CGGAGCTGGG TATTGAGGTT GGCACGCCGG TAACGATTGA TCGCGAATTT
ACCGAACTCG CGAATAACCG CGTTACCGGT AAGGCATTTG ATAACCGGGC CGGTGTTGCA
ATGCTTATTA AGACCATGCA GAAAATGAAG TCCCCGTTCA CGGTTTACGC CGTCTTTACC
GTGCAGGAAG AAGTGGGTCT CAAAGGGGCA AAGACCAGCG CCTATACAAT CGATCCCGAC
TGGGCCATTG CTACGGATGT TACCATTCCC GGCGATCACC CGGGGATTGA TATGAAGGAT
GCAGCAGTAG AAATGGGAAA AGGGCCGGTT ATCACCATTG TCGACAGCAG CGGAAGAGGA
CTTATTGCCA GCCGCAAGGT TGTCCAGTGG CTGAAAAATG CTGCTGAAAC AAACGCGATT
CCCGTGCAGT ACGAGGTTGG TACCGGGGGG ACAACCGATG CAACCTCTAT TCACCTGTCC
CGCGGCGGCG TGCCAAGTAC CACTATCAGC CCGCCCACAC GGTACATCCA CTCCCCGGTG
GAAGTCCTGG ATACAGGGGA TATAGAAGCC GGTGTTAACC TGCTTGTTGC AGCGCTGAAG
ACAAAGCCTG CGCTTTAA
 
Protein sequence
MVKELLRKLS NAHGVSGSEG SVFALIKKEL KGCVDEITED PMGNLIAVRH GNKSKVMLAA 
HMDEIGLMVK YIDDKGFLRF ITLGGWYGPT LYNQRVIVHG TKGDRIGVIG GKPPHMMDED
ERKKGVKTDD MFIDIGAKNK DDVAELGIEV GTPVTIDREF TELANNRVTG KAFDNRAGVA
MLIKTMQKMK SPFTVYAVFT VQEEVGLKGA KTSAYTIDPD WAIATDVTIP GDHPGIDMKD
AAVEMGKGPV ITIVDSSGRG LIASRKVVQW LKNAAETNAI PVQYEVGTGG TTDATSIHLS
RGGVPSTTIS PPTRYIHSPV EVLDTGDIEA GVNLLVAALK TKPAL
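
The gene and protein sequences above are related by translation table 11, and the GC content field is a simple base count. The following is a minimal sketch of both calculations, not the database's own pipeline. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied with their display spacing.
first_bases = "ATGGTTAAGG AATTATTACG GAAATTATCC"
print(translate(first_bases))  # prints: MVKELLRKLS
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.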