Gene Mchl_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_2049 
Symbol 
ID7118749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp2148572 
End bp2149576 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content69% 
IMG OID643524799 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_002420824 
Protein GI218530008 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.574504 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGGA AAATGATGAG AGCGCTCCTC TGCGCCGGGG CGGCGCTGCT CGGGCTCGCT 
GCGGGCGGCG CGCGGGCAAC GGAGAAGGTG ACGCTCCAGC TCAAATGGGT GCCCCAGGCG
CAGTTTGCCG GCTACTACGT CGCCCAGGCC AAGGGTTTCT ACAAGGAGGC CGGCCTCGAC
GTGACGATCA AGCCGGGCGG GCCCGACGTG GCCCCGCCCC AGGTCATCGC GGGCGGCGGC
GCCGACGTCG TCGTCGATTG GATGCCCTCG GCGCTCGCCT CGCGCGAGAA GGGCGTGCCG
CTCGTCAACA TCGCGCAGCC GTTCAAGAAA TCGGGCCTGA TGCTGACCTG CCGGGCGGAT
ACCGGCATCA AGTCGCCCGC CGACCTGAAG GGACGGACGC TCGGCGTCTG GTACGCCGGC
AACGAATACC CGTTCCTGGC CTGGATGGCC AAGCTCGGCC TCAAGACCGA CGGCTCGCCC
GGCGGCGTGA CGGTGCTGAA GCAGGGCTTC AACGTCGATC CGCTGATCCA ACGCCAAGCC
GACTGCGTCT CGACCATGAG CTACAACGAG TATTGGCAGG TGATCGATGC CGGCTTCAAG
CCGGAGCAGC TCGTCGTCTT CCGCTACGAG GACCAGGGCG TCGCCGCGCT CGAGGACGGG
CTCTACGCCC TTGAATCCAA GCTGAAGGAC AAGGCCTTCG TCGCCCGGCT GGCGAAGTTC
GTGGCGGCCT CCGAGAAGGG CTGGGCCTAT GCCGCCGCGC ATCCGGACGA GGCGGCCGAG
ATCGTGCTGG AGAACGACGC CAGCGGCGCC CAAACCGAGA CGCACCAGAA GCGGATGATG
CGCGAGATCG CCAAGCTGCT CGACACCTCC GGCGGCCGGC TCGACCCCGC CGATTACGAG
CGCACCGTCG CGATCCTGCT CACCGGCGGC ACCGACCAGC CCGTCATCAC CCGCAAGCCC
GAGGGGGCCT GGACGCATGC GGTGACCGCG ACCCTGGGGC AGTAG
 
Protein sequence
MAGKMMRALL CAGAALLGLA AGGARATEKV TLQLKWVPQA QFAGYYVAQA KGFYKEAGLD 
VTIKPGGPDV APPQVIAGGG ADVVVDWMPS ALASREKGVP LVNIAQPFKK SGLMLTCRAD
TGIKSPADLK GRTLGVWYAG NEYPFLAWMA KLGLKTDGSP GGVTVLKQGF NVDPLIQRQA
DCVSTMSYNE YWQVIDAGFK PEQLVVFRYE DQGVAALEDG LYALESKLKD KAFVARLAKF
VAASEKGWAY AAAHPDEAAE IVLENDASGA QTETHQKRMM REIAKLLDTS GGRLDPADYE
RTVAILLTGG TDQPVITRKP EGAWTHAVTA TLGQ