Gene Mchl_1937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_1937 
Symbol 
ID7116752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp2005382 
End bp2006377 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content73% 
IMG OID643524701 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002420728 
Protein GI218529912 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGACG ATACCTCCGA ACCAAGCCAC CACGCCCGGC CCCGGCGAAA GCCCGCGCGG 
GCCGAAGATC CCGCTCCGCG AACGGTCGGC TTCGTGCTGG TGCCGGACTT TCCGCTGATG
GCCTACACCG CGGCGGTCGA GCCCCTGCGC GCCGCCAACA CCCTGTCGGG CTGCGAACTC
TATCGCTGGT GGCACGCCGC GCCGGGCGGC GGCGTGGTGC AGGCCTCGAA CGGGCTCGGC
ATCCTCACCG ATGTCGCCGT CGGCGCGCGT GCCAGCGCCG ACCGCGTCTT CGTCTGCGCC
GGCGGAAACC CGGCCGAGTT CGACGATCCC TCCCTGTTCG CGTGGCTGCG CGGGCTCGCC
CGCCACGGGG CGACGCTCGG CGGCATTTCC GGCGGGCCGT ATCTCCTCGC CCGCGCCGGT
CTGCTCTCCG GGCGCCGTTG CACGCTGCAC TGGGAGCACG TCCCGGCCTT CGAGGAGCGC
TACCCCGAGA TCGAGGTGGT CCGCTCGCTG TTCGAGATCC AGGGCGACCG CATCACCTGC
TCGGGTGGCA TCGCCGCGCT CGACCTGATG CTCGACCTGA TCGGCCGCGA CCACGGCGCC
GGCCTCGCCG CGGGCGTCAG CGACTGGTTC CTGCACAACC AGATCCGTGA GGGGTTGAGC
CCGCAACGGA TGGATCTGCG CCAGCGCTTC GGCGTGCGCG ACCCCCGGCT GCTGCGGGTG
CTCGCGGCAA TGGAGGCGAA TCTCGAAGCG CCGGTCCCGC GCGTGGCCCT GGCCGATCTC
GCCCGCGTCT CGGTGCGGCA GTTGGAGCGG CTGTTTCGCG AGGGGCTCGG GCGCGGCCTC
CACCGGCATT ACCTGCATCT GCGCCTCGAC CGGGCGCACC AGCTCGGCCG CGAGAGCGCC
TTGAGCCGCG CCGAGATCGC GGCCGCGACC GGCTTTGCCA ACGCCGACGA ACTCGCGCGC
GCCGAGCGGC GGCGGCACCG GCAGGCAGAG GCCTGA
 
Protein sequence
MSDDTSEPSH HARPRRKPAR AEDPAPRTVG FVLVPDFPLM AYTAAVEPLR AANTLSGCEL 
YRWWHAAPGG GVVQASNGLG ILTDVAVGAR ASADRVFVCA GGNPAEFDDP SLFAWLRGLA
RHGATLGGIS GGPYLLARAG LLSGRRCTLH WEHVPAFEER YPEIEVVRSL FEIQGDRITC
SGGIAALDLM LDLIGRDHGA GLAAGVSDWF LHNQIREGLS PQRMDLRQRF GVRDPRLLRV
LAAMEANLEA PVPRVALADL ARVSVRQLER LFREGLGRGL HRHYLHLRLD RAHQLGRESA
LSRAEIAAAT GFANADELAR AERRRHRQAE A