Gene Mchl_0621 details

Gene Information

Locus tag: Mchl_0621
Symbol:
ID: 7115437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 615735
End bp: 617735
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 66%
IMG OID: 643523414
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_002419471
Protein GI: 218528655
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
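
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is assumed to be a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(615735, 617735)
print(length, protein_length(length))  # prints: 2001 666
```

Under these assumptions the result agrees with the listed gene length (2001 bp) and protein length (666 aa).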


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.06387
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACGA AGGCAACCGA GCGGGACGAT GCGGACGCCG CCCAGGATCA ACCGACGGAC 
GGCCCGCTCC TTGACCTGAC GGATGCCGCC GTCAAGCGGA TGGTCAAGCT CGCGAAGAAG
CGCGGCTACG TCACCTACGA AGAGATCAAC GAGGTCCTCC CGGACGGTCA GGTCGACGGC
GACCAGATCG AGGACGTGCT CGCGCAGCTC GACGACATGG GCATCCGCGT GGTCGAGGCG
GAGGAATCCG AGGAGGCTGC CCCCGAGGCC AAGGCCAACG GCGAAGCCTC CGAGGAGGAG
GAGAGCACGG AGGGCGGAGA GGTCGCCGAG ACTTCCGGCA CCCGCGCCGT CGCGGTAGCG
ACCCCCACCA CCCGCGAGCC GACCGACCGC ACGGACGATC CCGTGCGGAT GTACCTGCGC
GAGATGGGCT CGGTGGAGCT GCTCTCGCGT GAGGGCGAAA TCGCGATCGC CAAGCGCATC
GAGGCCGGCC GCGAGGCGAT GATCGCAGGT CTCTGCGAGA GCCCGCTGAC CTTCCAGGCC
ATCATCATCT GGCGCGACGA ACTCGTCGAC GGCAAGGTGC TGCTGCGCGA CATCATCGAT
CTCGAAGCCA CCTATGCCGG CCCCGATGCC CGCGGCGCGC CGCAGGAAGC GGAAGCCGAG
GGTGAAGAGT CGGAGGAGGC CGAAGGCGCC GTCCCGCCGG AAGGCGCCGA CGACGAAGAC
GACATGGAGA ACAACGTGTC GCTTGCGGCC ATGGAGGCCG AGATCAAGCC GCGGGTTCTC
GAAACCTTCG ACAACATCGC CTCGAACTAC CGCAAGCTGC GCAAGCTCCA GAACGAGGAG
ACGGAGCTGA AGACGGGCGG CGGCACCGTC ACCTCCGCGC AGACCAAGAA GCAGGCCGAG
CTGAAGGACA TCGTCGTCAC CGACGTGAAG TCGCTCTCGC TCAACGCCAA CCGCATCGAG
GCCCTGGTCG AGCAGCTCTA CGACATCAAC AAGCGCCTCA TCTCGCATGA GGGCCGCCTG
ATGCGCGCCG CCGAGCACCA CGGCGTCGCC CGCGACGAGT TCCTGCGCCA CTACCAGGGC
TACGAGCTCG ACCCGAACTG GATGGACCGC GTCGCCACGC TGGGCGGCAA GGGCTGGAAG
AACTTCGTCG AGCGCGGCGG CCGTCAGGTC GCGGACCTGC GCGAGCAGAT CCTGACGCTC
GCCTCCGAGA CCGGCCTGCA GATCGGCGAG TACCGCAAGA TCGTCGCCAT GGTGCAGAAG
GGCGAGCGCG AGGCCCGCCA GGCGAAGAAG GAGATGATCG AGGCCAACCT CCGCCTCGTG
ATCTCGATCG CCAAGAAGTA CACCAACCGC GGCCTGCAGT TCCTGGACCT GATCCAGGAG
GGCAATATCG GCCTGATGAA GGCGGTCGAT AAGTTCGAGT ATCGCCGCGG CTACAAGTTC
TCGACCTACG CTACGTGGTG GATCCGGCAG GCGATCACCC GCTCGATCGC CGACCAGGCG
CGCACGATCC GCATTCCGGT GCACATGATC GAGACGATCA ACAAGATCGT CCGCACGTCA
CGCCAGATGC TGCACGAGAT CGGCCGCGAG CCGACTCCGG AGGAGCTGGC CGAGAAATTG
GCCATGCCGC TGGAGAAGGT GCGCAAGGTC CTGAAGATCG CCAAGGAGCC GATCTCCCTC
GAAACGCCGA TCGGCGACGA GGAGGATTCG CATCTCGGCG ACTTCATTGA GGACAAGAAC
GTCGTCCTGC CGATCGACGC GGCGATTCAG TCGAACCTGC GCGAGACCAC GACCCGTGTG
CTCGCCTCGC TGACGCCGCG CGAGGAGCGC GTGCTGCGCA TGCGCTTCGG CATCGGCATG
AACACCGACC ACACTCTCGA AGAGGTGGGT CAGCAGTTCT CGGTGACCCG CGAGCGCATC
CGCCAGATCG AAGCGAAGGC CCTGCGCAAG CTCAAGCATC CGAGCCGGTC GCGGAAGCTG
CGAAGCTTCC TCGACAACTG A
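
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any computation. As a rough sketch of how a GC content figure such as the one reported for this gene could be recomputed, assuming GC content means the fraction of G and C bases over the whole sequence (the helper names are hypothetical, and only the first printed line is used as input here):

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGGCAACGA AGGCAACCGA GCGGGACGAT GCGGACGCCG CCCAGGATCA ACCGACGGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 2001 bp sequence to `gc_fraction` instead of a single line would give the value for the whole gene.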
 
Protein sequence
MATKATERDD ADAAQDQPTD GPLLDLTDAA VKRMVKLAKK RGYVTYEEIN EVLPDGQVDG 
DQIEDVLAQL DDMGIRVVEA EESEEAAPEA KANGEASEEE ESTEGGEVAE TSGTRAVAVA
TPTTREPTDR TDDPVRMYLR EMGSVELLSR EGEIAIAKRI EAGREAMIAG LCESPLTFQA
IIIWRDELVD GKVLLRDIID LEATYAGPDA RGAPQEAEAE GEESEEAEGA VPPEGADDED
DMENNVSLAA MEAEIKPRVL ETFDNIASNY RKLRKLQNEE TELKTGGGTV TSAQTKKQAE
LKDIVVTDVK SLSLNANRIE ALVEQLYDIN KRLISHEGRL MRAAEHHGVA RDEFLRHYQG
YELDPNWMDR VATLGGKGWK NFVERGGRQV ADLREQILTL ASETGLQIGE YRKIVAMVQK
GEREARQAKK EMIEANLRLV ISIAKKYTNR GLQFLDLIQE GNIGLMKAVD KFEYRRGYKF
STYATWWIRQ AITRSIADQA RTIRIPVHMI ETINKIVRTS RQMLHEIGRE PTPEELAEKL
AMPLEKVRKV LKIAKEPISL ETPIGDEEDS HLGDFIEDKN VVLPIDAAIQ SNLRETTTRV
LASLTPREER VLRMRFGIGM NTDHTLEEVG QQFSVTRERI RQIEAKALRK LKHPSRSRKL
RSFLDN
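
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this with a hand-built codon table, assuming the amino-acid assignments of NCBI translation table 11 (which are the same as those of the standard code) and ignoring alternative start codons; `translate_cds` is a hypothetical helper, not a database function.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest)
# paired with the amino acids of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(raw: str) -> str:
    """Translate a block-formatted coding sequence, dropping one trailing stop."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate_cds("ATGGCAACGA AGGCAACCGA GCGGGACGAT"))  # prints: MATKATERDD
print(translate_cds("CGAAGCTTCC TCGACAACTG A"))           # prints: RSFLDN
```

Both fragments match the start and end of the listed protein sequence.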