Gene Mchl_0772 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_0772 
Symbol 
ID7115765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp784861 
End bp787176 
Gene Length2316 bp 
Protein Length771 aa 
Translation table11 
GC content71% 
IMG OID643523576 
ProductRNA binding S1 domain protein 
Protein accessionYP_002419619 
Protein GI218528803 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGAGCG TGAACCTGCT GATCGCCGAG GAACTCGGTG CGCGCGAGGG GCAGGTGGCG 
GCGGCCGTGG ACCTGCTCGA CGGCGGCTAC ACCGTCCCGT TCATCGCCCG CTACCGCAAG
GAGGCGACCG GCTCGCTGGA CGACGCACAG CTCCGCACCC TTGAGGAGCG GCTGGGCTAT
CTGCGCGAGC TGCGCGACCG GCGCACCAGC GTCACGGAGA GCATCCGCGC CCAGGGCAAG
CTGACGCCGG AACTCGCCGC CGCCATCGCC GCCGCCGACA CCAAGGCGCG GCTGGAAGAC
ATCTACCTGC CGTTCCGGCC CAAGCGCCGC AGCAAGGCGC AGACCGCCCG CGAGGCCGGG
CTCGCGCCGC TGGCCGAGGC CCTGCTCACG CGTCCCGAGA CGGTGCCGGA GCGTGCGGCG
CAGGGCTTCG TGGACGCGGC CAAGGGCATC GAAACCGCCG AGGCGGCCCT GGAGGGCGCC
CGGGCGATCC TGATCGAGCG GTTTGCCGAG GATGCCGACC TGATCGGCCG CCTGCGCGAG
GATTTCTGGC GCAGTGGCGA GGCCGTGGCA AAGGTGCGCA AGGGCCAGGA GACGACGGGC
CAGAAGTTCT CGGACTATTT CGACTGGCGC GAGCGCCTGG AGCGGATGCC CTCGCACCGG
GTGCTGGCGG TGTTCCGCGG CGAGAAGGAG GAGGTGCTCG ACCTCGCCTT TGCCGCGGAG
GGCGAGGATT CCGCGCCCGG CGTGCCGGGG CCGTTCGAGC TCGCCGTCTG CCGCCGGTTC
GGCGTCTCCG CGCGGGGCCG GCCGGCGGAT GCGTGGCTGC TCGACACGGT TCGCACCGCC
TGGCGCACCA AGATCCGCAC CGGCATCAAG GCCGATCTGC GGGCGCGCCT GTTCGAGCGG
GCGGAGGAGG CGGCGGTGAA GGTGTTCGCC GGCAATCTCA AGGATCTGCT GCTTGCCGCC
CCCGCGGGCG GCCGGGCGAC GCTCGGGCTC GATCCCGGCT ACCGCAACGG CGTGAAGGCG
GCGGTGGTCG ACCGCACCGG TAAGGTCGTG GCGGTCGAGA CCACCTATCC GCACGAGCCG
CAGCGGCGCT GGAAGGAGGC AGTGGTCTCG CTCTCCCGGC TCTGTCGCCA GCACAGCGTT
GAGCTGATCG CCATCGGCAA CGGCACAGCC TCGCGCGAGA CCGACCGGCT CGCCACCGAG
ATCCTGGCGG CCAACCCTGA TCTCAAGATG GCCAAGGTCA CGGTGTCGGA GGCCGGCGCC
TCGGTCTATT CAGCGTCGGC CATCGCCACG CGTGAGTTGC CCGACCTCGA CGTGTCGCAT
CGCGGCGCCG TCTCCATCGC CCGGCGCCTG CAGGACCCGC TGGCGGAACT GGTGAAGATC
GACCCGAAAT CCATCGGCGT CGGCCAGTAC CAGCACGACG TCACCGAACA GAAGCTGTCG
CGCTCGCTTC AAGCGGTGGT CGAGGATGCG GTGAACGCGG TCGGCGTCGA TGTGAACACC
GCCTCCGGCC CGCTGCTCGC CCAGGTCTCG GGCCTCGGCG CGTCGGTGGC GGACAAGATC
GTTAGCCACC GCGACGCCCA CGGCCCGTTC CGCACCCGCG CCGGGCTGAA GAAGGTGCCG
GGCCTCGGCG CTAAGACTTT TGAGCTCGCG GCGGGCTTCC TGCGCATCCC CGATGGCGAG
GACCCGCTCG ACCGCTCCGG CGTCCACCCG GAGGCCTATC CGGTGGTGCG CCGCATCCTG
GAGGCGACGA AGAGCGACAT CCGCGTGCTG ATCGGCAATG AAGCCGCCCT GCGCCTGCTC
TTACCTGCCG CCTTCGCCGA CGAACGCTTC GGCGTGCCGA CCGTGCGCGA CATCATCGCC
GAGCTGGAAA AGCCCGGCCG CGACCCGCGC CCGGCCTTCA AGACGGCGAA CTTCCAGGAA
GGCGTCGAGA AAATCGGCGA CCTCAAGCCG GGGATGCAGT TGGAGGGCGT CGTCACCAAC
GTCGCGGCCT TCGGCGCCTT CGTCGATATC GGCGTGCATC AGGACGGACT CGTCCACATC
TCGGCGATGG CCCGCAAGCG GATCGCCTCG CCTTCCGAGG TGGTGAAGAC CGGCGACGTG
GTGCGTGTGC TGGTGTTGTC GATCGATGTG CCGCGCAAGC GCATCGCGCT GTCGATGCGG
CTCGACGACC CCCTTGAGGG CGCAACGGCG CCGCGTGGAA ACGCCCCCCG CCCCGAGGCG
CAGCCCCGGC GCCCAGCGCC TGCAGCCCCG CCGCAGGACG GGGCGCTGGC CGACGCGCTC
CGGCGCGCCG GAGTTTCGTC GCCTAAGCGT TCTTGA
 
Protein sequence
MKSVNLLIAE ELGAREGQVA AAVDLLDGGY TVPFIARYRK EATGSLDDAQ LRTLEERLGY 
LRELRDRRTS VTESIRAQGK LTPELAAAIA AADTKARLED IYLPFRPKRR SKAQTAREAG
LAPLAEALLT RPETVPERAA QGFVDAAKGI ETAEAALEGA RAILIERFAE DADLIGRLRE
DFWRSGEAVA KVRKGQETTG QKFSDYFDWR ERLERMPSHR VLAVFRGEKE EVLDLAFAAE
GEDSAPGVPG PFELAVCRRF GVSARGRPAD AWLLDTVRTA WRTKIRTGIK ADLRARLFER
AEEAAVKVFA GNLKDLLLAA PAGGRATLGL DPGYRNGVKA AVVDRTGKVV AVETTYPHEP
QRRWKEAVVS LSRLCRQHSV ELIAIGNGTA SRETDRLATE ILAANPDLKM AKVTVSEAGA
SVYSASAIAT RELPDLDVSH RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVTEQKLS
RSLQAVVEDA VNAVGVDVNT ASGPLLAQVS GLGASVADKI VSHRDAHGPF RTRAGLKKVP
GLGAKTFELA AGFLRIPDGE DPLDRSGVHP EAYPVVRRIL EATKSDIRVL IGNEAALRLL
LPAAFADERF GVPTVRDIIA ELEKPGRDPR PAFKTANFQE GVEKIGDLKP GMQLEGVVTN
VAAFGAFVDI GVHQDGLVHI SAMARKRIAS PSEVVKTGDV VRVLVLSIDV PRKRIALSMR
LDDPLEGATA PRGNAPRPEA QPRRPAPAAP PQDGALADAL RRAGVSSPKR S