Gene M446_2945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_2945 
Symbol 
ID6131210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3261721 
End bp3264051 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content74% 
IMG OID641643136 
ProductRNA-binding S1 domain-containing protein 
Protein accessionYP_001769791 
Protein GI170741136 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0681902 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGCA TCACCCAGCG CATCGCGGCC GAGCTCGGCG CGCAGGAATG GCAGGTCAAG 
GCGGCGGTCG ACCTCCTCGA CGGCGGCGCC ACCGTCCCGT TCGTCGCCCG CTACCGCAAG
GAGGCGACCG GAACTCTCGA CGACGCGCAG TTGCGCACCC TGGAGGAGCG GCTGCGCTAC
CTGCGCGAGC TGGAGGAGCG CCGCCGGGCG ATCCTCGACG GGATCGAGGC CCAGGGCAAA
CTCGACGAGG CCCTGCGCCG CCAGATCCTG GCGGCCGAGA CCAAGGCCCG GCTCGAGGAC
CTCTACCTTC CCTACAAGCC CAAGCGGCGC ACCAGGGCGC AGATCGCCCG CGAGGCCGGG
CTCGGCCCGC TGGCGGAGGC GCTCCTCGCC GATCCGCGCC GCGCGCCGCG CGAGGCCGCC
GCCCCCTTCG CGGACGCGCA GAAGGGTGTC GCCAGCCCGG AGGCCGCGCT GGAGGGAGCC
CGCGCCATCC TGATCGAGCG CTTCGCCGAG GATGCCGAGC TGGTCGGCGC CCTGCGCGAG
ACCTTCTGGA AGGAGGGGCG CCTCGTCTCC ACCCTGCGCG CGGGCAAGGC CGAGGCGGGC
GCGAAATTCG CCGATTACTT CGCCTTCTCG GAGCCGCTGA CCCGCCTCCC CTCCCACCGG
GTGCTGGCGC TGTTCCGCGG CGAGAAGGAG GAGGTGCTCG ACCTGCGCCT CGACGAGACG
CCCGCCGGGA CCGAGCCCGG CGCGCCGAGC CGCTACGAGG GCCGCATCGC CCTGCGCCAC
GGCATCCGCG ACGAGGGCCG CCCCGGCGAC CGCTGGCTGG CCGAGACGGT GCGCGCGGCA
TGGCGCACGA AGCTCAGGCT CTCGATCGAG CTCGACCTGC GGGCGCGCCT GTGGGAGGCG
GCCGAGAGCG AGGCCGTGCG GGTCTTCGCC GGCAACCTGC GCGACCTGCT CCTGGCGGCC
CCCGCGGGCG CGCGCCCGAC GCTCGGGCTC GATCCGGGCT ACCGGACGGG GGTCAAGGTC
GCGGTGGTGG ACGGAACCAG CAAGGTCGTG GCCACGGACA CGATCTACCC GCACGAGCCG
CGCCGCGACT GGGACCGCGC CGTCGCGACC CTGGCGCGCC TCTGCCGCCA GCACCGGGTC
GAGCTCGTGG CGATCGGCAA CGGCACCGCC TCGCGCGAGA CCGACCGGCT CGCCGGGGAG
CTGATCCGCC TGCACCCGGA CCTCGGCCTC ACCAAGGTGA TGGTGTCGGA GGCCGGCGCC
TCGGTCTATT CGGCCTCCGC CTATGCGAGC CAGGAATTGC CGGACCTCGA CGTGTCGCTG
CGCGGGGCGG TCTCGATCGC CCGGCGGCTG CAGGATCCCC TGGCCGAACT GGTGAAGATC
GAGCCGCGCT CGATCGGCGT CGGCCAGTAC CAGCACGACC TCGCGGAGGG GAAGCTGTCG
CGCTCCCTCG ACGCGGTGGT GGAGGATTGC GTGAACGGCG TCGGAGTCGA CGTCAACACC
GCCTCGGCCC CGCTGCTGGC GCGGGTCTCG GGGCTGAGCG AGCGGGTGGC GCAGGCCATC
GTCGTGCACC GCGACGCGCA CGGGCCCTTC CGCAGCCGCA CCGCCCTGAA GAAGGTGGCG
GGTCTCGGGC CCAAGGCCTT CGAGCTCTCG GCGGGCTTCC TGCGCATCAC CGGCGGCGAC
GACCCGCTCG ACGCCTCGGG CGTGCACCCG GAGGCCTACC CGGTGGTGCG CAAGATCCTG
CAGGCGACGA AGAGCGACAT CCGGGCGGTG ATCGGCAACG GATCGGTGCT GCGGAGCCTC
GATCCGCGGG CCTTCACGGA CGCGACCTTC GGGCTGCCGA CGGTCACCGA CATCCTGGCC
GAGTTGGAGA AGCCCGGCCG CGACCCGCGC CCGAGCTTCC GCACCGCGAG CTTCCAGGAG
GGCGTCGAGA CGATCGGCGA CCTCAAGCCC GGGATGCTGC TGGAAGGGGT GGTGACCAAC
GTCGCGGCCT TCGGGGCCTT CGTGGACGTG GGCGTCCACC AGGACGGGCT CGTCCACATC
TCGGCCCTGT CGAACAGCTT CGTCAGGGAT CCGCGGGCGG TGGTGAAGCC CGGGGACGTC
GTGCGGGTGA AGGTTCTCGA CGTCGACGTG CCGCGCAAGC GGATCTCGCT GACGATGCGC
CTCGACGATG CCCCGCAGGC CCGGCCGGGC CGGGACGGCG CCCGGCGGGA GCCCGCGCCG
AACCCTCCGC GCACCGAGAC AAAGCCGCCG CGCGGGGAGA CCGAGCCGGC CGGTGACGGC
GCCCTCGCCG AGGCGCTGCG CCGGGCGGGG CTCGACCGGC CGCGACGGTA G
 
Protein sequence
MASITQRIAA ELGAQEWQVK AAVDLLDGGA TVPFVARYRK EATGTLDDAQ LRTLEERLRY 
LRELEERRRA ILDGIEAQGK LDEALRRQIL AAETKARLED LYLPYKPKRR TRAQIAREAG
LGPLAEALLA DPRRAPREAA APFADAQKGV ASPEAALEGA RAILIERFAE DAELVGALRE
TFWKEGRLVS TLRAGKAEAG AKFADYFAFS EPLTRLPSHR VLALFRGEKE EVLDLRLDET
PAGTEPGAPS RYEGRIALRH GIRDEGRPGD RWLAETVRAA WRTKLRLSIE LDLRARLWEA
AESEAVRVFA GNLRDLLLAA PAGARPTLGL DPGYRTGVKV AVVDGTSKVV ATDTIYPHEP
RRDWDRAVAT LARLCRQHRV ELVAIGNGTA SRETDRLAGE LIRLHPDLGL TKVMVSEAGA
SVYSASAYAS QELPDLDVSL RGAVSIARRL QDPLAELVKI EPRSIGVGQY QHDLAEGKLS
RSLDAVVEDC VNGVGVDVNT ASAPLLARVS GLSERVAQAI VVHRDAHGPF RSRTALKKVA
GLGPKAFELS AGFLRITGGD DPLDASGVHP EAYPVVRKIL QATKSDIRAV IGNGSVLRSL
DPRAFTDATF GLPTVTDILA ELEKPGRDPR PSFRTASFQE GVETIGDLKP GMLLEGVVTN
VAAFGAFVDV GVHQDGLVHI SALSNSFVRD PRAVVKPGDV VRVKVLDVDV PRKRISLTMR
LDDAPQARPG RDGARREPAP NPPRTETKPP RGETEPAGDG ALAEALRRAG LDRPRR