Gene Mext_0813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0813 
Symbol 
ID5832321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp883759 
End bp886074 
Gene Length2316 bp 
Protein Length771 aa 
Translation table11 
GC content72% 
IMG OID641366595 
ProductRNA-binding S1 domain-containing protein 
Protein accessionYP_001638289 
Protein GI163850246 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.179863 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.675147 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGAGCG TGAACCTGCT GATCGCCGAG GAACTCGGTG CGCGCGAGGG GCAGGTGGCG 
GCGGCCGTGG ACCTGCTCGA CGGCGGCTAC ACCGTCCCGT TCATCGCCCG CTACCGCAAG
GAGGCAACTG GCTCGCTGGA CGACGCACAG CTCCGCACCC TTGAGGAGCG GCTGGGGTAT
CTGCGCGAGC TGCGCGACCG GCGCACCAGC GTCACCGAGA GCATCCGCGC CCAGGGCAAG
CTGACGCCGG AACTCGCCGC CGCCATCGCC GCCGCCGACA CCAAGGCGCG GCTGGAAGAC
ATCTACCTGC CGTTCCGGCC CAAGCGCCGC AGCAAGGCGC AGACCGCCCG CGAGGCCGGG
CTCGCGCCGT TGGCCGAGAC CCTGCTCGCG CGGCCGGAGA CGGCGCCGGG GCGGGCAGCG
CAGGGCTTCG TCGACGCGGC CAAGGGCATC GAAACCGCCG AGGCGGCCCT GGAGGGCGCC
CGGGCGATCC TGATCGAGCG GTTTGCCGAG GATGCCGACC TGATCGGCCG CCTGCGCGAG
GACTTTTGGC GCGGCGGCGA GGCCGTGGCG AAGGTGCGCA AGGGCCAGGA GACTGCGGGC
CAGAAATTCT CCGACTATTT CGACTGGCGC GAGCGCCTGG AGCGGATGCC CTCGCACCGG
GTGCTGGCGG TGTTCCGCGG CGAAAAGGAG GAGGTGCTCG ACCTCGCCTT CGCCGCGGAG
GGCGAGGATT CCGCGCCCGG CGTGCCGGGG CCGTTCGAGC TCGCCGTCTG CCGCCGGTTC
GGCGTCTCCG CGCGGGGCCG GCCGGCGGAT GCGTGGCTGC TCGACACCGT CCGCACCGCG
TGGCGCACCA AGATCCGCAC CGGCATCAAG GCCGATCTGC GGGCACGCCT GTTCGAGCGG
GCGGAGGAGG CGGCGGTGAA GGTGTTCGCC GGCAACCTCA AGGATCTGCT GCTCGCCGCC
CCGGCGGGCG GGCGGGCGAC GCTCGGGCTC GATCCGGGCT ACCGCAACGG CGTGAAGGCG
GCGGTGGTCG ACCGCACCGG CAAGGTCGTC GCGGTCGAGA CCACCTATCC GCACGAGCCG
CAGCGGCGCT GGAAGGAGGC GGTGGCCTCG CTCTCCCGGC TCTGCCGCCA GCACGGCGTC
GAGTTGATCG CCATCGGCAA CGGCACCGCC TCGCGCGAGA CCGACCGGCT CGCCACCGAG
ATCCTGGCCG CAAACCCTGA TCTCAAGATG GCCAAGGTCA CGGTGTCGGA GGCCGGCGCC
TCGGTGTACT CGGCCTCGGC CATCGCCACC CGTGAGTTGC CCGACCTCGA CGTGTCGCAT
CGCGGCGCCG TCTCCATCGC CCGGCGCCTG CAGGACCCGC TGGCGGAACT GGTGAAAATT
GACCCGAAAT CCATCGGCGT CGGCCAGTAC CAGCACGACG TCACCGAGCA GAAGCTGTCG
CGCTCGCTTC AAGCGGTGGT CGAGGATGCG GTGAACGCGG TCGGCGTCGA TGTGAACACC
GCCTCCGGCC CGCTGCTGGC CCAGGTCTCG GGCCTCGGCG CGTCGGTGGC GGACAAGATC
GTCACCCACC GCGACGCCAA CGGCCCGTTC CGTACCCGCG CCGGACTGAA GAAGGTGCCG
GGCCTCGGCG CCAAGACCTT CGAGTTGGCG GCGGGCTTCC TGCGCATCCC CGATGGCGAG
GACCCGCTCG ACCGCTCCGG CGTCCACCCG GAAGCCTATC CGGTGGTGCG CCGCATCCTG
GAGGCGACGA AGAGCGACAT CCGCGTGCTG ATCGGCAATG CCGCCGCCCT GCGTCCGCTC
TCGCCCGCCG CCTTCGCCGA CGAACGCTTC GGCGTGCCGA CCGTGCGCGA CATCATCGCC
GAGTTGGAAA AGCCCGGCCG CGACCCGCGC CCGGCCTTCA AGACGGCGAG CTTCCAGGAA
GGCGTCGAGA AAATCGGCGA CCTCAGGCCA GGGATGCAGT TGGAGGGCGT CGTCACCAAC
GTCGCGGCCT TCGGCGCCTT CGTCGATATC GGCGTGCATC AGGACGGACT CGTCCACATC
TCGGCCATGG CCCGCAAGCG GATCGCCTCG CCTTCCGAAG TGGTGAAGAC CGGCGACGTG
GTGCGTGTGC TGGTGTTGTC GGTCGATGTG CCGCGCAAGC GCATCGCGCT GTCGATGCGG
CTCGACGACC CCCTTGAGGG CGCAACGGCG CCGCGTGGAA ACGTCCCCCG CCCCGAGGCG
CAGCCCCGGC GCCCGGCGCC CGCAGCCCCG CCGCAGGATG GGGCGCTGGC CGACGCGCTC
CGGCGCGCCG GAGTCTCGTC GTCTAAGCGT TCTTGA
 
Protein sequence
MKSVNLLIAE ELGAREGQVA AAVDLLDGGY TVPFIARYRK EATGSLDDAQ LRTLEERLGY 
LRELRDRRTS VTESIRAQGK LTPELAAAIA AADTKARLED IYLPFRPKRR SKAQTAREAG
LAPLAETLLA RPETAPGRAA QGFVDAAKGI ETAEAALEGA RAILIERFAE DADLIGRLRE
DFWRGGEAVA KVRKGQETAG QKFSDYFDWR ERLERMPSHR VLAVFRGEKE EVLDLAFAAE
GEDSAPGVPG PFELAVCRRF GVSARGRPAD AWLLDTVRTA WRTKIRTGIK ADLRARLFER
AEEAAVKVFA GNLKDLLLAA PAGGRATLGL DPGYRNGVKA AVVDRTGKVV AVETTYPHEP
QRRWKEAVAS LSRLCRQHGV ELIAIGNGTA SRETDRLATE ILAANPDLKM AKVTVSEAGA
SVYSASAIAT RELPDLDVSH RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVTEQKLS
RSLQAVVEDA VNAVGVDVNT ASGPLLAQVS GLGASVADKI VTHRDANGPF RTRAGLKKVP
GLGAKTFELA AGFLRIPDGE DPLDRSGVHP EAYPVVRRIL EATKSDIRVL IGNAAALRPL
SPAAFADERF GVPTVRDIIA ELEKPGRDPR PAFKTASFQE GVEKIGDLRP GMQLEGVVTN
VAAFGAFVDI GVHQDGLVHI SAMARKRIAS PSEVVKTGDV VRVLVLSVDV PRKRIALSMR
LDDPLEGATA PRGNVPRPEA QPRRPAPAAP PQDGALADAL RRAGVSSSKR S