Gene Mext_2501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2501 
Symbol 
ID5832513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2803372 
End bp2804907 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content68% 
IMG OID641368303 
ProductRNA polymerase sigma-54 factor, RpoN 
Protein accessionYP_001639967 
Protein GI163851924 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.423834 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.389615 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTGC TCCAGCGGCT CGAGATGCGC CAGGGCCAAG CCCTGGTGAT GACACCGCAA 
CTGCTGCAGG CGATCAAGCT CCTGCAGCTC TCGCAGCTCG ATCTCTCGGC CTATGTCGAT
GCCGAACTGG AGCGCAATCC GCTTCTGGAG CGCGCCGAGT CCGAAGGAGA ATCCGAGCGC
GAGGCGCCGG AGGCTTCGGG CGACGGCGAG TACGACGGCG GCGACGGGCC GGAAGCCGAG
ACCTGGCTCG GCTCCGAGAT GACGCAGAGC CGCAGCGAGA TCGAGGGTGA CCTCGACACG
CGCCTCGACA ACGTGTTCGC CGACGAGAAT CCCACTCCGC GCGAGACCGG GACCGGGGAC
ATGCTCTCGC TGACCCCAGC CCCTTACGGC AATGCCGGCG GCAGCTTCGA CGGCGAGCTT
CCCGATTTCG AAGCGACGCT GACCGCCGAG ACCAGCTTGC GCGAGCACCT CGCCGCACAG
CTCGATCTCG CGACGCAAAA CCCGTCCGAC CGGTTGATCG GCAGCTTCCT GATCGATGCG
GTGGACGATG CGGGGTACCT GCGCGAGGAC ATCGACGGCG TGGCCGAGCG GCTCGGCGCC
TCGCTCGATG ACGTCGCCCG CATTCTGAAA CTCGTCCAGA CCTTCGATCC GCCGGGCGTG
GCCGCCCGCG ACCTCGCCGA ATGCCTCGCC ATCCAGCTTC GCGAGCAGGA TCGGTTCGAT
CCGGCGATGC AGGCGCTGGT CTCGCGCCTC GATCTCGTGG CCAAGCGCGA CTTCCCGGGC
CTGCGCCGCC TCTGCGGCGT GGACGACGAG GATCTCGTTG AGATGCTGGC CGAGATCCGC
CGCCTCGACC CCAAGCCCGG CCGCGCCTTC GGCGCCCACG CGGTCGAGGT TCTGGTCCCC
GACGTGTTCG TGCGCCCCGC GCCGGATGGC TGCTGGCTGG TGGAACTGAA CTCCGAGGCG
CTGCCGCGGG TGCTGGTCAA CCAGAGCTAC TATGCCCGCG TCTCGAAGGG AGCGGTGGCG
GACGGCGACA AGGCGTTCCT GTCCGAGTGT CTGCAGACGG CGAACTGGCT CACCCGCAGC
CTGGAGCAGC GGGCGCGCAC CATCCTGAAG GTCGCCACCG AGATCGTCCG CCAGCAGGAC
GCCTTCTTCC TGCACGGTGT CGCCCACCTG CGTCCGCTCA ACCTCAAGAC GGTCGCCGAG
GCGATCGGCA TGCATGAATC GACCGTCTCC CGCGTCACAT CCAACAAGTC GATCGGCACC
AGCCGCGGCA CCCTGGAGAT GAAGTACTTC TTCACCGCCG CAATCCCCGG CGCCGCCGGT
GCCGCGTCCC ATTCCTCGGA ATCGGTGCGC CACCGCATCA AACAACTTGT CGATGCGGAA
GCCTCCGACG TCCTGTCCGA CGATGCGCTG GTCCAGCGTC TGCGCGACGA GGGCATCGAC
ATCGCCCGCC GCACCGTCGC CAAGTATCGG GAGTCGCTGC GGATCCCCTC GTCGATCGAG
CGGCGCCGGG AGAAGATGGC CACGGCCGTC CGCTGA
 
Protein sequence
MSLLQRLEMR QGQALVMTPQ LLQAIKLLQL SQLDLSAYVD AELERNPLLE RAESEGESER 
EAPEASGDGE YDGGDGPEAE TWLGSEMTQS RSEIEGDLDT RLDNVFADEN PTPRETGTGD
MLSLTPAPYG NAGGSFDGEL PDFEATLTAE TSLREHLAAQ LDLATQNPSD RLIGSFLIDA
VDDAGYLRED IDGVAERLGA SLDDVARILK LVQTFDPPGV AARDLAECLA IQLREQDRFD
PAMQALVSRL DLVAKRDFPG LRRLCGVDDE DLVEMLAEIR RLDPKPGRAF GAHAVEVLVP
DVFVRPAPDG CWLVELNSEA LPRVLVNQSY YARVSKGAVA DGDKAFLSEC LQTANWLTRS
LEQRARTILK VATEIVRQQD AFFLHGVAHL RPLNLKTVAE AIGMHESTVS RVTSNKSIGT
SRGTLEMKYF FTAAIPGAAG AASHSSESVR HRIKQLVDAE ASDVLSDDAL VQRLRDEGID
IARRTVAKYR ESLRIPSSIE RRREKMATAV R