Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2501 |
Symbol | |
ID | 5832513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2803372 |
End bp | 2804907 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641368303 |
Product | RNA polymerase sigma-54 factor, RpoN |
Protein accession | YP_001639967 |
Protein GI | 163851924 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.423834 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.389615 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTGC TCCAGCGGCT CGAGATGCGC CAGGGCCAAG CCCTGGTGAT GACACCGCAA CTGCTGCAGG CGATCAAGCT CCTGCAGCTC TCGCAGCTCG ATCTCTCGGC CTATGTCGAT GCCGAACTGG AGCGCAATCC GCTTCTGGAG CGCGCCGAGT CCGAAGGAGA ATCCGAGCGC GAGGCGCCGG AGGCTTCGGG CGACGGCGAG TACGACGGCG GCGACGGGCC GGAAGCCGAG ACCTGGCTCG GCTCCGAGAT GACGCAGAGC CGCAGCGAGA TCGAGGGTGA CCTCGACACG CGCCTCGACA ACGTGTTCGC CGACGAGAAT CCCACTCCGC GCGAGACCGG GACCGGGGAC ATGCTCTCGC TGACCCCAGC CCCTTACGGC AATGCCGGCG GCAGCTTCGA CGGCGAGCTT CCCGATTTCG AAGCGACGCT GACCGCCGAG ACCAGCTTGC GCGAGCACCT CGCCGCACAG CTCGATCTCG CGACGCAAAA CCCGTCCGAC CGGTTGATCG GCAGCTTCCT GATCGATGCG GTGGACGATG CGGGGTACCT GCGCGAGGAC ATCGACGGCG TGGCCGAGCG GCTCGGCGCC TCGCTCGATG ACGTCGCCCG CATTCTGAAA CTCGTCCAGA CCTTCGATCC GCCGGGCGTG GCCGCCCGCG ACCTCGCCGA ATGCCTCGCC ATCCAGCTTC GCGAGCAGGA TCGGTTCGAT CCGGCGATGC AGGCGCTGGT CTCGCGCCTC GATCTCGTGG CCAAGCGCGA CTTCCCGGGC CTGCGCCGCC TCTGCGGCGT GGACGACGAG GATCTCGTTG AGATGCTGGC CGAGATCCGC CGCCTCGACC CCAAGCCCGG CCGCGCCTTC GGCGCCCACG CGGTCGAGGT TCTGGTCCCC GACGTGTTCG TGCGCCCCGC GCCGGATGGC TGCTGGCTGG TGGAACTGAA CTCCGAGGCG CTGCCGCGGG TGCTGGTCAA CCAGAGCTAC TATGCCCGCG TCTCGAAGGG AGCGGTGGCG GACGGCGACA AGGCGTTCCT GTCCGAGTGT CTGCAGACGG CGAACTGGCT CACCCGCAGC CTGGAGCAGC GGGCGCGCAC CATCCTGAAG GTCGCCACCG AGATCGTCCG CCAGCAGGAC GCCTTCTTCC TGCACGGTGT CGCCCACCTG CGTCCGCTCA ACCTCAAGAC GGTCGCCGAG GCGATCGGCA TGCATGAATC GACCGTCTCC CGCGTCACAT CCAACAAGTC GATCGGCACC AGCCGCGGCA CCCTGGAGAT GAAGTACTTC TTCACCGCCG CAATCCCCGG CGCCGCCGGT GCCGCGTCCC ATTCCTCGGA ATCGGTGCGC CACCGCATCA AACAACTTGT CGATGCGGAA GCCTCCGACG TCCTGTCCGA CGATGCGCTG GTCCAGCGTC TGCGCGACGA GGGCATCGAC ATCGCCCGCC GCACCGTCGC CAAGTATCGG GAGTCGCTGC GGATCCCCTC GTCGATCGAG CGGCGCCGGG AGAAGATGGC CACGGCCGTC CGCTGA
|
Protein sequence | MSLLQRLEMR QGQALVMTPQ LLQAIKLLQL SQLDLSAYVD AELERNPLLE RAESEGESER EAPEASGDGE YDGGDGPEAE TWLGSEMTQS RSEIEGDLDT RLDNVFADEN PTPRETGTGD MLSLTPAPYG NAGGSFDGEL PDFEATLTAE TSLREHLAAQ LDLATQNPSD RLIGSFLIDA VDDAGYLRED IDGVAERLGA SLDDVARILK LVQTFDPPGV AARDLAECLA IQLREQDRFD PAMQALVSRL DLVAKRDFPG LRRLCGVDDE DLVEMLAEIR RLDPKPGRAF GAHAVEVLVP DVFVRPAPDG CWLVELNSEA LPRVLVNQSY YARVSKGAVA DGDKAFLSEC LQTANWLTRS LEQRARTILK VATEIVRQQD AFFLHGVAHL RPLNLKTVAE AIGMHESTVS RVTSNKSIGT SRGTLEMKYF FTAAIPGAAG AASHSSESVR HRIKQLVDAE ASDVLSDDAL VQRLRDEGID IARRTVAKYR ESLRIPSSIE RRREKMATAV R
|
| |