Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0148 |
Symbol | |
ID | 4784849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 155771 |
End bp | 157294 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640088696 |
Product | putative RNA polymerase sigma N (sigma 54) factor transcription regulator protein |
Protein accession | YP_001019345 |
Protein GI | 124265341 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCTT CCCTGCAGGT CCGTTTTTCG CAGCATCTGG CGCTCACGCC GCAGCTTCAG CAGTCGATCC GGCTGCTGCA GCTTTCCACG CTCGAGCTTC ACCAGGAAGT CGAGCAGATG CTGGAGCAGA ACCCCTTCCT CGAAGTGGAG GAAGATGCGC CGACCCCGTT CGACGCGCCG GTGGAGCGCG CCACGGCCAC GGAGCGCCAG GCCGATGACG CCTGGGAGGG CTCGGGGTCC GAGGTGGCCG CCGACCCCGA GCCGGTCGCG GTGGATGCGG CCGAGTTCGG CACCACCGAG CGCGAGGACT GGGAGAACGG CACCGAGCGC GAGGACTTCG ACGGCATCCG CGAGACGCCC GGCAAGGCCG GCAACAACGA CAGCGACGAG TTCGACCCCA TGGAGCGCAG CAGCGCCGGG GTGAGCCTGC AGGACCACCT GCGCGACCAG TTGCGCGGCA TGCGCCTGAG CGACGAGGAC CGCGGCGCGG TGATGGTGCT GATCGAATCG CTCGACGAGG ACGGCTACCT GGCCGACCCC CTGGAAGAGA TCGCCCAGCG CCTGGCCGGC GACGAGGACG ACATCGCGGT CGAGGAGCTG CTCGACCGCC TGCGCTGCGC GCTGAAGTGG CTGCACAACC TGGAGCCGCT GGGCGTCGGT GCGCGCGACC TGTCGGAGTG CCTGACGCTG CAGCTGCGGG CCGGACCGCG CTGCGAGGCG CAGATGATCG CGATCCTGAT CTGCAAGTAC CACCTCGAGT TGCTGGCGCG GCGCGACGGC AAGAAGCTGA TGGCGGCCAC CGGCGCCGAC GAGGAGCTGC TGAAGGCCGC GCAGGCGCTG ATCGTGCGCT GCGAGCCCAA GCCCGGCCGG CCCTTCACCA AGGCCGAGGC CAACATCATC GTGCCCGATG TCATCGTGCA GAAGGCCGGC CGCGGCTGGC GCGTGGTGCT CAACCCCGAC GTGATGCCCA AGCTGCGCAT CAACGACCTC TACGCCCAGG CCATCAAGCA GCAGCGGGGC GCGCGCACCG AATCGGGCGC GGGCCTGAGC TCGCGGCTGC AGGAGGCGCG CTGGTTCATG AAGAACATCC TGCAGCGCTT CGACACCATC CAGCGCGTGT CGCAGGCCAT CGTCGAGCGG CAGAAGGCCT TCTTCAGCCA CGGCGCGATC GCGATGAAGC CACTGGTGCT GCGCGAGATC GCCGACGAGC TGGGTCTGCA CGAGTCGACC ATCTCGCGCG TGACCACCGC CAAGTACATG TCCACGCCCT ACGGCACCTT CGAGCTGAAG TATTTCTTCG GCTCCTCGCT CAACACCGAG GCCGGTGGCA ATGCGTCGAG CACCGCGGTG CGTGCGCTGA TCAAGCAGCT GGTCAGTGCC GAGGATGCCA AGAAGCCGCT GTCGGACAGC CAGCTCAGCA GCATGCTGGA AGAGCAGGGC ATCCAGGTGG CGCGCCGCAC GGTGGCGAAG TACCGCGAGG CGCTGAAGAT CGCGCCGGCC AACCTGCGGC GCACGATGAT GTAA
|
Protein sequence | MKPSLQVRFS QHLALTPQLQ QSIRLLQLST LELHQEVEQM LEQNPFLEVE EDAPTPFDAP VERATATERQ ADDAWEGSGS EVAADPEPVA VDAAEFGTTE REDWENGTER EDFDGIRETP GKAGNNDSDE FDPMERSSAG VSLQDHLRDQ LRGMRLSDED RGAVMVLIES LDEDGYLADP LEEIAQRLAG DEDDIAVEEL LDRLRCALKW LHNLEPLGVG ARDLSECLTL QLRAGPRCEA QMIAILICKY HLELLARRDG KKLMAATGAD EELLKAAQAL IVRCEPKPGR PFTKAEANII VPDVIVQKAG RGWRVVLNPD VMPKLRINDL YAQAIKQQRG ARTESGAGLS SRLQEARWFM KNILQRFDTI QRVSQAIVER QKAFFSHGAI AMKPLVLREI ADELGLHEST ISRVTTAKYM STPYGTFELK YFFGSSLNTE AGGNASSTAV RALIKQLVSA EDAKKPLSDS QLSSMLEEQG IQVARRTVAK YREALKIAPA NLRRTMM
|
| |