Gene Mpe_A0148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0148 
Symbol 
ID4784849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp155771 
End bp157294 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content68% 
IMG OID640088696 
Productputative RNA polymerase sigma N (sigma 54) factor transcription regulator protein 
Protein accessionYP_001019345 
Protein GI124265341 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCTT CCCTGCAGGT CCGTTTTTCG CAGCATCTGG CGCTCACGCC GCAGCTTCAG 
CAGTCGATCC GGCTGCTGCA GCTTTCCACG CTCGAGCTTC ACCAGGAAGT CGAGCAGATG
CTGGAGCAGA ACCCCTTCCT CGAAGTGGAG GAAGATGCGC CGACCCCGTT CGACGCGCCG
GTGGAGCGCG CCACGGCCAC GGAGCGCCAG GCCGATGACG CCTGGGAGGG CTCGGGGTCC
GAGGTGGCCG CCGACCCCGA GCCGGTCGCG GTGGATGCGG CCGAGTTCGG CACCACCGAG
CGCGAGGACT GGGAGAACGG CACCGAGCGC GAGGACTTCG ACGGCATCCG CGAGACGCCC
GGCAAGGCCG GCAACAACGA CAGCGACGAG TTCGACCCCA TGGAGCGCAG CAGCGCCGGG
GTGAGCCTGC AGGACCACCT GCGCGACCAG TTGCGCGGCA TGCGCCTGAG CGACGAGGAC
CGCGGCGCGG TGATGGTGCT GATCGAATCG CTCGACGAGG ACGGCTACCT GGCCGACCCC
CTGGAAGAGA TCGCCCAGCG CCTGGCCGGC GACGAGGACG ACATCGCGGT CGAGGAGCTG
CTCGACCGCC TGCGCTGCGC GCTGAAGTGG CTGCACAACC TGGAGCCGCT GGGCGTCGGT
GCGCGCGACC TGTCGGAGTG CCTGACGCTG CAGCTGCGGG CCGGACCGCG CTGCGAGGCG
CAGATGATCG CGATCCTGAT CTGCAAGTAC CACCTCGAGT TGCTGGCGCG GCGCGACGGC
AAGAAGCTGA TGGCGGCCAC CGGCGCCGAC GAGGAGCTGC TGAAGGCCGC GCAGGCGCTG
ATCGTGCGCT GCGAGCCCAA GCCCGGCCGG CCCTTCACCA AGGCCGAGGC CAACATCATC
GTGCCCGATG TCATCGTGCA GAAGGCCGGC CGCGGCTGGC GCGTGGTGCT CAACCCCGAC
GTGATGCCCA AGCTGCGCAT CAACGACCTC TACGCCCAGG CCATCAAGCA GCAGCGGGGC
GCGCGCACCG AATCGGGCGC GGGCCTGAGC TCGCGGCTGC AGGAGGCGCG CTGGTTCATG
AAGAACATCC TGCAGCGCTT CGACACCATC CAGCGCGTGT CGCAGGCCAT CGTCGAGCGG
CAGAAGGCCT TCTTCAGCCA CGGCGCGATC GCGATGAAGC CACTGGTGCT GCGCGAGATC
GCCGACGAGC TGGGTCTGCA CGAGTCGACC ATCTCGCGCG TGACCACCGC CAAGTACATG
TCCACGCCCT ACGGCACCTT CGAGCTGAAG TATTTCTTCG GCTCCTCGCT CAACACCGAG
GCCGGTGGCA ATGCGTCGAG CACCGCGGTG CGTGCGCTGA TCAAGCAGCT GGTCAGTGCC
GAGGATGCCA AGAAGCCGCT GTCGGACAGC CAGCTCAGCA GCATGCTGGA AGAGCAGGGC
ATCCAGGTGG CGCGCCGCAC GGTGGCGAAG TACCGCGAGG CGCTGAAGAT CGCGCCGGCC
AACCTGCGGC GCACGATGAT GTAA
 
Protein sequence
MKPSLQVRFS QHLALTPQLQ QSIRLLQLST LELHQEVEQM LEQNPFLEVE EDAPTPFDAP 
VERATATERQ ADDAWEGSGS EVAADPEPVA VDAAEFGTTE REDWENGTER EDFDGIRETP
GKAGNNDSDE FDPMERSSAG VSLQDHLRDQ LRGMRLSDED RGAVMVLIES LDEDGYLADP
LEEIAQRLAG DEDDIAVEEL LDRLRCALKW LHNLEPLGVG ARDLSECLTL QLRAGPRCEA
QMIAILICKY HLELLARRDG KKLMAATGAD EELLKAAQAL IVRCEPKPGR PFTKAEANII
VPDVIVQKAG RGWRVVLNPD VMPKLRINDL YAQAIKQQRG ARTESGAGLS SRLQEARWFM
KNILQRFDTI QRVSQAIVER QKAFFSHGAI AMKPLVLREI ADELGLHEST ISRVTTAKYM
STPYGTFELK YFFGSSLNTE AGGNASSTAV RALIKQLVSA EDAKKPLSDS QLSSMLEEQG
IQVARRTVAK YREALKIAPA NLRRTMM