Gene Mpe_A1255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1255 
Symbol 
ID4785832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1352328 
End bp1353359 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content68% 
IMG OID640089821 
ProductRNA polymerase sigma S (sigma-38) factor transcription regulator protein 
Protein accessionYP_001020452 
Protein GI124266448 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02394] RNA polymerase sigma factor RpoS
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.286023 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCACA GGAAGCACTC TCCGCCCGGC AACGGACACG CGCTTCCTGA TGCGAGGGCC 
CTGCCGCACG GGGCCTGGCC GGTCGAGCCG ACGCATGGTG GTGGAGATCC GCCGAATGGT
GCGGACGACC TGCCGGAGCC GGTGCAGGAG CACCTGCCGA TCGCTGCCGG CGAAGGCGAG
GTTGGCAACA CGCTGCAGAC CTATCTGCGC GAGATCCGGC GAGCGCCCCT GTTCACGCCC
GACGAGGAAT TCGCGATGGC CACGCGCGCC CGGGCCGGCG ACTTCGCGGC GCGGCAGCAG
ATGATCGAGC GCAATCTGCG GCTCGTGGTC AGCATCGCCA AGAGCTACCT CAGCCGCGGC
CTGCCGATGA CCGACCTGAT CGAGGAAGGC AACCTCGGCC TGATGCACGC GATTGGCAAG
TTCGAGCCTG AACGCGGCTT TCGCTTCTCG ACCTACGCCT CGTGGTGGAT CCGCCAGAGC
ATCGAGCGCG CGATCATGCA CCAGGCCCGC CTGGTGCGGC TGCCGGTGCA TGTGGTGCGT
GAACTCAACC AGGTGCTCAA GGCGCGCCGT GCGCTGGAGG GCGAAGCCGC AGCGAGCGCC
GACGGGCGGA CCGTCCGCGT CGACGAGATC GCTGCAGCGT TGGGGCGCCC GGTGACCGAG
GTGTCCGAAC TGCTGCGATT TGCCGAGCAG CCCACGTCGC TCGATGCGCC GCTCGAGCGC
CAGGCAGGCA ATGGGGCCGA GACGCTGGGC GACATGGTGG CCGACGAGCA GGCCACGGAC
CCGCTGGGCC ACACGCTGAA CCATGAACTC GACGTGCTGC TGGAGCATGG TTTGGGCGAA
CTGAGCGAAC GCGAGCGCGA GGTGCTGGCT GGGCGCTATG GATTGCACGA CAGGGAACCC
GAGACGCTGG AGGTACTTGC CGAGCGGCTG GGCCTGACGC GCGAGCGTAT CCGCCAGATC
CAGCAGGAGG CGCTGCTCAA GCTCAAGCGC CGGATGGCAC GCAACGGCGT CAACCGCGAC
TCGATCTTCT GA
 
Protein sequence
MNHRKHSPPG NGHALPDARA LPHGAWPVEP THGGGDPPNG ADDLPEPVQE HLPIAAGEGE 
VGNTLQTYLR EIRRAPLFTP DEEFAMATRA RAGDFAARQQ MIERNLRLVV SIAKSYLSRG
LPMTDLIEEG NLGLMHAIGK FEPERGFRFS TYASWWIRQS IERAIMHQAR LVRLPVHVVR
ELNQVLKARR ALEGEAAASA DGRTVRVDEI AAALGRPVTE VSELLRFAEQ PTSLDAPLER
QAGNGAETLG DMVADEQATD PLGHTLNHEL DVLLEHGLGE LSEREREVLA GRYGLHDREP
ETLEVLAERL GLTRERIRQI QQEALLKLKR RMARNGVNRD SIF