Gene Mpe_A3171 details


Gene Information

Locus tag: Mpe_A3171
Symbol:
ID: 4786568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3372326
End bp: 3373285
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 65%
IMG OID: 640091743
Product: RNA polymerase sigma-32 factor
Protein accession: YP_001022359
Protein GI: 124268355
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02392] alternative sigma factor RpoH
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
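
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; the record does not state either convention explicitly. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3372326
END_BP = 3373285
GENE_LENGTH_BP = 960
PROTEIN_LENGTH_AA = 319


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons hold for the values listed in this record.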


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.171654
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.223351
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTTCG ATGCCATGAA CACCACGCTG TCCCTGAACA CTTCCGCGGG CTCGCAGGCG 
CTGACGGTGC GCGATCCGTG GGCGCTGGTC CCGTCGCTCG GCGACCTGAA TGCCTATATC
GCCGCCGTGA ACCGCCTGCC GATGCTGACG CTCGAGGAAG AGCAGTCGCT CGGCCAGCGG
CTGCGTGACG AACACGACCT CGAGGCTGCC GGTCGCTTGG TGCTGTCGCA CTTGCGTCTG
GTCGTGTCGG TATCGCGTCA GTACCTGGGC TACGGACTCC CTCACGGCGA CCTGATCCAG
GAAGGCAATG TCGGCCTGAT GAAGGCCGTC AAGCGCTTCG ACCCCAGCCA GGGCGTGCGA
CTGGTCAGCT ATGCGCTGCA CTGGATCAAG GCCGAAATTC ACGAGTACGT GCTGCGCAAC
TGGCGCATGG TCAAGCTGGC GACCACGAAG GCACAGCGCA AGCTGTTCTT CAACCTGCGC
TCGATGAAAC AGGGTTTCAA GGGCGATGCC ACCGACAGCG ACCTGCACCG CAGCACGCTG
ACCGATGCCG AGATCGACAT CGTGGCCAGT GAACTCAAGG TCAAGCGCGA GGAAGTGATC
GAGATGGAGA CGCGCCTGTC GGGTGGCGAT GTCGCGCTCG ATCCACAGAC CGACGACGGC
GACGAGAGCT ACGCGCCGAT CGCCTATCTG GCCGACGACC GCCACGAGCC GACGCGTGTG
CTCGACGCCC AGCGCCGTGA CGCGCTGGCC GGCGACGGCA TCGGCGAGGC GCTGGACGTG
CTGGACGCGC GCAGCCGACG CATCGTCGAG GAACGCTGGC TCAAGGTCAA CGACAACGGT
TCAGGCGGCA TGACGCTGCA TGAACTGGCC GCCGAGTACG GCGTCAGCGC CGAGCGGATC
CGCCAGATCG AGGTAGCCGC CATGAAGAAG ATGCGCAAGG CGCTGGCCGC CTACGCCTGA
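
A GC content figure like the one reported in the Gene Information fields can be recomputed from a gene sequence such as the one above. The following is a minimal sketch, assuming GC content means the percentage of G and C among the A/C/G/T bases; the function name `gc_content` is hypothetical, and the spaces between 10-base blocks are ignored.

```python
def gc_content(seq):
    """Percentage of G and C bases among A/C/G/T characters in seq.

    Whitespace and any other non-ACGT characters are skipped, so the
    10-base blocks of the record can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used as a short example.
    print(gc_content("ATGTCTTTCG ATGCCATGAA"))
```

Applying the function to the full 960 bp sequence gives a value that can be compared with the rounded percentage reported for the gene.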
 
Protein sequence
MSFDAMNTTL SLNTSAGSQA LTVRDPWALV PSLGDLNAYI AAVNRLPMLT LEEEQSLGQR 
LRDEHDLEAA GRLVLSHLRL VVSVSRQYLG YGLPHGDLIQ EGNVGLMKAV KRFDPSQGVR
LVSYALHWIK AEIHEYVLRN WRMVKLATTK AQRKLFFNLR SMKQGFKGDA TDSDLHRSTL
TDAEIDIVAS ELKVKREEVI EMETRLSGGD VALDPQTDDG DESYAPIAYL ADDRHEPTRV
LDAQRRDALA GDGIGEALDV LDARSRRIVE ERWLKVNDNG SGGMTLHELA AEYGVSAERI
RQIEVAAMKK MRKALAAYA