Gene Mpe_A0019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0019 
Symbol 
ID4785303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp21840 
End bp23126 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content75% 
IMG OID640088566 
Productputative RNA polymerase sigma factor 
Protein accessionYP_001019216 
Protein GI124265212 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.962895 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.592717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGACGA CCGACACGCA GCGCGCGATC GATGCCGTCT GGCGCATCGA ATCGGCGAAG 
ATCGTCGCCG TGGTCGCGCG CATGGTGCGC GACGTCGGCG TGGCCGAGGA GCTGGCGCAG
GATGCGCTGG TCGCCGCGCT CGAGCACTGG CCCGTCGACG GCCTGCCCGA CAAGCCCGGG
GCCTGGCTGA TGACCACCGC CAAGCACCGC GCGCTCGACC GGCTGCGGCA GGACGCGCTG
CACGCGCGCA AGCAGCAGGA GCTCGGCGCC GACCTCGACG CGCTGCAGGC GGGCGTGGTG
CCGGACTTCG TGGACGCGCT CGACGCGGCA CGCCAGGACG ACATCGGCGA CGACCTGCTG
CGCCTGATCT TCACCGCCTG CCATCCGGTG CTCGGCACCG AGGCGCGCGT GGCCCTCACA
CTGCGCCTGC TGGGCGGGCT GAGCACCGAC GAGATCGCGC GCGCCTTCCT GGCGCCGGAG
GCGACGATCG CGCAGCGCAT CGTGCGCGCC AAGCGCACGC TGAGCGCGGC CCGCGTGCCC
TTCGAGGTGC CGCAGGCGCA GGAACGCGCG CCGCGCCTGG CCTCGGTGCT GGAGGTGGTC
TACCTGATCT TCAACGAGGG CTACTCGGCC ACCGCCGGCG ACGACTGGAT GCGCCCGGCG
CTGTGCGACG AAGCGCTGCG CCTGGGCCGC GTGCTGGCCG GGCTGGCGCC CGACGAGCCC
GAGGTGCACG GCCTGGTGGC GCTGATGGAG ATCCAGTCCT CGCGCACCGC GGCCCGCACC
GATGCGCGAG GGCGGCCGGT GCTGCTGCTC GACCAAGATC GCACGCGCTG GGACCCGCTG
CTGATCCGCC GCGGCCTGGC CGCGCTGGAG CGCGCCACGG CGCTGGGCGG CCTGCGCGGG
CCGTATGCGC TGCAGGCGGC GCTCGCGGCC TGCCATGCGC GCGCGGCGAA GGCGGCCGAC
ACCGACTGGC CGCTGATCGT GGCGCTCTAC GACGCGCTGG CCCAGGTGGC GCCGTCGCCG
GTGGTCGAGC TCAACCGCGC GGTCGCGGTC GGCATGGCCT TCGGACCAAC AGCGGGGCTG
GAAATCGTCG ACACGGTCGC ATCGAGCCCG GCGCTCGCCG GCTACCCCTG GTTGCCGAGC
GTGCGCGGCG ACCTGCTCGC CAAGCTGGGC CGCCATGACG AGGCGCGCGC GGAGTTCGAG
CGCGCCGCGG CGCTGACGCG CAATGGGCGC GAGCGCGAGC TGCTGCTGGA GCGGGCGGCT
CAGTCGCGCG ATGCCGGCCG CAGGTAG
 
Protein sequence
MATTDTQRAI DAVWRIESAK IVAVVARMVR DVGVAEELAQ DALVAALEHW PVDGLPDKPG 
AWLMTTAKHR ALDRLRQDAL HARKQQELGA DLDALQAGVV PDFVDALDAA RQDDIGDDLL
RLIFTACHPV LGTEARVALT LRLLGGLSTD EIARAFLAPE ATIAQRIVRA KRTLSAARVP
FEVPQAQERA PRLASVLEVV YLIFNEGYSA TAGDDWMRPA LCDEALRLGR VLAGLAPDEP
EVHGLVALME IQSSRTAART DARGRPVLLL DQDRTRWDPL LIRRGLAALE RATALGGLRG
PYALQAALAA CHARAAKAAD TDWPLIVALY DALAQVAPSP VVELNRAVAV GMAFGPTAGL
EIVDTVASSP ALAGYPWLPS VRGDLLAKLG RHDEARAEFE RAAALTRNGR ERELLLERAA
QSRDAGRR