Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3171 |
Symbol | |
ID | 4786568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3372326 |
End bp | 3373285 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640091743 |
Product | RNA polymerase sigma-32 factor |
Protein accession | YP_001022359 |
Protein GI | 124268355 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.171654 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.223351 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTCG ATGCCATGAA CACCACGCTG TCCCTGAACA CTTCCGCGGG CTCGCAGGCG CTGACGGTGC GCGATCCGTG GGCGCTGGTC CCGTCGCTCG GCGACCTGAA TGCCTATATC GCCGCCGTGA ACCGCCTGCC GATGCTGACG CTCGAGGAAG AGCAGTCGCT CGGCCAGCGG CTGCGTGACG AACACGACCT CGAGGCTGCC GGTCGCTTGG TGCTGTCGCA CTTGCGTCTG GTCGTGTCGG TATCGCGTCA GTACCTGGGC TACGGACTCC CTCACGGCGA CCTGATCCAG GAAGGCAATG TCGGCCTGAT GAAGGCCGTC AAGCGCTTCG ACCCCAGCCA GGGCGTGCGA CTGGTCAGCT ATGCGCTGCA CTGGATCAAG GCCGAAATTC ACGAGTACGT GCTGCGCAAC TGGCGCATGG TCAAGCTGGC GACCACGAAG GCACAGCGCA AGCTGTTCTT CAACCTGCGC TCGATGAAAC AGGGTTTCAA GGGCGATGCC ACCGACAGCG ACCTGCACCG CAGCACGCTG ACCGATGCCG AGATCGACAT CGTGGCCAGT GAACTCAAGG TCAAGCGCGA GGAAGTGATC GAGATGGAGA CGCGCCTGTC GGGTGGCGAT GTCGCGCTCG ATCCACAGAC CGACGACGGC GACGAGAGCT ACGCGCCGAT CGCCTATCTG GCCGACGACC GCCACGAGCC GACGCGTGTG CTCGACGCCC AGCGCCGTGA CGCGCTGGCC GGCGACGGCA TCGGCGAGGC GCTGGACGTG CTGGACGCGC GCAGCCGACG CATCGTCGAG GAACGCTGGC TCAAGGTCAA CGACAACGGT TCAGGCGGCA TGACGCTGCA TGAACTGGCC GCCGAGTACG GCGTCAGCGC CGAGCGGATC CGCCAGATCG AGGTAGCCGC CATGAAGAAG ATGCGCAAGG CGCTGGCCGC CTACGCCTGA
|
Protein sequence | MSFDAMNTTL SLNTSAGSQA LTVRDPWALV PSLGDLNAYI AAVNRLPMLT LEEEQSLGQR LRDEHDLEAA GRLVLSHLRL VVSVSRQYLG YGLPHGDLIQ EGNVGLMKAV KRFDPSQGVR LVSYALHWIK AEIHEYVLRN WRMVKLATTK AQRKLFFNLR SMKQGFKGDA TDSDLHRSTL TDAEIDIVAS ELKVKREEVI EMETRLSGGD VALDPQTDDG DESYAPIAYL ADDRHEPTRV LDAQRRDALA GDGIGEALDV LDARSRRIVE ERWLKVNDNG SGGMTLHELA AEYGVSAERI RQIEVAAMKK MRKALAAYA
|
| |