Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0019 |
Symbol | |
ID | 4785303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 21840 |
End bp | 23126 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640088566 |
Product | putative RNA polymerase sigma factor |
Protein accession | YP_001019216 |
Protein GI | 124265212 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.962895 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.592717 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGACGA CCGACACGCA GCGCGCGATC GATGCCGTCT GGCGCATCGA ATCGGCGAAG ATCGTCGCCG TGGTCGCGCG CATGGTGCGC GACGTCGGCG TGGCCGAGGA GCTGGCGCAG GATGCGCTGG TCGCCGCGCT CGAGCACTGG CCCGTCGACG GCCTGCCCGA CAAGCCCGGG GCCTGGCTGA TGACCACCGC CAAGCACCGC GCGCTCGACC GGCTGCGGCA GGACGCGCTG CACGCGCGCA AGCAGCAGGA GCTCGGCGCC GACCTCGACG CGCTGCAGGC GGGCGTGGTG CCGGACTTCG TGGACGCGCT CGACGCGGCA CGCCAGGACG ACATCGGCGA CGACCTGCTG CGCCTGATCT TCACCGCCTG CCATCCGGTG CTCGGCACCG AGGCGCGCGT GGCCCTCACA CTGCGCCTGC TGGGCGGGCT GAGCACCGAC GAGATCGCGC GCGCCTTCCT GGCGCCGGAG GCGACGATCG CGCAGCGCAT CGTGCGCGCC AAGCGCACGC TGAGCGCGGC CCGCGTGCCC TTCGAGGTGC CGCAGGCGCA GGAACGCGCG CCGCGCCTGG CCTCGGTGCT GGAGGTGGTC TACCTGATCT TCAACGAGGG CTACTCGGCC ACCGCCGGCG ACGACTGGAT GCGCCCGGCG CTGTGCGACG AAGCGCTGCG CCTGGGCCGC GTGCTGGCCG GGCTGGCGCC CGACGAGCCC GAGGTGCACG GCCTGGTGGC GCTGATGGAG ATCCAGTCCT CGCGCACCGC GGCCCGCACC GATGCGCGAG GGCGGCCGGT GCTGCTGCTC GACCAAGATC GCACGCGCTG GGACCCGCTG CTGATCCGCC GCGGCCTGGC CGCGCTGGAG CGCGCCACGG CGCTGGGCGG CCTGCGCGGG CCGTATGCGC TGCAGGCGGC GCTCGCGGCC TGCCATGCGC GCGCGGCGAA GGCGGCCGAC ACCGACTGGC CGCTGATCGT GGCGCTCTAC GACGCGCTGG CCCAGGTGGC GCCGTCGCCG GTGGTCGAGC TCAACCGCGC GGTCGCGGTC GGCATGGCCT TCGGACCAAC AGCGGGGCTG GAAATCGTCG ACACGGTCGC ATCGAGCCCG GCGCTCGCCG GCTACCCCTG GTTGCCGAGC GTGCGCGGCG ACCTGCTCGC CAAGCTGGGC CGCCATGACG AGGCGCGCGC GGAGTTCGAG CGCGCCGCGG CGCTGACGCG CAATGGGCGC GAGCGCGAGC TGCTGCTGGA GCGGGCGGCT CAGTCGCGCG ATGCCGGCCG CAGGTAG
|
Protein sequence | MATTDTQRAI DAVWRIESAK IVAVVARMVR DVGVAEELAQ DALVAALEHW PVDGLPDKPG AWLMTTAKHR ALDRLRQDAL HARKQQELGA DLDALQAGVV PDFVDALDAA RQDDIGDDLL RLIFTACHPV LGTEARVALT LRLLGGLSTD EIARAFLAPE ATIAQRIVRA KRTLSAARVP FEVPQAQERA PRLASVLEVV YLIFNEGYSA TAGDDWMRPA LCDEALRLGR VLAGLAPDEP EVHGLVALME IQSSRTAART DARGRPVLLL DQDRTRWDPL LIRRGLAALE RATALGGLRG PYALQAALAA CHARAAKAAD TDWPLIVALY DALAQVAPSP VVELNRAVAV GMAFGPTAGL EIVDTVASSP ALAGYPWLPS VRGDLLAKLG RHDEARAEFE RAAALTRNGR ERELLLERAA QSRDAGRR
|
| |