Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2491 |
Symbol | |
ID | 4784887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 2650948 |
End bp | 2653188 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640091061 |
Product | putative RNA polymerase sigma(sigma-70) factor transcription regulator protein |
Protein accession | YP_001021681 |
Protein GI | 124267677 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.046137 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.87444 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCGA AGAAGAGCGC TCCCGCCCCT GCGAGCAAGC CATCGAAGAC CGCCGCGCCG GCGCCGGCCG CCAAGGCCGC AGCCAAGTCG GCCGGCAAGG CCGAGATCGA CCCGAAGAAG GCCAAGGCGG CTGCCGCCCC GGCCAAAGCC GCGGGGAAGA CGGCCAAAGT CGCCGTGCCC GAGGCCCCCA AGGGCAAGCC GGGTCGCAAG CCGGCCGCCA AGCAAGCGCT GCCGGTGCTC GACGAGGACC TGGGCGACAT CGAGGCCGAC CTCGAAGGCG AAGCCAGCGC CGAGGACACC GGTGGCGACA AGCCCAAGGC CAAGCCGCTG CGCATGAAGG TCTCGCGCGC CAAGGAGCGC GCGCTGATGC GCGAATTCGG TCTCGATGAG ACGGCGCTGA CCGAGGACGA GGTGGCCAAG CGCCGCCAGG AGCTCAAGAC GCTCATCAAG ATGGGCAAGA CCCGGGGTTT CCTGACACAC CAGGAAATCA ACGACCACCT GCCCGAAAAG CTGATCGAGG CCGAGATCCT CGAGGCCATC GTGTCGATGC TCAACGACAT GGGCATCGCC GTCTACGAGC AAGCACCCGA CGCGGCCACG CTGCTGATCG CCGGCGGCTC GACCGCCACC TCGACCGATG AGGAGGCCGA GGAAGAGGCC GAGGCGGCGC TGTCGACCGT CGACTCCGAG TTCGGCCGCA CCACCGACCC GGTGCGCATG TACATGCGCG AGATGGGCAC GGTCGAGCTG CTGACGCGCG AGGGTGAGAT CGAGATCGCC AAACGCATCG AGGGCGGGCT GCAGGCGATG ATGCTGGCGA TCAGCGAATC GCCCACCACG ATCGCCGAGA TCCTGGTGCT GGCCGACAAG ATCCGCGTCG GCGAGATGCA GATCTCGGAG GCGGTCGACG GCTTCGTGTC GAACGAGGAG GCCGACGACT ACGTCGCCGA AGAGGACTTC GACGAGTTCG ACGAGGAAGA CGACGACGAC GGCAACGGCG GCTCGAAGGC GCTGACCAAG AAGCTCGAGG AACTGAAGAC GCAGGCGCTG GTGCGTTTCG ACGAACTGCG CACGCACTTC GACCGCATGC GCAAGGCCTA CGAGAAGGAA GGCTACAAGT CGCCGGCCTA CAACCGCGCG CAGATGGGCG TGAGTTCCGA GATCATGAGC CTGCGCTTCA CGGTCAAGAC CATCGAGCGG CTGTGCCAGA TCCTGCGCTC GCAGGTCGAC GACATCCGCC GCTACGAGCG CGAACTGCGC AAGATCGTGG TCGACAAGTG CGGCATGCCG CAGGACCACT TCATCAAGAC CTTCCCGCCA AACTCGCTGA ACCTCAAGTG GGCCGAGAAG GAGATCGCAG CCAACAAGTC GTACAGCGCG GTGATGGCGC GCAACCTGCC GCCGATCCAG GACCTGCAGC AGAAGCTGAT CGACCTGCAG AGCCGTGCGG TGGTGCCGAT CGACGACCTG AAGCTCATCA ACAAGAAGAT GAACCAGGGC GAGAAGGCCT CGCGAGACGC GAAGAAGGAG ATGATCGAGG CCAACCTGCG CCTGGTGATC TCGATCGCGA AGAAGTACAC CAACCGCGGC CTGCAGTTCC TCGATCTGAT CCAGGAAGGC AACATCGGCC TGATGAAGGC GGTCGACAAG TTCGAATACC GCCGCGGCTA CAAGTTCTCG ACCTACGCGA CGTGGTGGAT CCGCCAGGCC ATCACGCGCT CGATCGCCGA CCAGGCGCGC ACCATTCGCA TTCCGGTGCA CATGATCGAG ACGATCAACA AGATGAACCG CCTGTCGCGC CAGCACCTGC AGGAGTTCGG CTTCGAGCCG GACGCCCCGA CGCTGGCCGA AAAGATGGAG ATGCCCGAGG ACAAGATCCG CAAGATCATG AAGATCGCCA AGGAGCCGAT CTCCATGGAG ACGCCGATCG GCGACGACGA CGATTCGCAC CTGGGCGACT TCATCGAGGA CACCAACAAC ACCGCGCCGA TCGAGGCGGC GATGCAGGCA GGTCTGCGCG ACGTGGTGAA GGACATCCTC GACTCGCTGA CGCCGCGCGA GGCCAAGGTG CTGCGCATGC GCTTCGGCAT CGAGATGTCG ACCGACCACA CGCTGGAGGA AGTGGGCAAG CAGTTCGACG TGACGCGCGA GCGCATCCGC CAGATCGAAG CCAAGGCGAT CCGCAAGCTC AAGCACCCGA GCCGTTCCGA CAAGCTGAGG ACCTACCTGG ACAATCTCTG A
|
Protein sequence | MTAKKSAPAP ASKPSKTAAP APAAKAAAKS AGKAEIDPKK AKAAAAPAKA AGKTAKVAVP EAPKGKPGRK PAAKQALPVL DEDLGDIEAD LEGEASAEDT GGDKPKAKPL RMKVSRAKER ALMREFGLDE TALTEDEVAK RRQELKTLIK MGKTRGFLTH QEINDHLPEK LIEAEILEAI VSMLNDMGIA VYEQAPDAAT LLIAGGSTAT STDEEAEEEA EAALSTVDSE FGRTTDPVRM YMREMGTVEL LTREGEIEIA KRIEGGLQAM MLAISESPTT IAEILVLADK IRVGEMQISE AVDGFVSNEE ADDYVAEEDF DEFDEEDDDD GNGGSKALTK KLEELKTQAL VRFDELRTHF DRMRKAYEKE GYKSPAYNRA QMGVSSEIMS LRFTVKTIER LCQILRSQVD DIRRYERELR KIVVDKCGMP QDHFIKTFPP NSLNLKWAEK EIAANKSYSA VMARNLPPIQ DLQQKLIDLQ SRAVVPIDDL KLINKKMNQG EKASRDAKKE MIEANLRLVI SIAKKYTNRG LQFLDLIQEG NIGLMKAVDK FEYRRGYKFS TYATWWIRQA ITRSIADQAR TIRIPVHMIE TINKMNRLSR QHLQEFGFEP DAPTLAEKME MPEDKIRKIM KIAKEPISME TPIGDDDDSH LGDFIEDTNN TAPIEAAMQA GLRDVVKDIL DSLTPREAKV LRMRFGIEMS TDHTLEEVGK QFDVTRERIR QIEAKAIRKL KHPSRSDKLR TYLDNL
|
| |