Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2033 |
Symbol | |
ID | 4784253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 2175687 |
End bp | 2176952 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 640090603 |
Product | putative arginine/proline rich protein |
Protein accession | YP_001021226 |
Protein GI | 124267222 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases |
TIGRFAM ID | [TIGR00093] pseudouridine synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0619869 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACCC TCAAGCTCAA GAAGCCGGCG CCCGGCGCGC CCTCCTCACC CGCGACCGTC CGCCGCGCGC CACTGCGCAG CGGCGGCGTG AAGCCCGCGC GGCCGACCCT GGCGCAGGCC GAGGCCGAGC GCGCCCGGCA GCGCGCGGAA AGCACGCCCC CACCGCGGCC CGAGCGGGCC GACGCCCCGC CTCGCGCCGG CCGCGCGGCA CCCGCCGCGC CCGGCCGTCA GCGCAGCGAC GCACCACGCC CCTCATCCCG CACGGAGGCC GGCCAGCCCC CGCGCGGCCC CGGTCGGCCC GGTGCCGAAC GCTCGCCGCG TGACCCGAAC CGCACCTCCG AGCGCCCCAC CTTGCGCGAT CCGGCCCGCG AGCCGAGCTC AGACCGCCGC CCACCGCGTA CAAACGACAG CGGTGCGCCG CGCCCCTCGG CCCGGCCCTC GCAGCGCCCA CCGCCCCGCC CTGCCCAGGC CCGCACCACC GGCGGCGCGC CGCCCGAGGA GCTCAACCCG CGCCTCTCCA AGCGCATGAG CGAACTCGGC CTCGCCTCGC GTCGCGAGGC CGACGAGTGG ATCGAGCAAG GCTACGTGCG CGTCGACGGC GAGGTGGTCG ACCAGCTCGG CGCCCGCGTG CGGCCCGAGC AGCAGATCAC CATCGACCCG CAGGCCAGGC TGGAGCAGGC GCAGCGCGTG ACCATCCTGC TGCACAAGCC GCTCGGCTAC GTGAGCGGCC AGGCCGAGGA CGGCCACGCG CCGGCCTTCA CGCTGGTCAC CGCCGCCAAC CGCTGGGCGA CCGACGGCAG CAAGCAGCGC TTCAACGCCA GCCAGCTCAA GCACCTGGTG CCGGCCGGCC GCCTGGACCT CGACTCCACC GGCCTGCTGG TGCTGACCCA GGACGGCCGC GTCGCCAAGC TGCTGATCGG CGAGGACAGC CCGGTCGAGA AGGAGTACGT GGTGCGCGTG CAGTGGACCG CGCGGCCCGA GCTCACCGAC CTGAAGCAGC ACTTCCCGCC CGAGGCCCTG GCGCGGCTGC GCCACGGGCT CGCGCTCGAC GGCGAGAAGC TCAAGCCCGC CAAGGTGTCG TGGCAGAACG AGCAGCACCT GCGCTTCGTG CTGCGCGAGG GCAAGAAGCG CCAGATCCGC CGCATGTGCG AGCTGGTCGG CCTGAAGGTC GAGTCGCTCA AGCGCATCCG TATCGGCCGC GTCGGCCTGG GCGAGCTGCC GCCGGGGCAG TGGCGCTACC TCGGGCCGTT CGAGAACTTC CTGTGA
|
Protein sequence | MATLKLKKPA PGAPSSPATV RRAPLRSGGV KPARPTLAQA EAERARQRAE STPPPRPERA DAPPRAGRAA PAAPGRQRSD APRPSSRTEA GQPPRGPGRP GAERSPRDPN RTSERPTLRD PAREPSSDRR PPRTNDSGAP RPSARPSQRP PPRPAQARTT GGAPPEELNP RLSKRMSELG LASRREADEW IEQGYVRVDG EVVDQLGARV RPEQQITIDP QARLEQAQRV TILLHKPLGY VSGQAEDGHA PAFTLVTAAN RWATDGSKQR FNASQLKHLV PAGRLDLDST GLLVLTQDGR VAKLLIGEDS PVEKEYVVRV QWTARPELTD LKQHFPPEAL ARLRHGLALD GEKLKPAKVS WQNEQHLRFV LREGKKRQIR RMCELVGLKV ESLKRIRIGR VGLGELPPGQ WRYLGPFENF L
|
| |