Gene Mpe_A2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2033 
Symbol 
ID4784253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2175687 
End bp2176952 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content76% 
IMG OID640090603 
Productputative arginine/proline rich protein 
Protein accessionYP_001021226 
Protein GI124267222 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases 
TIGRFAM ID[TIGR00093] pseudouridine synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0619869 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACCC TCAAGCTCAA GAAGCCGGCG CCCGGCGCGC CCTCCTCACC CGCGACCGTC 
CGCCGCGCGC CACTGCGCAG CGGCGGCGTG AAGCCCGCGC GGCCGACCCT GGCGCAGGCC
GAGGCCGAGC GCGCCCGGCA GCGCGCGGAA AGCACGCCCC CACCGCGGCC CGAGCGGGCC
GACGCCCCGC CTCGCGCCGG CCGCGCGGCA CCCGCCGCGC CCGGCCGTCA GCGCAGCGAC
GCACCACGCC CCTCATCCCG CACGGAGGCC GGCCAGCCCC CGCGCGGCCC CGGTCGGCCC
GGTGCCGAAC GCTCGCCGCG TGACCCGAAC CGCACCTCCG AGCGCCCCAC CTTGCGCGAT
CCGGCCCGCG AGCCGAGCTC AGACCGCCGC CCACCGCGTA CAAACGACAG CGGTGCGCCG
CGCCCCTCGG CCCGGCCCTC GCAGCGCCCA CCGCCCCGCC CTGCCCAGGC CCGCACCACC
GGCGGCGCGC CGCCCGAGGA GCTCAACCCG CGCCTCTCCA AGCGCATGAG CGAACTCGGC
CTCGCCTCGC GTCGCGAGGC CGACGAGTGG ATCGAGCAAG GCTACGTGCG CGTCGACGGC
GAGGTGGTCG ACCAGCTCGG CGCCCGCGTG CGGCCCGAGC AGCAGATCAC CATCGACCCG
CAGGCCAGGC TGGAGCAGGC GCAGCGCGTG ACCATCCTGC TGCACAAGCC GCTCGGCTAC
GTGAGCGGCC AGGCCGAGGA CGGCCACGCG CCGGCCTTCA CGCTGGTCAC CGCCGCCAAC
CGCTGGGCGA CCGACGGCAG CAAGCAGCGC TTCAACGCCA GCCAGCTCAA GCACCTGGTG
CCGGCCGGCC GCCTGGACCT CGACTCCACC GGCCTGCTGG TGCTGACCCA GGACGGCCGC
GTCGCCAAGC TGCTGATCGG CGAGGACAGC CCGGTCGAGA AGGAGTACGT GGTGCGCGTG
CAGTGGACCG CGCGGCCCGA GCTCACCGAC CTGAAGCAGC ACTTCCCGCC CGAGGCCCTG
GCGCGGCTGC GCCACGGGCT CGCGCTCGAC GGCGAGAAGC TCAAGCCCGC CAAGGTGTCG
TGGCAGAACG AGCAGCACCT GCGCTTCGTG CTGCGCGAGG GCAAGAAGCG CCAGATCCGC
CGCATGTGCG AGCTGGTCGG CCTGAAGGTC GAGTCGCTCA AGCGCATCCG TATCGGCCGC
GTCGGCCTGG GCGAGCTGCC GCCGGGGCAG TGGCGCTACC TCGGGCCGTT CGAGAACTTC
CTGTGA
 
Protein sequence
MATLKLKKPA PGAPSSPATV RRAPLRSGGV KPARPTLAQA EAERARQRAE STPPPRPERA 
DAPPRAGRAA PAAPGRQRSD APRPSSRTEA GQPPRGPGRP GAERSPRDPN RTSERPTLRD
PAREPSSDRR PPRTNDSGAP RPSARPSQRP PPRPAQARTT GGAPPEELNP RLSKRMSELG
LASRREADEW IEQGYVRVDG EVVDQLGARV RPEQQITIDP QARLEQAQRV TILLHKPLGY
VSGQAEDGHA PAFTLVTAAN RWATDGSKQR FNASQLKHLV PAGRLDLDST GLLVLTQDGR
VAKLLIGEDS PVEKEYVVRV QWTARPELTD LKQHFPPEAL ARLRHGLALD GEKLKPAKVS
WQNEQHLRFV LREGKKRQIR RMCELVGLKV ESLKRIRIGR VGLGELPPGQ WRYLGPFENF
L