Gene Mpe_A2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2003 
Symbol 
ID4783790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2146031 
End bp2147035 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content73% 
IMG OID640090573 
Productribosomal large subunit pseudouridine synthase D 
Protein accessionYP_001021196 
Protein GI124267192 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.704766 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.26865 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCACCG AGCCGGCCGA GGCCGACGAG CCCGCCGAAC CGCTGGCGGG CGATGCCGAG 
CAACGCCAGG CGCGGGTCGA TGCCGGCGCC CACGGCGAGC GTCTCGATCG CTGGCTGGCG
ACGCTGGCGG GCGAGTTCTC CCGCAACCAC CTGCAGACGC TGATCGAGGG CGGGCAGGTC
CGCATCGACG GGCAGACCGT CACCGCCCGC GCACGGCGCG TCAGTGCGGG CCAGCAGATC
GAGATCACGC TGCTGCCCAC GCCGGAGAGC CAGGCCTTCC GGCCCGAGCC GGTGGCGCTG
CCGGTGGTGT TCGAGGACGC CCATCTGCTG GTCATCGACA AGCCGGCCGG TCTGGTGGTG
CACCCGGCGC CCGGCAATTG GTCGGGCACC GTGCTCAACG GCCTGCTGGC GCATCACGCC
GGCGCGGCGG CGCTGCCGCG GGCCGGCATC GTGCACCGGC TCGACAAGGA CACCAGCGGC
CTGATGGTGG TGGCCAAGAC GCTGCCGGCC ATGACGTCAC TGGTGCGTGC GATCGCCGCG
CGGGAGGTGC ATCGCGAGTA CCTGGCCCTG GTGCACGGTG TGCTGCGCCA GGCGGTCTTC
AGCGTCGACG CGCCGATCGG TCGCGATCCG GTGTCGCGCA TCAAGATGGC CGTGCTGGCG
GGCGGCCGGA CGGCGCGCAC CGATGTCGAG TTGATCGCGG CGCGCGACGC CGTCAGCGCG
GTGCGCTGCA CGCTCCACAC CGGCCGCACG CACCAGATCC GGGTGCACAT GGCCTCGAGA
GGACATGCCC TGTTGGCCGA CGCGCTGTAC GGTGGCCGAC CCGGACTCGG CCTGACGCGA
CAGGCGCTGC ATGCGCACCG GCTGGGCTTC GTCCATCCGG CGAGCCTGCA GCCAATCGAG
TTCAGCGCCG TCCCGCCCGA CGACCTGCGC GCGGCCTGGC AGCAGATCGT GGGCGGGTCG
CACGCGCCCG GCCACGATAC AGATACAATC GTCGTGCAGG GCTGA
 
Protein sequence
MATEPAEADE PAEPLAGDAE QRQARVDAGA HGERLDRWLA TLAGEFSRNH LQTLIEGGQV 
RIDGQTVTAR ARRVSAGQQI EITLLPTPES QAFRPEPVAL PVVFEDAHLL VIDKPAGLVV
HPAPGNWSGT VLNGLLAHHA GAAALPRAGI VHRLDKDTSG LMVVAKTLPA MTSLVRAIAA
REVHREYLAL VHGVLRQAVF SVDAPIGRDP VSRIKMAVLA GGRTARTDVE LIAARDAVSA
VRCTLHTGRT HQIRVHMASR GHALLADALY GGRPGLGLTR QALHAHRLGF VHPASLQPIE
FSAVPPDDLR AAWQQIVGGS HAPGHDTDTI VVQG