Gene Mpe_A2588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2588 
Symbol 
ID4787025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2761499 
End bp2762743 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content71% 
IMG OID640091159 
Productpyrroloquinoline quinone biosynthesis protein PqqE 
Protein accessionYP_001021777 
Protein GI124267773 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR02109] coenzyme PQQ biosynthesis protein E 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0973479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCACG CCGCCGCCCC CGCCGCTTCG CCCGCCACCC CGCAGCCCCC CGGCGCGCCG 
ATGTGGCTGC TCGCGGAGCT CACCTACAAG TGCCCGCTGC ACTGCGTGTT CTGCTCCAAC
CCCACGAACT ACGCCGACCA CCTGGGCGAG ATCGGCACCG AGGACTGGAA GCGCGTGTTC
CGCGAGGCGC GGCAGATGGG CGCGGTGCAG CTCGGCTTCT CGGGCGGCGA GCCGCTGCTG
CGCGACGACC TGGAAGAGCT GGTGGCCGAA GCGCGCCAGC TCGGCTACTA CACCAACCTC
ATCACCTCGG GCATCGGCCT CACCGAAAAG CGCGCCCGCG CGCTGAAGGA CGCCGGGCTC
GACCACATCC AGCTCAGCTT CCAGGACTCC ACCAAGGAGC TGAACGACTT CCTGTCGTCC
ACCCGCACCT TCGACCACAA GAACAAGGTG GCCGCGATCA TCAAGTCGCT CGGCTACCCG
ATGGTGCTGA ACTGCGTGAT GCACCGCTAC AACCTGCCGC ACGTGGGCCG CATCATAGAG
ATGGCCGAGG CCATGGGCGC CGACTACCTG GAGCTGGCCA ACACCCAGTA CTACGGCTGG
GCCTGGCTCA ACCGCGCGGC GCTGATGCCC ACGCCCGACG AACTGCGCGC GGCCGAGGCC
ATCGTCGACA GCCACCGCGA GCGGCTGGCC GGCCGCACCA AGATCCTGTG GGTCTCGCCC
GACTACGTGG ACGCCAAGCC CAAGCCCTGC ATGGCCGGCT GGGGCGCGGT GTTCATGGTC
ATCGCGCCCG ACGGCACAGC CCTGCCCTGC CACAGCGCGC GCATGCTGCC GGGCTTCGAC
TTTCCCAAGG TCACCGAGCA CAGCATCGCC GGCATCTGGC GCGACAGCGA CGCCTTCAAC
CGCTACCGCG GCACGGCGTG GATGAGCGAC ACCTGCCTGA GCTGCGACCA GCACCCGGTC
GACCACGGCG GCTGCCGCTG CCAGGCCTTC CTGGTCTCCG GCGACGCGGC GGCCACCGAC
CCGGTCTGCC CCAAGAGCCC CGACCGCCCG CTGATCGACG CCGCGCTGCA GGCCGCCGTC
GAGCAGGCCG AGGCCGCGCC CGCGACCCAG CCGCTGCGCT TCGTGCCCGG TGCGGCGCGC
AGCGGCAACC TCTGGTACCG CACCGACGCG AACTCGCGCA CGCTGTCCGA CGACGGGCAC
GGCCCGCAGC ACGACACCGC GCCCGCCGCC GCCGAGACGC GGTGA
 
Protein sequence
MSHAAAPAAS PATPQPPGAP MWLLAELTYK CPLHCVFCSN PTNYADHLGE IGTEDWKRVF 
REARQMGAVQ LGFSGGEPLL RDDLEELVAE ARQLGYYTNL ITSGIGLTEK RARALKDAGL
DHIQLSFQDS TKELNDFLSS TRTFDHKNKV AAIIKSLGYP MVLNCVMHRY NLPHVGRIIE
MAEAMGADYL ELANTQYYGW AWLNRAALMP TPDELRAAEA IVDSHRERLA GRTKILWVSP
DYVDAKPKPC MAGWGAVFMV IAPDGTALPC HSARMLPGFD FPKVTEHSIA GIWRDSDAFN
RYRGTAWMSD TCLSCDQHPV DHGGCRCQAF LVSGDAAATD PVCPKSPDRP LIDAALQAAV
EQAEAAPATQ PLRFVPGAAR SGNLWYRTDA NSRTLSDDGH GPQHDTAPAA AETR