Gene Mpe_A3107 details


Gene Information

Locus tag: Mpe_A3107
Symbol:
ID: 4786680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3306727
End bp: 3307839
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 66%
IMG OID: 640091678
Product: 3-dehydroquinate synthase
Protein accession: YP_001022295
Protein GI: 124268291
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
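
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
START_BP = 3306727
END_BP = 3307839


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1113 370
```

Under those assumptions the computed values agree with the reported Gene Length (1113 bp) and Protein Length (370 aa).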


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.117886
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATCGA TCTCTGTGAC TCCTCTCTCC GGCATCGAGA CCATCGATAT CGCCCTCGGC 
GAGCGGAGCT ATCCCATACG CATCGGCTCC GGCTTGCTGC GCGCGCCCGA AAGTTTTGCA
GGCGTCCCGC GCAGCCCGTT GGCCGTCATC GTCAGCAACA CCACGGTGGC GCCGCTCTAT
GCGCAGTCCC TGCGGGACGC TCTGCGCGGG AGGCACGCGC AGGTCGAGCT CATCACGTTG
CCCGACGGCG AAAGCCACAA GGATTGGGCC GCGCTGAACC TGATCTTCGA CGCGCTGCTG
GCCAGAGGCG CCGATCGGAA GACGATCCTC TATGCGCTGG GCGGTGGGGT GGTCGGCGAC
ATGACCGGCT TCGCCGCAGC CAGCTACATG CGCGGGGTGC CCTTCGTTCA GGTCCCCACC
ACGCTGCTGG CGCAGGTCGA TTCCTCCGTG GGTGGCAAGA CGGGTATCAA CCACCCGCGC
GGTAAGAACA TGATCGGCGC GTTCCATCAG CCGGTCTGCG TCGTCGTCGA TCTGGAGACG
CTGAGCACGC TGCCGATGCG GGAGTTGCGC GCCGGCCTGG CCGAAGTCAT CAAGTACGGG
CCGATCGCCG ATGCGAGCTT CCTGGGTTGG GTCGAAGCCA ACCTGGATGC ATTGCTTGCT
CGCGATGTGG CCACCCTGCG CCATGCCGTG CGACGGTCGT GCGAGATCAA GGCGGCCGTC
GTCGGTCAGG ACGAGCGCGA GGCCGGCTTG CGGGCCATTC TCAATTTCGG CCATACCTTC
GGTCATGCGA TCGAGGCAGG TCTGGGTTAC GGAGAGTGGC TCCATGGTGA GGCTGTCGGT
TGCGGCATGG CGATGGCAGC CGAAACCTCG GCGCGACTGG GCCTGCTGCC CGAGGGGGAC
GCGGAGCGCC TGATCCGGCT CATCGATCGT GCGGGTTTGC CGGTGAAGGG GCCGGACCTG
GGGGCGGATC GCTATCTCGA GCTCATGCGC CTCGACAAGA AGGCGGAAGC CGGCGAAATC
AAGTTCGTGC TGCTCGACGC CATCGGGCAT GCCGTGCTGC GCAGCGTGCC GGATGCAACC
TTGCGTCAGG TCCTCGCTTC GCGCTGTACG TGA
 
Protein sequence
MSSISVTPLS GIETIDIALG ERSYPIRIGS GLLRAPESFA GVPRSPLAVI VSNTTVAPLY 
AQSLRDALRG RHAQVELITL PDGESHKDWA ALNLIFDALL ARGADRKTIL YALGGGVVGD
MTGFAAASYM RGVPFVQVPT TLLAQVDSSV GGKTGINHPR GKNMIGAFHQ PVCVVVDLET
LSTLPMRELR AGLAEVIKYG PIADASFLGW VEANLDALLA RDVATLRHAV RRSCEIKAAV
VGQDEREAGL RAILNFGHTF GHAIEAGLGY GEWLHGEAVG CGMAMAAETS ARLGLLPEGD
AERLIRLIDR AGLPVKGPDL GADRYLELMR LDKKAEAGEI KFVLLDAIGH AVLRSVPDAT
LRQVLASRCT
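
The relationship between the gene sequence, the protein sequence and the GC content field can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, the codon table is built from the standard amino-acid assignments (which translation table 11 shares for internal codons), and alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
# Standard codon assignments in TCAG order; table 11 uses the same
# assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_bases = "ATGTCATCGA TCTCTGTGAC TCCTCTCTCC"
print(translate(first_bases))  # MSSISVTPLS
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the protein sequence and the GC content field listed in the record.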