Gene Mpe_A1022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1022 
Symbol 
ID4785624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1089337 
End bp1091604 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content72% 
IMG OID640089584 
Productcyanophycin synthetase 
Protein accessionYP_001020219 
Protein GI124266215 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR02068] cyanophycin synthetase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCATG GAATGCGGCC CGGAGCGTGG TGCCAAGGAG CCGGTCGGGC GCACAATCGT 
GCGTCGGTCC TGGGCTGCCC GGGCGCCTCG TCCCCGCCGA AAATGAGCAA GAAAAACGAC
CTGCAACTGC TCCGCATCAA TTACCTGCGC GGCCCCAACA TCTGGACCTA CCGTCCGGTG
CTGGAAACCT GGCTCGACCT CGGCGAGCTC GAAGACCACC CCTCCCATCT GTTGCCCGGC
TTCAACGAGC GCCTGTGCGC GCTGCTGCCG GCGCTCGAGG AGCACCACTG CGGCGTGGGC
GAGCGGGGCG GGTTCCTGCA GCGCCTGCGG GAGGGCACCT GGTGCGGCCA CGTGCTGGAG
CACGTGGTCA TCGAGCTCCT CAACCTGGCG GGCATGCCCA CCGGTTTCGG CCAGACTCGC
AGCACGTCGC AGCGGGGTGT CTACCGGATG GTGTTTCGCG CCCGCGAGGA GCAGGTCGCG
CGCGCGGCGC TGGCCGAAGG CCACCGGCTG CTGATGGCCG CCATCAACGA TGAGGCCTTC
GATGTCGCTG CGGCCGTGGC CCGCGTGCGC GAGCAGGTCG ACGACACCTA TTTCGGGCCC
AGCACCGCCG CCATCGTCGG CGCCGCCACC GAACGTGGCA TCCCGCACAT CCGCCTCAAT
GACGGCAACC TGGTGCAGCT CGGCCACGGT GCCCGCCAGC GCCGCATCTG GACCGCCGAG
ACCGACCGCA CCAGCGCCAT CGCCGAGGGC ATCGCCGGCG ACAAGGACCT GACCAAGCGT
CTGCTCAAGT CCTGCGGCGT GCCGGTGCCG GAGGGTGAGG TCGTCGACAG CGTCGAGGCC
GCCTGGGAGG CGGCGCAGGA CATCGGCCTG CCGGTGGCAC TGAAGCCGAC CGACGGCAAC
CACGGCCGCG GCGTCACGCT GGACCTGACC AGCCGCGAGG ACATCGAGGC CGCGTATGCC
TATGCCGACC TGCACGGCAG CGAGGTCATG GTCGAGCGCC ACGTGCCCGG CCAGGAGCAC
CGCCTGCTGG TGGTGGGCGG CCGGGTGGTG GCCGCAGCGC GCGGCGAGAC CGCCTGGGTC
ACCGGTGACG GCCGTTCCAC GGTGAGTGAG CTGGTCGACG CCCAGATCAA CACCGACCCG
AGGCGCGGCG AGACCGAGGA CTTCCCGCTC GGCCTGATCG AGACCGACAA GGACGGCGCG
GTGCTGTCCG ACCTGCAGCG CCAGGGGCTC GCGCCCGATG CGGTGCCGGC CAGCGGCCGC
CGCGTGCTGA TCCAGCGCAA CGGCAACGTC GCGATCGACT GCAGCGACCA GGTCCACCCC
GAGGTCGACC ACATCGTGTC GCTCGCCGCG CGCGTGGTGG GCCTGGACAT CGCCGGCGTC
GACGTCGTGG CCGAGGACAT CTCGCGCCCG CTGGCCGAGC AGGGCGGCGC GATCGTCGAG
GTCAACGCCG GCCCCGGGCT GCTGATGCAC CTGCGGCCGG CCGAGGGCAT GCCGCGGCCG
GTCGGCCAAG CCATCATCGA CCACCTGTTC GCCGAAACGG AAAGCGGGCG CATTCCCATC
GTCGGCGTGG CCGGGACCCG AGGCACCCAC ACCATCGCCC GGCTGGTGGC CTGGCTGGTG
CACCTGAGCG GCCGCCACGT CGGGCTGGCC TGCCGCGACG GCCTGTTCCT CGGCGGACGC
CGGATCGAGC ACGGCGACTG CGCGCACTGG GAAGCCAGCC ACCGGCTGCT GATCAACCGG
CAGGTGGAGG TGGCGGTGGT GGAGAACGGC GCCGAGGCCA TCCTGCGCGA CGGCCTCGCC
TACGACCGCT GCCAGGTCGG CGTCGTCACC GACCTCGACG GCGCCGCGGC GCTGACGGCG
TTCGACATTC GCGAATCCGA CCAGATGCTC AAGGTGCTGC GCACCCAGGT CGACGTGGTA
CTGCCGGACG GCGTGGCGGT GCTCAATGCC GACGACGAGC GGGTGGCCGA ACTGGCCGGC
CTGTGCGACG GCGAGGTGAT CCTCTACGGC GTCGACGCAG CCACACCGGT CCTGCAGGCG
CAGCGCGCCC AGGGCGGGCG AGCCGTGTTC CTGCGCCACG GCCGCGCGAT CCTGGCCACC
GGCGGGGTCG AGACGCAGGG ACCCGAATTC ACCCGTGGCC GCATGGCCGA GGTGCCGCCG
GAGACCATGC TGGCGGCCAT CGCCGCCGCC TGGGCGCTGG GGATCGCGCC CGAACTGGCG
GCCGCCGGCA TCGAGACCTT CCAGCTCGAA CCGAAGACCA CGCACTGA
 
Protein sequence
MPHGMRPGAW CQGAGRAHNR ASVLGCPGAS SPPKMSKKND LQLLRINYLR GPNIWTYRPV 
LETWLDLGEL EDHPSHLLPG FNERLCALLP ALEEHHCGVG ERGGFLQRLR EGTWCGHVLE
HVVIELLNLA GMPTGFGQTR STSQRGVYRM VFRAREEQVA RAALAEGHRL LMAAINDEAF
DVAAAVARVR EQVDDTYFGP STAAIVGAAT ERGIPHIRLN DGNLVQLGHG ARQRRIWTAE
TDRTSAIAEG IAGDKDLTKR LLKSCGVPVP EGEVVDSVEA AWEAAQDIGL PVALKPTDGN
HGRGVTLDLT SREDIEAAYA YADLHGSEVM VERHVPGQEH RLLVVGGRVV AAARGETAWV
TGDGRSTVSE LVDAQINTDP RRGETEDFPL GLIETDKDGA VLSDLQRQGL APDAVPASGR
RVLIQRNGNV AIDCSDQVHP EVDHIVSLAA RVVGLDIAGV DVVAEDISRP LAEQGGAIVE
VNAGPGLLMH LRPAEGMPRP VGQAIIDHLF AETESGRIPI VGVAGTRGTH TIARLVAWLV
HLSGRHVGLA CRDGLFLGGR RIEHGDCAHW EASHRLLINR QVEVAVVENG AEAILRDGLA
YDRCQVGVVT DLDGAAALTA FDIRESDQML KVLRTQVDVV LPDGVAVLNA DDERVAELAG
LCDGEVILYG VDAATPVLQA QRAQGGRAVF LRHGRAILAT GGVETQGPEF TRGRMAEVPP
ETMLAAIAAA WALGIAPELA AAGIETFQLE PKTTH