Gene Mpe_A3181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3181 
Symbol 
ID4786578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3381524 
End bp3382720 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content64% 
IMG OID640091753 
Productputative cytochrome C oxidase polypeptide II precursor 
Protein accessionYP_001022369 
Protein GI124268365 
COG category[C] Energy production and conversion 
COG ID[COG1622] Heme/copper-type cytochrome/quinol oxidases, subunit 2
[COG2010] Cytochrome c, mono- and diheme variants 
TIGRFAM ID[TIGR02866] cytochrome c oxidase, subunit II 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.8149 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.584186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACGA TCAACGCCCT TAGCAACGCC GCGAGCGCCC TCCGACGGCA GGCCATGGCG 
ACCGGTCTTG GACTGGCGGC CACGCTGTAC ACGACGGCGG CTCTGGCCGT CAACGACCTG
CCGGGCGGTC CGGCCGTGAA TCAGCTCGAC CTGCACCCGC CGGTGACGCG CATTGCGGCT
GAACAGCAAT GGCTGCACTA CTTCATGCTC GTGATCTGCA TGGTCATCTT CGTCGCCGTG
TTCGGCGTCA TGTTCTATTC GATCTTCAAG CACCGTCGCT CCAAGGGGGC GAAGCCGGCC
AACTTCCACG AATCGACCAC GGTCGAGATC ATCTGGACCG TCGTGCCGTT CTTCATCGTG
ATCCTGATGG CGCTGCCCGC CACCAAGGTC GTGGTCGCGA TGAAGGACAC CACCAACGCC
GACCTGACCA TCAAGGCCAC CGGCTACCAG TGGAAGTGGG GCTACGACTA CCTCAAGGGT
GAGGGTGAGG GGATCGCCTT CGTCTCCACG CTCGATACCT CGCATCGCCT GATGTCGGAC
AGCGGCAAGC CCGAACCGAC CGACGACTAC CTGCTCAAGG TCGACAACCC GCTGGTGGTG
CCTGTCGACA AGAAGGTGCG CATCATCACC ACTGCCAACG ACGTGATCCA CGCCTTCATG
GTGCCGGCCT TCGGCATCAA GCAGGATGCG ATCCCCGGCT TCGTGCGCGA CACCTGGTTC
CGCGCCGAGA AGACCGGTGA CTTCTACGGC CAGTGCGCCG AACTTTGCGG CAAGGAGCAC
GCCTACATGC CGATCCACGT GAAGGTGCTG TCGCAGGCCG ACTACGCGGT GTGGGTGGAA
GGCGAGAAGA AGAAGCTGGC CGCCAAAGCC GACGATCCGG CCAAGGTCTG GGAACTGCCC
GAACTCGTGG CCCGCGGCGA GAAGGTCTAT GCTGCCAACT GCGCTGCCTG CCACCAGGCG
AGCGGCAAGG GCGCGGGCGC GATCAAGCCG ATCGACGGTG CCGCCGTGGT GCTCGATGCC
GACAAGACCA AGCAGATCGC GATCCTGCTC AACGGCCAGA ACAATGGTGC GATGCCCGCC
TGGAAGCACC TGTCGGACAC GGAGATCGCC GCCGTCATCA CCTACACCAA GAACCACTGG
TCGAACGCGA CCGGTCAGAT CGTGCAGCCG GCCGACGTGC TCGCCGCTCG CAAGTAA
 
Protein sequence
MKTINALSNA ASALRRQAMA TGLGLAATLY TTAALAVNDL PGGPAVNQLD LHPPVTRIAA 
EQQWLHYFML VICMVIFVAV FGVMFYSIFK HRRSKGAKPA NFHESTTVEI IWTVVPFFIV
ILMALPATKV VVAMKDTTNA DLTIKATGYQ WKWGYDYLKG EGEGIAFVST LDTSHRLMSD
SGKPEPTDDY LLKVDNPLVV PVDKKVRIIT TANDVIHAFM VPAFGIKQDA IPGFVRDTWF
RAEKTGDFYG QCAELCGKEH AYMPIHVKVL SQADYAVWVE GEKKKLAAKA DDPAKVWELP
ELVARGEKVY AANCAACHQA SGKGAGAIKP IDGAAVVLDA DKTKQIAILL NGQNNGAMPA
WKHLSDTEIA AVITYTKNHW SNATGQIVQP ADVLAARK