Gene Mpe_A2437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2437 
Symbol 
ID4784273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2596770 
End bp2597837 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content72% 
IMG OID640091007 
Productdiheme cytochrome c SoxD 
Protein accessionYP_001021627 
Protein GI124267623 
COG category[C] Energy production and conversion 
COG ID[COG4654] Cytochrome c551/c552 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGCT GGCGTGAACT CGCGGTGCTC GCCGCACTGA CGGTCAGCGG CGCCTCGCTG 
GCGCAGGGGA CGGTCTATGA CGGCATCGGG CGCCCCGCCA CGGCCGGGGA GATCGCCGCC
TGGGACATCG ACGTGCGGCC GGACTTCAAG GGACTGCCCA AGGGATCGGG TTCGGTCGCC
AGGGGCCAGG GGGTCTGGGA GAGCAAGTGC GCGTCCTGCC ACGGCATCTT CGGCGAATCC
GGCGAGGTGT TCAATCCGCT GGTCGGCGGC ACCACCCAGG CCGATGTCGA GGCCGGTCGC
GTCGCGCGCC TGACCGACTC GAGCTTCCCC GGCCGCACCA CGTTGATGAA GGCGGCCCAC
CTGTCGACGC TGTGGGACTA CATCAACCGG GCCATGCCCT GGGACAACCC GAAGTCGCTG
GCGACCGAGG AGGTCTACGC CGTCACTGCC TACCTGCTGA ACCTCGGCGG CGTGGTGCCC
GACGACTTCG TGCTCTCGGA CCGCAATGCC GCCGAAGTCC AGCAGCGGAT GCCCAATCGC
AAGGGCCTGA CCACGCAGCA CGGGCTGTGG CCCGGCCCCG AGTTCGGCGG CACCGGCAAG
CCCGACGTGC AGGGGTCCGG CTGCATGCGC AACTGCGGCG GCGAACCGCG GCTGGCCTCC
TCGCTGCCCG AGTTCGCGCG CGATGCGCAT GGCAATCTTG CCGACCAGAA CCGGACCGTC
GGCGCGCAGC GCGGCGCGGA CACGACGCGG CCCGATGCCG CCGCGAAGTC GGCGCCTGTG
CCGGCGCGCG CCGCCGCGAA CGGCACCGGC AACGCCGCGT TCGCGCTGAC CAGTTCGAAC
GCCTGCACGG CGTGCCATTC GCTCGACAGC AAGGGCCTGG GCCCGTCCTT CCGGCAGATT
GCCCAGAAGT ACGCCGGACG CGCCGACGGG GTCGACTACC TGACCGGCAA GATCCGGAGC
GGCGGCGGCG GTGTGTGGGG CGGTGCCATG GCGATGCCGC CGCAGGCGCT GCCCGAGGCC
GATGCCCGGA CGATCGCCGC CTGGCTCGCC GCCGGGGCCC CCAAGTAA
 
Protein sequence
MSSWRELAVL AALTVSGASL AQGTVYDGIG RPATAGEIAA WDIDVRPDFK GLPKGSGSVA 
RGQGVWESKC ASCHGIFGES GEVFNPLVGG TTQADVEAGR VARLTDSSFP GRTTLMKAAH
LSTLWDYINR AMPWDNPKSL ATEEVYAVTA YLLNLGGVVP DDFVLSDRNA AEVQQRMPNR
KGLTTQHGLW PGPEFGGTGK PDVQGSGCMR NCGGEPRLAS SLPEFARDAH GNLADQNRTV
GAQRGADTTR PDAAAKSAPV PARAAANGTG NAAFALTSSN ACTACHSLDS KGLGPSFRQI
AQKYAGRADG VDYLTGKIRS GGGGVWGGAM AMPPQALPEA DARTIAAWLA AGAPK