Gene Information |
Locus tag | Mpe_A2437 |
Symbol | |
ID | 4784273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 2596770 |
End bp | 2597837 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640091007 |
Product | diheme cytochrome c SoxD |
Protein accession | YP_001021627 |
Protein GI | 124267623 |
COG category | [C] Energy production and conversion |
COG ID | [COG4654] Cytochrome c551/c552 |
TIGRFAM ID | |
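The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon. Both are assumptions, though they are consistent with the listed gene and protein lengths. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2596770
end_bp = 2597837

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# One codon per 3 bp; the last codon is assumed to be a stop codon,
# so it does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1068 0 355
```

The result agrees with the listed gene length of 1068 bp and protein length of 355 aa.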
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGCT GGCGTGAACT CGCGGTGCTC GCCGCACTGA CGGTCAGCGG CGCCTCGCTG GCGCAGGGGA CGGTCTATGA CGGCATCGGG CGCCCCGCCA CGGCCGGGGA GATCGCCGCC TGGGACATCG ACGTGCGGCC GGACTTCAAG GGACTGCCCA AGGGATCGGG TTCGGTCGCC AGGGGCCAGG GGGTCTGGGA GAGCAAGTGC GCGTCCTGCC ACGGCATCTT CGGCGAATCC GGCGAGGTGT TCAATCCGCT GGTCGGCGGC ACCACCCAGG CCGATGTCGA GGCCGGTCGC GTCGCGCGCC TGACCGACTC GAGCTTCCCC GGCCGCACCA CGTTGATGAA GGCGGCCCAC CTGTCGACGC TGTGGGACTA CATCAACCGG GCCATGCCCT GGGACAACCC GAAGTCGCTG GCGACCGAGG AGGTCTACGC CGTCACTGCC TACCTGCTGA ACCTCGGCGG CGTGGTGCCC GACGACTTCG TGCTCTCGGA CCGCAATGCC GCCGAAGTCC AGCAGCGGAT GCCCAATCGC AAGGGCCTGA CCACGCAGCA CGGGCTGTGG CCCGGCCCCG AGTTCGGCGG CACCGGCAAG CCCGACGTGC AGGGGTCCGG CTGCATGCGC AACTGCGGCG GCGAACCGCG GCTGGCCTCC TCGCTGCCCG AGTTCGCGCG CGATGCGCAT GGCAATCTTG CCGACCAGAA CCGGACCGTC GGCGCGCAGC GCGGCGCGGA CACGACGCGG CCCGATGCCG CCGCGAAGTC GGCGCCTGTG CCGGCGCGCG CCGCCGCGAA CGGCACCGGC AACGCCGCGT TCGCGCTGAC CAGTTCGAAC GCCTGCACGG CGTGCCATTC GCTCGACAGC AAGGGCCTGG GCCCGTCCTT CCGGCAGATT GCCCAGAAGT ACGCCGGACG CGCCGACGGG GTCGACTACC TGACCGGCAA GATCCGGAGC GGCGGCGGCG GTGTGTGGGG CGGTGCCATG GCGATGCCGC CGCAGGCGCT GCCCGAGGCC GATGCCCGGA CGATCGCCGC CTGGCTCGCC GCCGGGGCCC CCAAGTAA
|
Protein sequence | MSSWRELAVL AALTVSGASL AQGTVYDGIG RPATAGEIAA WDIDVRPDFK GLPKGSGSVA RGQGVWESKC ASCHGIFGES GEVFNPLVGG TTQADVEAGR VARLTDSSFP GRTTLMKAAH LSTLWDYINR AMPWDNPKSL ATEEVYAVTA YLLNLGGVVP DDFVLSDRNA AEVQQRMPNR KGLTTQHGLW PGPEFGGTGK PDVQGSGCMR NCGGEPRLAS SLPEFARDAH GNLADQNRTV GAQRGADTTR PDAAAKSAPV PARAAANGTG NAAFALTSSN ACTACHSLDS KGLGPSFRQI AQKYAGRADG VDYLTGKIRS GGGGVWGGAM AMPPQALPEA DARTIAAWLA AGAPK
|
| |
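The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The following is a minimal sketch, not the database's own method, of how the GC content and protein sequence fields could be recomputed from it. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon table uses the standard amino-acid assignments, which translation table 11 shares; table 11's alternative start codons are not handled here, which does not matter for this gene because it starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces that separate 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases (the sequence must be non-empty)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGTCCAGCT GGCGTGAACT CGCGGTGCTC"
print(translate(first_blocks))  # prints: MSSWRELAVL
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.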