Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0399 |
Symbol | |
ID | 4785149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 435819 |
End bp | 436826 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640088954 |
Product | photosystem II stability/assembly factor-like protein |
Protein accession | YP_001019596 |
Protein GI | 124265592 |
COG category | [R] General function prediction only |
COG ID | [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAAG GTCGGCAACA CGCGCGGCCC GCCGCTCGCG CCGGGCTGTT CGCGCGGTTC GCGGCGATGG CGGTGCTGCT GTCGGGGGCT GCGGTGTCGA TGGCGGCACC GCCGTCATCC GGCGCCAAGC TTCTGCGCCA CGGCACGGCG CACGATGCTC TGTACGACGT GGTGTTCGAA GGGGAGAAGG GCATCGCGGT GGGGGCGTTC GGCAACGTGC TGGCCACGAC CGACGGCGGG GCGACATGGC AGGTCCAGGC CTTCCCGATG AAGCACCTGG CGCTGATGGC CGTGGCGATG CGCGAAGGCA AATGCATCGC CGTCGGCCAG ACCGGCCTGG TGTATGCGGC AGCCGACTGC AAGACCTGGA AGGCTGCGCC CTCCATGACC AAGTCGCGCC TGCTCGCCGT CGACGTCACC CGCCAAGGCC TCGCCTACGC CGTGGGCGCC TTCGGCACCA TCCTCAAGTC CACCGACTGG GGCCAGTCCT GGGCCGTGCA GACCGTCGAC TGGAGCACCA TCACCGATGA CGGCGCCGAA CCCCACCTCT ACGACATCCA CGTCGCCGAG GACGGCAGCG TCACCGCCGT GGGCGAATTC GAACTCGTCC TGCGCAGCAG CGACGGCCAG CAGTGGAAAG CCCTGCACAA GGGAGAACGC TCCCTGTTCG GCCTGTCCGT CGTCGAAGGC GGCAAGAAGA TGTACGCCAG CGGCCAGAGC GGCGCGCTGC TCAGCAGCGC CGACGGCGGC GCCACCTGGA CCTCGCACAA GACCGGCACC GGCGCCATCC TGACCGGCGT GCACGCGACC GCCCAGGGCG AAGTGCTCGC CAGCGGCATC AACGCCGTGG TCCTCAGCCG CGACGGCGGG GCCACCTGGA GCCCGCTGAA CTCCAAGCTC GTGCGCAACG CCTGGTACCA GGCGCTGGCT GCGAGCGAAG GCACCGGCGG CAAGCGGCGC CTGGTGGCGG TGGGAGCCGG TGGAACGATC CTGGAACTCG ATCTTTGA
|
Protein sequence | MSEGRQHARP AARAGLFARF AAMAVLLSGA AVSMAAPPSS GAKLLRHGTA HDALYDVVFE GEKGIAVGAF GNVLATTDGG ATWQVQAFPM KHLALMAVAM REGKCIAVGQ TGLVYAAADC KTWKAAPSMT KSRLLAVDVT RQGLAYAVGA FGTILKSTDW GQSWAVQTVD WSTITDDGAE PHLYDIHVAE DGSVTAVGEF ELVLRSSDGQ QWKALHKGER SLFGLSVVEG GKKMYASGQS GALLSSADGG ATWTSHKTGT GAILTGVHAT AQGEVLASGI NAVVLSRDGG ATWSPLNSKL VRNAWYQALA ASEGTGGKRR LVAVGAGGTI LELDL
|
| |