Gene Mpe_A0399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0399 
Symbol 
ID4785149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp435819 
End bp436826 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content70% 
IMG OID640088954 
Productphotosystem II stability/assembly factor-like protein 
Protein accessionYP_001019596 
Protein GI124265592 
COG category[R] General function prediction only 
COG ID[COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAAG GTCGGCAACA CGCGCGGCCC GCCGCTCGCG CCGGGCTGTT CGCGCGGTTC 
GCGGCGATGG CGGTGCTGCT GTCGGGGGCT GCGGTGTCGA TGGCGGCACC GCCGTCATCC
GGCGCCAAGC TTCTGCGCCA CGGCACGGCG CACGATGCTC TGTACGACGT GGTGTTCGAA
GGGGAGAAGG GCATCGCGGT GGGGGCGTTC GGCAACGTGC TGGCCACGAC CGACGGCGGG
GCGACATGGC AGGTCCAGGC CTTCCCGATG AAGCACCTGG CGCTGATGGC CGTGGCGATG
CGCGAAGGCA AATGCATCGC CGTCGGCCAG ACCGGCCTGG TGTATGCGGC AGCCGACTGC
AAGACCTGGA AGGCTGCGCC CTCCATGACC AAGTCGCGCC TGCTCGCCGT CGACGTCACC
CGCCAAGGCC TCGCCTACGC CGTGGGCGCC TTCGGCACCA TCCTCAAGTC CACCGACTGG
GGCCAGTCCT GGGCCGTGCA GACCGTCGAC TGGAGCACCA TCACCGATGA CGGCGCCGAA
CCCCACCTCT ACGACATCCA CGTCGCCGAG GACGGCAGCG TCACCGCCGT GGGCGAATTC
GAACTCGTCC TGCGCAGCAG CGACGGCCAG CAGTGGAAAG CCCTGCACAA GGGAGAACGC
TCCCTGTTCG GCCTGTCCGT CGTCGAAGGC GGCAAGAAGA TGTACGCCAG CGGCCAGAGC
GGCGCGCTGC TCAGCAGCGC CGACGGCGGC GCCACCTGGA CCTCGCACAA GACCGGCACC
GGCGCCATCC TGACCGGCGT GCACGCGACC GCCCAGGGCG AAGTGCTCGC CAGCGGCATC
AACGCCGTGG TCCTCAGCCG CGACGGCGGG GCCACCTGGA GCCCGCTGAA CTCCAAGCTC
GTGCGCAACG CCTGGTACCA GGCGCTGGCT GCGAGCGAAG GCACCGGCGG CAAGCGGCGC
CTGGTGGCGG TGGGAGCCGG TGGAACGATC CTGGAACTCG ATCTTTGA
 
Protein sequence
MSEGRQHARP AARAGLFARF AAMAVLLSGA AVSMAAPPSS GAKLLRHGTA HDALYDVVFE 
GEKGIAVGAF GNVLATTDGG ATWQVQAFPM KHLALMAVAM REGKCIAVGQ TGLVYAAADC
KTWKAAPSMT KSRLLAVDVT RQGLAYAVGA FGTILKSTDW GQSWAVQTVD WSTITDDGAE
PHLYDIHVAE DGSVTAVGEF ELVLRSSDGQ QWKALHKGER SLFGLSVVEG GKKMYASGQS
GALLSSADGG ATWTSHKTGT GAILTGVHAT AQGEVLASGI NAVVLSRDGG ATWSPLNSKL
VRNAWYQALA ASEGTGGKRR LVAVGAGGTI LELDL