Gene Mpe_A0944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0944 
Symbol 
ID4787327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp999425 
End bp1000435 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content69% 
IMG OID640089506 
Producthypothetical protein 
Protein accessionYP_001020141 
Protein GI124266137 
COG category[R] General function prediction only 
COG ID[COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGCCC AGTTTGAGAA ATTCGGAGGT GGTGCGCGCG GCGCGGTCAA CCTGCTCACC 
TCGGCCCTGC CGATCGCCAT CATCGGCGGC CTGCTCTACG CCGGCTTCTT CGTGAAGGCC
GAGGCGGTCA TCAAGAAGGT CGAGCCGAAG GCGGTCGAAC GCCGCGACAA CTTCTTCAGC
ATCGCCACGC CGAACGACCA GGTGGCCTGG GCCGCAGGCA GCGGCGGCAA GATCGTCCAC
ACGGTCGATG GCGGCAAGAC CTGGCAGCGG CAGTCGACCG CGACGCTGGA GAACCTGCAG
GGCATCGCCG CGTGGGACGC GATGCACGCT GTGGCGGTGG GCAACAACGG CGTGATCCTC
GTCACCACCA ACGGCGGCAA TCTCTGGACG GCGGCCACGC TGCCGAGCTC CGGCAACCCG
AACAAGCTGT TCCGCGTGCG CATCTTCGAC GGCGTGGCCT GGGCGGTCGG CGAGTTCGGC
GCGCTGCTGC GCTCCGACGA CAAGGGCCAG ACCTGGACGC GCGCGCTGCC CGAGAAGGAC
CGCGCCTGGA ATGCCGTGAG CTTCATCGGT CAGACCGGCT GGCTGGTCGG CGAGTTCGGC
GCGGTGATGC GCAGCACCGA CGGCGGCGCC AACTGGACCG ACATCGAGAC CAAGAACAAG
GTCAGCCTGA TGGCGGTGAG CTTCCGTGAC CCGCAGCACG GCGTGGCCGT GGGCCTCGCG
GGCACGCTGG TCGTCACGAA CGACGGCGGG CTCACCTGGA GCGACGTCGA ACGCCCGACG
CGCGAGCACC TGCTCGACGT CATCTGGGAC GAGAACCGCT GGACCGCGGT CGGCGACAAG
GGGGTCATGG TGAGCTCCGA TGCCACGGCG CAGACCTGGA AAGCCCGCCG CATCTCGGAC
GGCGACGTCT CGTGGCGCAC CCAGATCGCG AAGTCCGGCC CGCGCTACTA CCTGGCCGGC
GCCAACCTCG CCGTGCTCGA AGGCGACCAG CTGACCGTCG CCGGTCGCTG A
 
Protein sequence
MLAQFEKFGG GARGAVNLLT SALPIAIIGG LLYAGFFVKA EAVIKKVEPK AVERRDNFFS 
IATPNDQVAW AAGSGGKIVH TVDGGKTWQR QSTATLENLQ GIAAWDAMHA VAVGNNGVIL
VTTNGGNLWT AATLPSSGNP NKLFRVRIFD GVAWAVGEFG ALLRSDDKGQ TWTRALPEKD
RAWNAVSFIG QTGWLVGEFG AVMRSTDGGA NWTDIETKNK VSLMAVSFRD PQHGVAVGLA
GTLVVTNDGG LTWSDVERPT REHLLDVIWD ENRWTAVGDK GVMVSSDATA QTWKARRISD
GDVSWRTQIA KSGPRYYLAG ANLAVLEGDQ LTVAGR