Gene Mpe_A0966 details


Gene Information

Locus tag: Mpe_A0966
Symbol:
ID: 4787112
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1027116
End bp: 1028207
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 76%
IMG OID: 640089528
Product: hypothetical protein
Protein accession: YP_001020163
Protein GI: 124266159
COG category: [R] General function prediction only
COG ID: [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor
TIGRFAM ID:
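
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1027116
end_bp = 1028207

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1092
print(protein_length)   # 363
```

Both results match the reported Gene Length (1092 bp) and Protein Length (363 aa).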


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.254365
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCCAT CGCGCTCGTT GATCGCGCTC ACCGCGGCGC TGTGCCTGGC GGTCGCCGCC 
GCCGTGCCGG TCGGCGCTGC CGGCACCGCG GCACCGCGCT TCGGCGTGCT GCAGCAGGCC
GCGCTGCAGT CGCCGCGGGC GCTGTCGGCC ACGATGCTCG CGGTGGCGGC CGCCGGGAAG
CGACTGGTCG CCGTTGGCGA GCGCGGCATC GTGCTGCTGT CCGACGACGG CGGCGCCCGC
TGGCGCCAGG CGGCCACCCC CGTGCGGGCC AGCCTGACGG CGGTGCAGTT TGTCGACGAA
CGGCAGGGCT GGGCCGTCGG CCACCTCGGC GTCGTGCTGC ATTCCGGCGA CGGCGGCGAA
ACGTGGACCA AGCAGCTCGA CGGGCTGCAG CTGCCGGCGC TGTTCGAGCA GGCGGCACGC
GCCGACGCGG CCGCCGCGCC GGCCTACCGC GACTACGTCC AGCTGCTTGC CGACGACGGC
CCCGACAAGC CGCTGCTCGC GCTGCACTTC CAGGACGCTC GGCGCGGCAT CGTCGTCGGC
GCCTACAACC TCGCGCTCGG CACGGAGGAC GGTGGCGCCA CCTGGACGCC GCTGAGCGCT
CGACTGCCCA ACCCGCGCTC GCTGCACCTC TACGGCGTCG CGGTCAGCGG CGCCTCGATC
GTGCTGGCCG GCGAGCAGGG CCTGCTGCTG CGCTCCGACA ACGGCGGCCG TGATTTCGCC
GCGCTGGAGT CGCCCTACCG GGGCAGCTGG TTCGGCCTGC TGGCCACCCG CGGCGACCGC
CTGCTGGTCT ACGGACTGCG CGGTGCGGCC TACGTGTCGG CCGACCGCGG CTCGAGCTGG
ACCCAGGCGA GCACCGAGCT GCCCGTCTCG ATCAGCGGCG CGGCCGAACT GGCCGATGGC
ACGCTCGTGC TCGGCAGCTC GGCCGGCGAC CTGCTGGTCA GCCGTGACCA GGGGCGCAGC
TTCCAGCGCC GCGACGGACC GCCCCAGCCG CCGATCGCCG GCCTGGTGCC CACGCAGGAC
GGCGCCCTCG CACTGGCCGG GCTGCGGGGT CCGCAGCGCG TGGACCTGGC TGCCCCGCCC
GCCGCCCGCT GA
 
Protein sequence
MSPSRSLIAL TAALCLAVAA AVPVGAAGTA APRFGVLQQA ALQSPRALSA TMLAVAAAGK 
RLVAVGERGI VLLSDDGGAR WRQAATPVRA SLTAVQFVDE RQGWAVGHLG VVLHSGDGGE
TWTKQLDGLQ LPALFEQAAR ADAAAAPAYR DYVQLLADDG PDKPLLALHF QDARRGIVVG
AYNLALGTED GGATWTPLSA RLPNPRSLHL YGVAVSGASI VLAGEQGLLL RSDNGGRDFA
ALESPYRGSW FGLLATRGDR LLVYGLRGAA YVSADRGSSW TQASTELPVS ISGAAELADG
TLVLGSSAGD LLVSRDQGRS FQRRDGPPQP PIAGLVPTQD GALALAGLRG PQRVDLAAPP
AAR
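
The relationship between the two sequence blocks above can be spot-checked by translating part of the gene sequence. The following is a minimal sketch, not the database's own pipeline: it assumes the 10-character groups are only a display convention, and it uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its set of permitted start codons, which this sketch does not handle). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table built from the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases; returns 0.0 for an empty sequence."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTCTCCAT CGCGCTCGTT GATCGCGCTC ACCGCGGCGC TGTGCCTGGC GGTCGCCGCC"
last_line = "GCCGCCCGCT GA"

print(translate(first_line))  # MSPSRSLIALTAALCLAVAA
print(translate(last_line))   # AAR
```

The first line translates to the first 20 residues of the protein sequence, and the last line translates to its final three residues followed by the TGA stop codon. The last line can be translated on its own only because the preceding 18 lines hold 1080 bases, a multiple of three. Passing the full gene sequence to `gc_percent` would give a value to compare against the reported GC content.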