Gene Mpe_A2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2066 
Symbol 
ID4784641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2209290 
End bp2210834 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content71% 
IMG OID640090634 
Producthypothetical protein 
Protein accessionYP_001021257 
Protein GI124267253 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG0645] Predicted kinase
[COG2187] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.695789 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACCCG AGGCGCTTTC GACCTTCGCC GCGGCAAAGA CCGGGGCGCA AGCAGCGCCC 
TTCGCCGCCG GCGAACGCAT CGTCGGCGCA CTCCAGCGCA AGCTGGGCGC GCGCCTCGTC
GAGACCCACA TCTCGTGGGT GCTGCTCGGC GCCGGGCAAG TCTGGAAGAT CAAGAAGCCG
GTGACGCTGC CGTTCGTCGA CTTCAGCACG CTGGAGGCGC GCCGGCGCAT GTGCGCCGAG
GAGTTGCAGC TGAATCGCCG CCTCGCGCCT TCGATCTACC AGGCCGTCGT CCCGGTGACC
GGCGAGCCCG ACGACCCGGT GCTCGGCGGG TCCGGGCCTG CGATCGAGTG GGCGCTCGTG
ATGCGCCGCT TCGCCGATGG GGCGCTGTTC AGCGAGCGCC TGGCGGCCGG CACGCTGACG
CCAGCGCTCG TCGACAAGCT GGCGGAACGG CTGGCCACCT TCCATGCCGC GGCGCCTCGC
GCCAGCGCGG GCACGGCCTA CGGCAGCCCG ACTCGCATCG CGGCCGACAC GTGTCACTTG
CTCGAAGGTC TTGCCGCGCA CGGCGTGCAG ACGCAGGCGT TCGAAACCTG GGCCCGCTCG
CAGGCGTCGG CACTCGACGG CCACTGGCGG GCGCGCCAGC GCGAAGGCTG GGTGCGCGAA
GGCCATGGCG ACCTGCACCT GGCCAACCTG TTCGCCGAAG GCGACGAGGC AACCGCCTTC
GATGGCATCG AGTTCGACCC GGCGATGCGC TGGATCGACG TCCAGGCCGA CATCGCCTTC
ACCGTGATGG ACCTGACGGC TCACGGACGC ACTGACCTCG GATTCCGCTT CCTCGATCGT
TGGCTCGCGG CCACCGGCGA CCACGCCGGT CTGGCCGTGT TGCGCTACTA CCTCGTCTAC
CGTGCCCTCG TGCGGGCGAT GGTCGCGACG CTGCACTCAC CGGTGACCGA GGCGCCCGAC
TACCTGGGCC TGGCACGGCA CTGGATCGCG CCCGCGAGCG GGCGGCTGCT GCTGATGCAT
GGCGTGTCCG GCTCGGGCAA GAGCACCATC GCGGAGCGCC TGCTCGAACG CGCAGGCGCG
GTGCGGTTGC GTTCTGACGT CGAGCGCAAG CGCCTGCATG GGCTGGCGGC GCTGGCGCGC
AGCGGAGCAG GACTGGGCGA CGGTCCCTAC GATGCGGCGG GCACGGAGCA GACCTATGCG
TACCTGCGTG ACGCTGCGCA CCACGCGCTC GCGGCCGGCT GGCCGGTGAT CGTCGACGCG
ACCTTCCTTT CCGAGGCGCC GCGGCGCATG TTCCGCGCGC TGGCCGACCA GATGCGGGTG
CCGTTCTCGA TCCTGCACTG CGAGGCACCC CGGGACGTTC TTGCCGTGCG TCTGGCGCAG
CGTGCCGCCG CCGCGAGCGA CGCTTCGGAA GGCGGCACCG AGGTGCTGGA GCATCAGTTG
CGCACACAGC AGCTGCTGGC CGAAGACGAG CTTGCCCACG TGCTGCCGCT GGCCGGTGAC
ATCGATGCGC TGGCGGCGCA CTGGCTCGCC GCCGACGCGC GCTGA
 
Protein sequence
MRPEALSTFA AAKTGAQAAP FAAGERIVGA LQRKLGARLV ETHISWVLLG AGQVWKIKKP 
VTLPFVDFST LEARRRMCAE ELQLNRRLAP SIYQAVVPVT GEPDDPVLGG SGPAIEWALV
MRRFADGALF SERLAAGTLT PALVDKLAER LATFHAAAPR ASAGTAYGSP TRIAADTCHL
LEGLAAHGVQ TQAFETWARS QASALDGHWR ARQREGWVRE GHGDLHLANL FAEGDEATAF
DGIEFDPAMR WIDVQADIAF TVMDLTAHGR TDLGFRFLDR WLAATGDHAG LAVLRYYLVY
RALVRAMVAT LHSPVTEAPD YLGLARHWIA PASGRLLLMH GVSGSGKSTI AERLLERAGA
VRLRSDVERK RLHGLAALAR SGAGLGDGPY DAAGTEQTYA YLRDAAHHAL AAGWPVIVDA
TFLSEAPRRM FRALADQMRV PFSILHCEAP RDVLAVRLAQ RAAAASDASE GGTEVLEHQL
RTQQLLAEDE LAHVLPLAGD IDALAAHWLA ADAR