Gene Mpe_A1999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1999 
Symbol 
ID4783786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2141251 
End bp2142435 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content68% 
IMG OID640090569 
Producthypothetical protein 
Protein accessionYP_001021192 
Protein GI124267188 
COG category[R] General function prediction only 
COG ID[COG0820] Predicted Fe-S-cluster redox enzyme 
TIGRFAM ID[TIGR00048] radical SAM enzyme, Cfr family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.175204 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCGG CCAACCTGCT CGAATTCGAT CTCGACGCGC TGGCCGCTTT CTGCGAGCAG 
CTCGGCGAGA AGCGTTTTCG CGCCACTCAG CTGTTCCGCT GGATCCACCA GAAGGGCCAG
AGCGACTTCG CTCAGATGTC CGATCTGGCG AAGTCGCTGC GCGAGAAGCT GGCGGGGCGG
GCGGTGGTCC GGCCACTCGC AGTGCTGAGC GAGCACGTGT CGGCCGACGG CACGGTCAAG
TGGCTGTTCG ACGTCGGCGG CGGCAATGCC GTCGAGACGG TGTTCATCCC CGAGAACGAT
CGCGGCACGT TGTGCATCTC GTCGCAGGCC GGTTGCGCGG TCGGTTGCCG CTTCTGCTCG
ACCGGTCACC AGGGCTTCAG TCGCAACCTG TCGACCGGCG AGATCGTTGC CCAGCTCTGG
CATGCCGAGC ACCAGCTGCG CGCACGGCTG GGCACGACCG AGCGCGTCAT CAGCAACGTC
GTGATGATGG GCATGGGTGA GCCGCTGCAG AACTACGCCG CGCTGTTGCC GGCGCTGCGC
GTGATGCTCG ACGATCACGG CTACGGCCTG TCGCGTCGCC GTGTCACGGT ATCGACCTCC
GGTGTGGTGC CGATGATCGA CCGCCTGCGC GAGGACTGTC CGGTGGCTCT GGCAGTGTCG
CTGCATGCGC CGACCGACGC GCTGCGCGAC GATCTCGTGC CGCTCAACCG CAAGTACCCG
ATCGCAGAGC TGCTCGAGGC CTGCCAGCGC TACCTCGAGG CGGCGCCGCG CGACTTCATC
ACCTTCGAGT ACTGCATGCT CGACGGCGTC AACGACAGTG AGGCGCAGGC GCGCGAACTG
TTGCGCCTGG TGGGCGAACG CGGGCCGGTG GGGCGCGTGC CCTGCAAGAT CAACCTCATC
CCGTTCAACC CGTTCCCGGC CTCGGGGCTG ACGCGTTCGT CAGTGGCGCG CGTGCAGGCC
TTCGCGCAGC TGCTGGTCGA CGGGGGTCTG GTCACCACGG TGCGACGGAC TCGCGGCGAT
GACATCGACG CCGCCTGCGG CCAACTGGCC GGCGAGGTAC AGGACCGCAC CAATGCGCAG
GCACGGATGC GGCGTGCGCC GATCGCCATC CGGCCGATCG ACAGCGCGGT GCAGCGCCGG
GCCGACGCTG CACCATCAGG TTCAGCCACG GAGACGACAC GATGA
 
Protein sequence
MTAANLLEFD LDALAAFCEQ LGEKRFRATQ LFRWIHQKGQ SDFAQMSDLA KSLREKLAGR 
AVVRPLAVLS EHVSADGTVK WLFDVGGGNA VETVFIPEND RGTLCISSQA GCAVGCRFCS
TGHQGFSRNL STGEIVAQLW HAEHQLRARL GTTERVISNV VMMGMGEPLQ NYAALLPALR
VMLDDHGYGL SRRRVTVSTS GVVPMIDRLR EDCPVALAVS LHAPTDALRD DLVPLNRKYP
IAELLEACQR YLEAAPRDFI TFEYCMLDGV NDSEAQAREL LRLVGERGPV GRVPCKINLI
PFNPFPASGL TRSSVARVQA FAQLLVDGGL VTTVRRTRGD DIDAACGQLA GEVQDRTNAQ
ARMRRAPIAI RPIDSAVQRR ADAAPSGSAT ETTR