Gene Mpe_A3247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3247 
Symbol 
ID4786526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3451210 
End bp3452268 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content75% 
IMG OID640091820 
Productdioxygenase 
Protein accessionYP_001022435 
Protein GI124268431 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0319665 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGCG CTGCCACCTC CCAACTGCTC GCCCTGCTCG GCACCGAGCT GCCGATCATC 
CAGGCGCCGA TGGCTGGCGT GCAGGTCGGC GCGATGACCG TCGCCGTCAG CAACGCCGGC
GGGCTGGGCT CGCTGCCGGC CGCGATGCTG GGCGCAGACG CGCTGCGCAG CGAGCTGGCC
GCGATCCGCG AACGGACCGC ACGCCCCTAC AACGTCAACT TCTTCTGCCA TGCACCGCCC
GTGCCCAGCA GCGAGCGCGA GGCGAACTGG CGGGCCACGC TGGCGCCGTA CTACCGCGAG
TTCGGTATCG ACGCCTCGGC CATCCCGCCG GGCCCGGGGC GACGCGCGTT CGGTGCCGAG
GAGGCGGAGC TGCTGGCCGA GTTCGAGCCG CCGGTGGTGA GCTTCCACTT CGGGCTCCCG
TCGGCCGAGC TGATGGTGCG CGTGCGGCGC TGGGGCGCGA AGCTGCTGGC GTCGGCCACC
ACGGTCGACG AGGCGCGCTG GCTCGAGGAC CACGGGGTCG ACGCCGTCAT CGCCCAGGGC
CTGGAGGCCG GCGGCCACCG CGGCCACTTC CTGTCCGACG ACCTGAGCGC CCAGCTCGGG
ACCTTCGCGC TGCTGCCCCA GGTGGTGCGG GCGGTGCGCG TGCCGGTGAT CGCGGCCGGC
GGCATTGCCG ATGCGAACGG CGTGGCCGCA GCGCTGGCCC TGGGCGCGGC CGGCGTGCAG
GTGGGCACGG CCTACCTGCT GTGCCCGGAA GCGACCACCA GCGCGCTGCA CCGCGCCGCG
CTGCAGAGCG ACGCCGCGCG CCACACGGCC CTCACGCGCC TGTTCACCGG CCGGCCCGCG
CGCGGCATCG TCAACCGTGT GATGCGCGAG CTGGGGCCGA TGAACCCGGC CGCGCCCGCG
TTCCCGCTGG CCACCGCGGC GATCGCGCCG CTGCGCGCAC ACGCCGAGAA GCAGGGCAGC
GGCGACTTCT CGCCGCTGTG GTCGGGGCAG AACGCGAGCG GCTGCCTCGC GTTGCCGGCC
GCCGAGGTGA CGCGTTCGCT GGCGGAGGGT CTGCTGTAG
 
Protein sequence
MSSAATSQLL ALLGTELPII QAPMAGVQVG AMTVAVSNAG GLGSLPAAML GADALRSELA 
AIRERTARPY NVNFFCHAPP VPSSEREANW RATLAPYYRE FGIDASAIPP GPGRRAFGAE
EAELLAEFEP PVVSFHFGLP SAELMVRVRR WGAKLLASAT TVDEARWLED HGVDAVIAQG
LEAGGHRGHF LSDDLSAQLG TFALLPQVVR AVRVPVIAAG GIADANGVAA ALALGAAGVQ
VGTAYLLCPE ATTSALHRAA LQSDAARHTA LTRLFTGRPA RGIVNRVMRE LGPMNPAAPA
FPLATAAIAP LRAHAEKQGS GDFSPLWSGQ NASGCLALPA AEVTRSLAEG LL