Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3247 |
Symbol | |
ID | 4786526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3451210 |
End bp | 3452268 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640091820 |
Product | dioxygenase |
Protein accession | YP_001022435 |
Protein GI | 124268431 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0319665 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGCG CTGCCACCTC CCAACTGCTC GCCCTGCTCG GCACCGAGCT GCCGATCATC CAGGCGCCGA TGGCTGGCGT GCAGGTCGGC GCGATGACCG TCGCCGTCAG CAACGCCGGC GGGCTGGGCT CGCTGCCGGC CGCGATGCTG GGCGCAGACG CGCTGCGCAG CGAGCTGGCC GCGATCCGCG AACGGACCGC ACGCCCCTAC AACGTCAACT TCTTCTGCCA TGCACCGCCC GTGCCCAGCA GCGAGCGCGA GGCGAACTGG CGGGCCACGC TGGCGCCGTA CTACCGCGAG TTCGGTATCG ACGCCTCGGC CATCCCGCCG GGCCCGGGGC GACGCGCGTT CGGTGCCGAG GAGGCGGAGC TGCTGGCCGA GTTCGAGCCG CCGGTGGTGA GCTTCCACTT CGGGCTCCCG TCGGCCGAGC TGATGGTGCG CGTGCGGCGC TGGGGCGCGA AGCTGCTGGC GTCGGCCACC ACGGTCGACG AGGCGCGCTG GCTCGAGGAC CACGGGGTCG ACGCCGTCAT CGCCCAGGGC CTGGAGGCCG GCGGCCACCG CGGCCACTTC CTGTCCGACG ACCTGAGCGC CCAGCTCGGG ACCTTCGCGC TGCTGCCCCA GGTGGTGCGG GCGGTGCGCG TGCCGGTGAT CGCGGCCGGC GGCATTGCCG ATGCGAACGG CGTGGCCGCA GCGCTGGCCC TGGGCGCGGC CGGCGTGCAG GTGGGCACGG CCTACCTGCT GTGCCCGGAA GCGACCACCA GCGCGCTGCA CCGCGCCGCG CTGCAGAGCG ACGCCGCGCG CCACACGGCC CTCACGCGCC TGTTCACCGG CCGGCCCGCG CGCGGCATCG TCAACCGTGT GATGCGCGAG CTGGGGCCGA TGAACCCGGC CGCGCCCGCG TTCCCGCTGG CCACCGCGGC GATCGCGCCG CTGCGCGCAC ACGCCGAGAA GCAGGGCAGC GGCGACTTCT CGCCGCTGTG GTCGGGGCAG AACGCGAGCG GCTGCCTCGC GTTGCCGGCC GCCGAGGTGA CGCGTTCGCT GGCGGAGGGT CTGCTGTAG
|
Protein sequence | MSSAATSQLL ALLGTELPII QAPMAGVQVG AMTVAVSNAG GLGSLPAAML GADALRSELA AIRERTARPY NVNFFCHAPP VPSSEREANW RATLAPYYRE FGIDASAIPP GPGRRAFGAE EAELLAEFEP PVVSFHFGLP SAELMVRVRR WGAKLLASAT TVDEARWLED HGVDAVIAQG LEAGGHRGHF LSDDLSAQLG TFALLPQVVR AVRVPVIAAG GIADANGVAA ALALGAAGVQ VGTAYLLCPE ATTSALHRAA LQSDAARHTA LTRLFTGRPA RGIVNRVMRE LGPMNPAAPA FPLATAAIAP LRAHAEKQGS GDFSPLWSGQ NASGCLALPA AEVTRSLAEG LL
|
| |