Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3331 |
Symbol | |
ID | 5831729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 3693109 |
End bp | 3694131 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641369131 |
Product | 2OG-Fe(II) oxygenase |
Protein accession | YP_001640789 |
Protein GI | 163852746 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCA CGCTCCCTAT CCTCGATCTC GCCCGACTCG ATGCGGGTGG GAGCGAGCGC GACGCCTTCC TGGCGGAGCT GCGGACGGCC GCCCGCGAGA CCGGTTTCTT TTATCTCGTC GGACACGGCA TTCCGGCCTC GCAGATCGCC GGCGTCCAGG TGCTGGCACG GCACTTCTTT GCCCTGCCAG CGGAGGAGAA GCGGGCGGTC GCGATGGTGA ACTCGCCCCA TTTCCGAGGC TACACCGAGG CCGGCCAGGA GATCACCCGT GGGCGGGCCG ACTGGCGCGA GCAGTTCGAT ATCGGCGCCG AGCGGGCGGC GCGGCCGCGC GAACCCGGCC TCCCGGCCTG GACTCGCCTC CAGGGGCCGA ACCAGTGGCC GGCCGCCCTG CCGGGGCTCC GGGTCGGGCT CCTGGCCTGG CAGGAGGCGG TCACAGATAT CGGCATCCGC TTGCTGCAGG CCTTCGCCCT GGCGCTTGGG CAGGAGGCCG ACGCCTTCGC GCCGATCTAC GCAGGCGCAC CCAACCAGCA CATCAAGATC ATCCGCTATC CCGGCCGCGA GGCCACCGGC GACAACCAGG GCGTCGGCGC CCACAAGGAC AGCGGCTTCC TCACTCTGTT GGTGCAGGAC GGCGTGGGCG GCCTGGAGGT CGAGGATGCC GACGGCCGCT GGATCGCGGT CGCGCCAGTC GAGGGGGCCT TCGTGGTCAA TGTCGGCGAA CTGCTGGAAC TCGCCTCGAA CGGCTACCTG CGGGCCACAA TTCACCGGGT GGTGACGCCG GATGCAGGCC GGGACCGGCT CTCCATCGCC TTCTTCCTCG GCGCGCGACA CGACGCCACC GTCCCGCTTC TCGAGCTGCC CGCCGCGCTG GCGGAGCAGG CACGGGGCGC GGCGAGCGAC CCCGAAAACC CGCTCTTCCG CGAGGTCGGC CGCAATTACC TTAAGGGTCG ACTGCGCTCG CATCCCGACG TGGCGGCAGC CCACTATGCC GACCTGCTCA TCGCCGAGGC GCGGGCCGCA TGA
|
Protein sequence | MSRTLPILDL ARLDAGGSER DAFLAELRTA ARETGFFYLV GHGIPASQIA GVQVLARHFF ALPAEEKRAV AMVNSPHFRG YTEAGQEITR GRADWREQFD IGAERAARPR EPGLPAWTRL QGPNQWPAAL PGLRVGLLAW QEAVTDIGIR LLQAFALALG QEADAFAPIY AGAPNQHIKI IRYPGREATG DNQGVGAHKD SGFLTLLVQD GVGGLEVEDA DGRWIAVAPV EGAFVVNVGE LLELASNGYL RATIHRVVTP DAGRDRLSIA FFLGARHDAT VPLLELPAAL AEQARGAASD PENPLFREVG RNYLKGRLRS HPDVAAAHYA DLLIAEARAA
|
| |