Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_2734 |
Symbol | |
ID | 4286047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 3005764 |
End bp | 3007278 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638142233 |
Product | tryptophan halogenase |
Protein accession | YP_757958 |
Protein GI | 114571278 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.816599 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCAA ACCGCATCAA ATCCGTCCTG ATTGTTGGCG GAGGAACCGC CGGCTGGATG GCGGCCGCGG CCCTGTCACG CTTCCTCGGT CCGTCGCTGT CCATCACCCT GGTTGAATCA GACCAGATCG CCACGGTCGG CGTCGGCGAA GCGACCATCC CTCAGATCAA GCATCTGAAT GCGGCGCTGG GCATTGAAGA ACGCGATTTT GTGGCCCGCA CAAACGGGTC CTTCAAACTT GGAATCGAAT TCATCGACTG GCATCGCAAG GGGGCTGCCT ACCTCCACAA TTTCGGGTCC ATCGGCCTCA ACCTGCAGCA GGTCCCCTTC CACCACTACT GGCTGAGGGA GCGGAGCGAG GGGGCGACCA CAAGTTTGTG GGACTATTGC CTCAACACCG CGGCAGCCCG GGACATGCGG TTCGCGCCGA TGGAAAAAGT CGGAAGTTCG CCGCTGTCGG GTATTGGCTA TGCCTATCAT TTCGACGCGA CTCTCTATGG GCGTTTCCTT CGCGAATACG CTGAACAGCG TGGGGTTTCG CGCATTGAAG GCTTGATCGA GACCTGCCGG CAGGATCCGC AAACCGGTGA CGTTTCCAAA GTTTGCCTGC AGGATGGCCG CGAACTGGCC GCCGACCTGT TCATCGATTG CTCCGGATTC CGGGGCCTGC TGATCGAGCA GGCTCTGGAG ACCGGCTATG AGGACTGGAC CCATTGGCTA CCCTGCGACC GAGCCATCCC GGTCCCAAGC TCGAACGAGG CGGCCCCTAT CCGTCCCTAC ACGCAATCGA TTGCCCATAA GGCGGGCTGG CAGTGGCGCA TCCCGCTGCA GCACCGGACA GGCAATGGCC ATGTCTTTTC CAGCGCCCAT ATGAGCGAAG CAGAGGCGAC CGACACCCTG CTGGACAATC TGGAAGGGAA GCCGCTGGCC GAGCCGCGCA TGATCCGCTT CGTGACCGGG CGCCGCAAAC AATTCTGGAA CCGCAATGTT GTCGCACTCG GTCTGGCGTC CGGCTTTCTT GAGCCCCTTG AATCGACATC CATCCACCTG GTCCAGTCCG GCATCAGCCG CCTGATCGCG ATGTTCCCCG ATGGCGACAT CGCAGCAGCC GACCGGACCG AATACAATCG CCAGATGGTG CTCGAATATG AGCGCGTTCG CGACTTCATC ATCTTGCACT ACCATCTAAA CGAGCGGACC GACTCCGACT TCTGGCAGGA TTGTGCGGCC ATGTCGGTGC CCGACAGTTT GACCGCCAAG ATGGACCTGT TCCGCGCCAA TGGTCGCCTC TACCATCGCG AAGAAGACCT GTTCACCGAC TCAAGCTGGC TCCAGGTCAT GCTGGGCCAG GGGTTGATGC CGCGTGGCTA TCACCCCATA GCCGATGCCT TGCCATCTGA TCAGCTGGCA GGCTTTCTCG GCGATATCCT AAAGATCGTC GGGCAGGCCG CGGCCACTCT GCCGCCGCAT GCGGACTATC TGGCCAGGCA TTGCCCGGCA GCGGCCGACG CCTAG
|
Protein sequence | MKPNRIKSVL IVGGGTAGWM AAAALSRFLG PSLSITLVES DQIATVGVGE ATIPQIKHLN AALGIEERDF VARTNGSFKL GIEFIDWHRK GAAYLHNFGS IGLNLQQVPF HHYWLRERSE GATTSLWDYC LNTAAARDMR FAPMEKVGSS PLSGIGYAYH FDATLYGRFL REYAEQRGVS RIEGLIETCR QDPQTGDVSK VCLQDGRELA ADLFIDCSGF RGLLIEQALE TGYEDWTHWL PCDRAIPVPS SNEAAPIRPY TQSIAHKAGW QWRIPLQHRT GNGHVFSSAH MSEAEATDTL LDNLEGKPLA EPRMIRFVTG RRKQFWNRNV VALGLASGFL EPLESTSIHL VQSGISRLIA MFPDGDIAAA DRTEYNRQMV LEYERVRDFI ILHYHLNERT DSDFWQDCAA MSVPDSLTAK MDLFRANGRL YHREEDLFTD SSWLQVMLGQ GLMPRGYHPI ADALPSDQLA GFLGDILKIV GQAAATLPPH ADYLARHCPA AADA
|
| |