Gene Information |
Locus tag | Mmar10_2724 |
Symbol | |
ID | 4286089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 2989166 |
End bp | 2990698 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638142223 |
Product | tryptophan halogenase |
Protein accession | YP_757948 |
Protein GI | 114571268 |
COG category | [C] Energy production and conversion; [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
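The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The helper names `gene_length_bp` and `protein_length_aa` are hypothetical and used only for illustration:

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(length_bp: int, has_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop else codons


length = gene_length_bp(2989166, 2990698)
print(length)                     # 1533
print(protein_length_aa(length))  # 510
```

Both values agree with the Gene Length and Protein Length fields of this record.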
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.262395 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0424923 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGTTG ATCCCATTCG CTCCGTCCTG ATTGTCGGCG GCGGCACGGC CGGCTGGATG ACGGCGGCGG CCCTGGCGAA GATATTCGGT GATCAGGCGC TGGACATACG CCTTGTGGAA AGCGAGCAGA TCGGCACGGT CGGCGTCGGC GAGGCCACGA TTCCACAAAT CCTGCTCTTC AACCGGATGC TCGGGATCGA CGAGAACGAG TTTGTTCGCG CGACCCAGGG CACGTTCAAG CTGGGGATCG AATTCGTCAA TTGGCGGCGT GAGGGGCATA GCTATATCCA CCCGTTCGGC TCCTACGGCA GCGACATGGA TGGGGTTATG TTCCACCACT TCTGGCTCCA CGCGCGGCAG CGCGGCTATG AAGTGGACCT GCCGGAATTT TGCCTGCAGA TAATCGCCGC CCGCCAGGGC AAATTCCTGC GGCCGGTACC CGATGCCCGC AATTCTCCGC TCGGGCATAT CGCCTACGCA TTCCAGTTCG ATGCCAGCCT CTATGCCGCC TATCTCCGGC GCTACGCAGA AGGCCGTGGC GTCACGAGAA CGGAAGGCCG CATTGCTAAT ACGCGTCTCG ATCCAGAGAC AGGACATGTG CAGGGCGTTC AGCTTGAGAA CGGCGAGACG ATAGAAGCCG ACTTCTTCAT CGACTGCTCG GGCTTTCGCG GCCTGCTGAT CGAACAGGCG CTCAAGACTG GTTATGAGGA CTGGAGCTCC TGGCTGCCCT GCAACAGCGC CCTTGCGGTT CCGTCCGAAA ACACGGGGCC GCCCTCGCCC TACACGCGCG CCACGGCCCG GAAGGCTGGT TGGCAGTGGC GTATTCCGCT CCAGCACCGC ACCGGTAACG GTCATGTCTA TTGCTCTGAC CATATCAGCG ATGACGAAGC GGCGAACATC CTGCTGTCCA ACCTCGACGG CCCGGCGCTG CGTGACCCGC TGCAACTGCG TTTTACGACC GGACATCGAA AGAAGTTCTG GAACAAGAAC GTGCTCGCGA TCGGACTGTC TGCCGGCTTC ATGGAACCAC TGGAGTCGAC GAGCATCCAT CTGATTCAGA GCGGCATTGC GCGTTTGATG ACACTCTTTC CCGACCGCGC CTTCAACCCG GTGGACATCG ACGAATACAA TCGACTCACC CTGCAGGAAT ACGACTATGT GCGCGACTTT CTGGTTCTGC ACTATCGGCA GACAGAGCGC GATGACAGCG AGTTCTGGCG CTATTGCCGA AACCTCCCGC TAACGGATCA TCTGGCCCGC AAGCTTGCGC TCTACCAGAC CAATGGTCGC GTCAGCCGCG AAAAGGACGA GCTCTTCAAC GAGACCAGCT GGCTGGCTGT TCTGGATGGG CAGGGCGTCG TTCCGCAGGG GCATCACCCG CTTGTTACCG GGCTTGGCGA TGCAGAAGTC GATGCGCGCC TGGAGCAGAT TCTGGGCGCA GTCCGAGCAT CGGCCCGACA AATGCCCTCG CATGCAGACT TCATCGCCAA CCAGTGCGCT GCGCCGGCCG ATATCCAACT TTTTCAGGCT TAG
|
Protein sequence | MPVDPIRSVL IVGGGTAGWM TAAALAKIFG DQALDIRLVE SEQIGTVGVG EATIPQILLF NRMLGIDENE FVRATQGTFK LGIEFVNWRR EGHSYIHPFG SYGSDMDGVM FHHFWLHARQ RGYEVDLPEF CLQIIAARQG KFLRPVPDAR NSPLGHIAYA FQFDASLYAA YLRRYAEGRG VTRTEGRIAN TRLDPETGHV QGVQLENGET IEADFFIDCS GFRGLLIEQA LKTGYEDWSS WLPCNSALAV PSENTGPPSP YTRATARKAG WQWRIPLQHR TGNGHVYCSD HISDDEAANI LLSNLDGPAL RDPLQLRFTT GHRKKFWNKN VLAIGLSAGF MEPLESTSIH LIQSGIARLM TLFPDRAFNP VDIDEYNRLT LQEYDYVRDF LVLHYRQTER DDSEFWRYCR NLPLTDHLAR KLALYQTNGR VSREKDELFN ETSWLAVLDG QGVVPQGHHP LVTGLGDAEV DARLEQILGA VRASARQMPS HADFIANQCA APADIQLFQA
|
| |
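The gene and protein sequences above can be checked against each other. The following is a minimal sketch, not the database's own method: it strips the 10-character spacing, computes the GC fraction, and translates codon by codon. For internal codons, translation table 11 assigns the same amino acids as the standard code; it differs in which start codons are permitted, which this sketch ignores because the gene starts with ATG. The names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between blocks and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGCCCGTTG ATCCCATTCG CTCCGTCCTG"
print(translate(prefix))  # MPVDPIRSVL
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein fields to be checked the same way.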