Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_3184 |
Symbol | |
ID | 8327374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 3689718 |
End bp | 3691043 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644943701 |
Product | tryptophan halogenase |
Protein accession | YP_003100941 |
Protein GI | 256377281 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.153347 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGACG CGATCGTGAT GGGTGGTGGG CCCGCCGGGG CCGTCTGCGC CTCGGTGCTC GCCCGGCAGG GCCGCTCGGT GCTCGTGCTG GAGCGGCAGG AGTTCCCCCG CTTCCACATC GGCGAGTCGA TGCTGCCGTA CATGGTCGGC CTGCTGGAGC GGCACGGCCT GCTCGACGCG GTGCGGGAGC AGGGCTACGT CGTCAAGCGC GGCGGCGAGT TCATCGACCC GACGGGCACG AAGTTCTTCC GCGCCGGGGT GTTCCGCGCC GACTTCGCCA AGACCGGAGA CGGCCGCCAC CACGAGACGT TCCAGGTCGA GCGCTCGCAC TTCGACCGGG TGAACCTGGA CCAGGCGCGC GCGGCGGGCG CGACCGTGCG GGAGGGCGCG CAGGTCGTCG GGCTGCTGGA GGAGGGCGGC CGGGTGGTGG GCGTGCGCTA CCGCGAGGGC GGGGTCGAGC GCGAGGAGCG GGCCCGGTAC GTGGTGGACG CCACCGGCCG GGCAGGCGTG GTCGCCAACC GGTTCGGGCT GCGGCGGATG ATCGAGGACC TGCGCATGGT GGCGGTGTTC CACCACCGCG ACGGGCTGGA CGAGGCGCAC AACCCCGGCC ACGAGGGCGA CATCCAGGTC GGCAGTCACG CGGACGGGTG GATCTGGGCG ATCCCGCTGT CGGCGGACCG GATCAGCGTG GGCACGGTGA TGCACCGCGA CCGGCTGCGC GGTCGCACCC CCGCCGAGGC GTTCGCCGAG CACGTCGAGC GGGTGCCCAG GATCAACCAG CGGCTCACCG GCACCACGGC GACCTCGGAC TTCTGGGTGG AGACCGACTA CAGCTACCAC TCCGACCAGG TCACCGGCCC CGGCTGGGTG ATGGTCGGGG ACGCGGGCTG CTTCGGCGAC CCCATGTTCT CCGGCGGGGT GCTGGTGGGC ATGGCCACCG GCGCGGAGGC CGCCGAGGCG CTCGGGCGGG CGCTCGACTC GCCCGCCGAC GAGGAGCAGG CGCTCACCGG CTACTCGAAC TTCTTCAAGA CCGGCTACGA CACGTACGTG CGGCTGATCT TCTCCTTCTA CGAGGGCGAG CTGCTGCCCG CGCTGGCCGA GGCGCACTCG GAGGCGGGGG ACCTCTCCGA GGCGGACATG GAGATGTACG TGGTGCGGCT GCTGGGCGGC GACTTCTGGA GCGCGCGCAA CCCGGTGGCG AACGCGCTGC GCGCGAACCC GGCGTGGTCG ACGTTCTCGC CGTTCGAGCC GGTGCACCGC TGCCCGGTGT ACCCGGAGCT GGACGTGGCG GAGCTGGGCG GCCTCGCGGG CGCGGTCGGC CGGTGA
|
Protein sequence | MLDAIVMGGG PAGAVCASVL ARQGRSVLVL ERQEFPRFHI GESMLPYMVG LLERHGLLDA VREQGYVVKR GGEFIDPTGT KFFRAGVFRA DFAKTGDGRH HETFQVERSH FDRVNLDQAR AAGATVREGA QVVGLLEEGG RVVGVRYREG GVEREERARY VVDATGRAGV VANRFGLRRM IEDLRMVAVF HHRDGLDEAH NPGHEGDIQV GSHADGWIWA IPLSADRISV GTVMHRDRLR GRTPAEAFAE HVERVPRINQ RLTGTTATSD FWVETDYSYH SDQVTGPGWV MVGDAGCFGD PMFSGGVLVG MATGAEAAEA LGRALDSPAD EEQALTGYSN FFKTGYDTYV RLIFSFYEGE LLPALAEAHS EAGDLSEADM EMYVVRLLGG DFWSARNPVA NALRANPAWS TFSPFEPVHR CPVYPELDVA ELGGLAGAVG R
|
| |