Gene Information |
Locus tag | Hoch_5510 |
Symbol | |
ID | 8547923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 7556583 |
End bp | 7558346 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646390183 |
Product | tryptophan halogenase |
Protein accession | YP_003269886 |
Protein GI | 262198677 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
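The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 7556583
END_BP = 7558346
REPORTED_GENE_LENGTH = 1764     # bp
REPORTED_PROTEIN_LENGTH = 587   # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    computed_gene = gene_length(START_BP, END_BP)
    computed_protein = protein_length(computed_gene)
    print("gene length matches:", computed_gene == REPORTED_GENE_LENGTH)
    print("protein length matches:", computed_protein == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, both reported lengths agree with the coordinates. The strand field does not affect the length calculation.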
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.211989 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGATCC CGAGCGATTG CGATGTCGCC GTGCTCGGCG CCGGCCCGGC CGGCAGCAGC TTCGCCGCGC TGGTCAAAAA GTACGCGCCC GGATTGCGCG TGGTGGTGCT CGAGCGCGCG CGCTTCCCGC GCTGGCGCAT CGGCGAATCC ACGATCCCGG TGGCCAACGC GGTGCTGCGC GATCTCGGCG TGTACGAGCG CCTGGCCGCC AGCGACGCGG TCAAGAAGAT CGGCATCACC TTCGTGTGGG GCAAGGACCG GCAGCCGTGG AACGCCGACT ACTTGCAGCT CGCGCGCGAG GGCGCGGGCG AGGACCCGGG CGCCGTGCTC GACGTCGTCG GCCAGGACTT CGCCGGCCTG CGCCGCGAGC AGCAGAGCGA GCCGTTCACG GCCTTCAACA TCCGCCGCGA TCGCTTCGAT GCCCTGCTCC TCGAGCAGGC GCGCGGGTTC GGCGCCGAGG CCTTCGAGGG CGTGCGCGCC ACCTCGGTCC GCCGCGAGGG CGACGAGATG CGCGTGGCCT GGAGCGACGA CGACGGCGCC AGCGGTACCT TGAACGCCGG CTTCGTGCTC GACGCCACCG GGCTGGGCGC GCTCATGACC CGCGGCCGCC GCGAGCGCGA CCCGCACATG AACAACTTCG CGGTCTACGG CTACTTCGCG GGCGCCGGCT GGAAGGTCAC CTACAGCGGC GAGCGCTCGC ACACCACCGT GTTCATCGCC AGCATCCCGC ACGGCTGGAT CTGGTACTTC CCCATCGCCG AGGACGTGAT GAGCGTCGGC GTGGTCACCC ACCGCGACCA CTTCCGCGAC CGCCTGGCCG GCATCGAGCT CGAGACCTTC TACCGCGAGC AGCTCGCGGC CTGTCCCGAG ATCGCGCCGC TGCTCGCCGA CGCCCGCCTG CGCGACGACG TCCTGCCCGG GGGCGCGCGC GTCGGCGCCA GCCAGGACTG GTCGTCGTGG GCCGAGCAGC CGGTGGGCCC GGGCTGGGCC GCGGCCGGCG ACGCCGCCGT GTTCGTCGAT CCCATCCTGT CCTCGGGCGT GACCCTGGCG CTGCAGAGCG GCCACCGCGC GGCCTACACC CTGCTCACCG CGCGCGCCCA TCCCGAGTTC GACCGGGACG CGCTGTGGCG CGCCTACGCC GATTATCTGC GCGGCGAGGC CGGCGCCTTC CTCAAGCTGG CGCGCTTTTT CTACGGCAAT AACCGCGCCG CCGAGTCGTG GTGGTGGGAG GCCCAGCGGC TGGTCAACGC CTCGGGGCAG CTCGACATCG ACCCGGCGCG CGCCTTCACC ATGGCCACGG CCGGCTTCTT TCCGCTGCCG CGGGCGCTGT CGCTCGAGAT CGTCGGGCCG CTGATCACGG GCGCTGCCGG CTCGGACGCG GACCTGCGCT ACGTGCACGA GAACAGCGGC GTGCCCGCGC CCGAGCAGCT CGCCGAGCAG AGCTATGAGG TGCTGACCCG CTTTCGCCTG GCGCTGCGCA CCGAGCCCGC ACGCAGCGCG CCGCCGGGGC AGCTGCGCGT GTTCCACGAC CTGGTGAGCG ACGATCCCGC GTTTTCGCAC CGCCTGGCCG CGGCGCCGAC CGAGATCTCG CCGCAGCTCG CGCCCGTGGT GGACGCCCTG CAGGAGGAGC GCAGCGTGCG CGCCCTCATG GATCGGGCGC CCTCGCTGGT GCCGCCCCAC CTGGCCCAGC CGGCCGATGC GCGCCGTCTG GCGGCGCACA TCGTGCGCGT GGCCGCCATC AAAGGCTTCG TGCAACTCTC GTGA
|
Protein sequence | MRIPSDCDVA VLGAGPAGSS FAALVKKYAP GLRVVVLERA RFPRWRIGES TIPVANAVLR DLGVYERLAA SDAVKKIGIT FVWGKDRQPW NADYLQLARE GAGEDPGAVL DVVGQDFAGL RREQQSEPFT AFNIRRDRFD ALLLEQARGF GAEAFEGVRA TSVRREGDEM RVAWSDDDGA SGTLNAGFVL DATGLGALMT RGRRERDPHM NNFAVYGYFA GAGWKVTYSG ERSHTTVFIA SIPHGWIWYF PIAEDVMSVG VVTHRDHFRD RLAGIELETF YREQLAACPE IAPLLADARL RDDVLPGGAR VGASQDWSSW AEQPVGPGWA AAGDAAVFVD PILSSGVTLA LQSGHRAAYT LLTARAHPEF DRDALWRAYA DYLRGEAGAF LKLARFFYGN NRAAESWWWE AQRLVNASGQ LDIDPARAFT MATAGFFPLP RALSLEIVGP LITGAAGSDA DLRYVHENSG VPAPEQLAEQ SYEVLTRFRL ALRTEPARSA PPGQLRVFHD LVSDDPAFSH RLAAAPTEIS PQLAPVVDAL QEERSVRALM DRAPSLVPPH LAQPADARRL AAHIVRVAAI KGFVQLS