Gene Hoch_5510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5510 
Symbol 
ID8547923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7556583 
End bp7558346 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content73% 
IMG OID646390183 
Producttryptophan halogenase 
Protein accessionYP_003269886 
Protein GI262198677 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.211989 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGATCC CGAGCGATTG CGATGTCGCC GTGCTCGGCG CCGGCCCGGC CGGCAGCAGC 
TTCGCCGCGC TGGTCAAAAA GTACGCGCCC GGATTGCGCG TGGTGGTGCT CGAGCGCGCG
CGCTTCCCGC GCTGGCGCAT CGGCGAATCC ACGATCCCGG TGGCCAACGC GGTGCTGCGC
GATCTCGGCG TGTACGAGCG CCTGGCCGCC AGCGACGCGG TCAAGAAGAT CGGCATCACC
TTCGTGTGGG GCAAGGACCG GCAGCCGTGG AACGCCGACT ACTTGCAGCT CGCGCGCGAG
GGCGCGGGCG AGGACCCGGG CGCCGTGCTC GACGTCGTCG GCCAGGACTT CGCCGGCCTG
CGCCGCGAGC AGCAGAGCGA GCCGTTCACG GCCTTCAACA TCCGCCGCGA TCGCTTCGAT
GCCCTGCTCC TCGAGCAGGC GCGCGGGTTC GGCGCCGAGG CCTTCGAGGG CGTGCGCGCC
ACCTCGGTCC GCCGCGAGGG CGACGAGATG CGCGTGGCCT GGAGCGACGA CGACGGCGCC
AGCGGTACCT TGAACGCCGG CTTCGTGCTC GACGCCACCG GGCTGGGCGC GCTCATGACC
CGCGGCCGCC GCGAGCGCGA CCCGCACATG AACAACTTCG CGGTCTACGG CTACTTCGCG
GGCGCCGGCT GGAAGGTCAC CTACAGCGGC GAGCGCTCGC ACACCACCGT GTTCATCGCC
AGCATCCCGC ACGGCTGGAT CTGGTACTTC CCCATCGCCG AGGACGTGAT GAGCGTCGGC
GTGGTCACCC ACCGCGACCA CTTCCGCGAC CGCCTGGCCG GCATCGAGCT CGAGACCTTC
TACCGCGAGC AGCTCGCGGC CTGTCCCGAG ATCGCGCCGC TGCTCGCCGA CGCCCGCCTG
CGCGACGACG TCCTGCCCGG GGGCGCGCGC GTCGGCGCCA GCCAGGACTG GTCGTCGTGG
GCCGAGCAGC CGGTGGGCCC GGGCTGGGCC GCGGCCGGCG ACGCCGCCGT GTTCGTCGAT
CCCATCCTGT CCTCGGGCGT GACCCTGGCG CTGCAGAGCG GCCACCGCGC GGCCTACACC
CTGCTCACCG CGCGCGCCCA TCCCGAGTTC GACCGGGACG CGCTGTGGCG CGCCTACGCC
GATTATCTGC GCGGCGAGGC CGGCGCCTTC CTCAAGCTGG CGCGCTTTTT CTACGGCAAT
AACCGCGCCG CCGAGTCGTG GTGGTGGGAG GCCCAGCGGC TGGTCAACGC CTCGGGGCAG
CTCGACATCG ACCCGGCGCG CGCCTTCACC ATGGCCACGG CCGGCTTCTT TCCGCTGCCG
CGGGCGCTGT CGCTCGAGAT CGTCGGGCCG CTGATCACGG GCGCTGCCGG CTCGGACGCG
GACCTGCGCT ACGTGCACGA GAACAGCGGC GTGCCCGCGC CCGAGCAGCT CGCCGAGCAG
AGCTATGAGG TGCTGACCCG CTTTCGCCTG GCGCTGCGCA CCGAGCCCGC ACGCAGCGCG
CCGCCGGGGC AGCTGCGCGT GTTCCACGAC CTGGTGAGCG ACGATCCCGC GTTTTCGCAC
CGCCTGGCCG CGGCGCCGAC CGAGATCTCG CCGCAGCTCG CGCCCGTGGT GGACGCCCTG
CAGGAGGAGC GCAGCGTGCG CGCCCTCATG GATCGGGCGC CCTCGCTGGT GCCGCCCCAC
CTGGCCCAGC CGGCCGATGC GCGCCGTCTG GCGGCGCACA TCGTGCGCGT GGCCGCCATC
AAAGGCTTCG TGCAACTCTC GTGA
 
Protein sequence
MRIPSDCDVA VLGAGPAGSS FAALVKKYAP GLRVVVLERA RFPRWRIGES TIPVANAVLR 
DLGVYERLAA SDAVKKIGIT FVWGKDRQPW NADYLQLARE GAGEDPGAVL DVVGQDFAGL
RREQQSEPFT AFNIRRDRFD ALLLEQARGF GAEAFEGVRA TSVRREGDEM RVAWSDDDGA
SGTLNAGFVL DATGLGALMT RGRRERDPHM NNFAVYGYFA GAGWKVTYSG ERSHTTVFIA
SIPHGWIWYF PIAEDVMSVG VVTHRDHFRD RLAGIELETF YREQLAACPE IAPLLADARL
RDDVLPGGAR VGASQDWSSW AEQPVGPGWA AAGDAAVFVD PILSSGVTLA LQSGHRAAYT
LLTARAHPEF DRDALWRAYA DYLRGEAGAF LKLARFFYGN NRAAESWWWE AQRLVNASGQ
LDIDPARAFT MATAGFFPLP RALSLEIVGP LITGAAGSDA DLRYVHENSG VPAPEQLAEQ
SYEVLTRFRL ALRTEPARSA PPGQLRVFHD LVSDDPAFSH RLAAAPTEIS PQLAPVVDAL
QEERSVRALM DRAPSLVPPH LAQPADARRL AAHIVRVAAI KGFVQLS