Gene Hoch_1673 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1673 
Symbol 
ID8544055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2279139 
End bp2280161 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content64% 
IMG OID646386381 
Product2OG-Fe(II) oxygenase 
Protein accessionYP_003266116 
Protein GI262194907 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0120269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0132289 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAAG AAAGCGAAAC GCTGTATATG ACGTCGACCA ATAGCGATCA GAACGCCAGA 
ATTCTCGCGA GCGAGGAGCT CATCGTCCTC GATCTGGCAC AGTTCCGGAA AGGCACGCCG
GCCGAGGTCG AGCGCTATCT CGCCGATCTG CGCCGCTCGG CGAGCGATCT CGGCTTTCTC
TGCGTGGCCA ATCACGGCGT CGACCTCGAC TTGCTCGATC GTTGCTACGA GCTGAGCGAG
CGCTTCTTCG CGCTGCCGCT CGCGGACAAG ATGGAGCTCC GCTACGATCA GATCGATCAA
AAGAAGTATA GCGACATCGG GTATTTCCCC TATCGCATCG AGACCGCGCA GGGCAGCGAG
ATGCCCGATC TCAAGGAGAT GTTCCACGTC GGTCAGGACG TCCGCGAAGG CCACCGCCTG
AGCGAGTACT ACGCGACCAA CGTCTGGCCG CGCTCGATGC CTCAATTCCG CGCCCCCTTC
GACACGCTGT TCGAGCAGCT CCGCGGCGCG GGCGACACCA TCATGCGCTC GATCGGGCGC
AGCTACGACA TGGACGCGGC CTACGTGGAC GAGCTGATCC ACGAGGGCAA CAGCATGCTG
CGGACGCTGC ACTATCCGCC GATCCAGGCG GGGGAAGAGG GCATGCGCGC GGAGGCTCAC
ACCGGTATCC AGCTCCTGGG ACTACAGCCG CGGGCCAGCG ACGACGGGCT GCAGTTCCTC
ACCCCGGCGG GCGAATGGGT CGCGGTCGAT CGCGCGGCGT GCAGCGACTA CTTGCTCATC
AATCTGGGCG ATATGCTGGC CTACATGCTC GAAGACAGCA TCCAGGCCAC CGTGCATCGG
GTGGTCAACG CCAACCGCGA GCGCGCCCGC TACGCCATCG TGTATTTCTA CCACCCCAGC
TCGGTGGCCT TTCTGTGCAA GCGCGGGGAC CCCGGCCAGC AGCGCGACAG CCTGCGCGCG
GGCGATTGGC TGCTCAAGCG TCTCGAGGAG ATCAAGCTGT TCTCGGGTCG AGCGGGCCAT
TGA
 
Protein sequence
MSEESETLYM TSTNSDQNAR ILASEELIVL DLAQFRKGTP AEVERYLADL RRSASDLGFL 
CVANHGVDLD LLDRCYELSE RFFALPLADK MELRYDQIDQ KKYSDIGYFP YRIETAQGSE
MPDLKEMFHV GQDVREGHRL SEYYATNVWP RSMPQFRAPF DTLFEQLRGA GDTIMRSIGR
SYDMDAAYVD ELIHEGNSML RTLHYPPIQA GEEGMRAEAH TGIQLLGLQP RASDDGLQFL
TPAGEWVAVD RAACSDYLLI NLGDMLAYML EDSIQATVHR VVNANRERAR YAIVYFYHPS
SVAFLCKRGD PGQQRDSLRA GDWLLKRLEE IKLFSGRAGH