Gene Hoch_3904 details


Gene Information

Locus tag: Hoch_3904
Symbol:
ID: 8546300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5386360
End bp: 5387877
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 75%
IMG OID: 646388576
Product: imidazolonepropionase
Protein accession: YP_003268296
Protein GI: 262197087
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID: [TIGR01224] imidazolonepropionase
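
The coordinate and length fields above can be cross-checked with a little arithmetic. The Python sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5386360, 5387877)
print(length, protein_length(length))  # prints: 1518 505
```

Under those assumptions, the computed values match the Gene Length and Protein Length fields of this record.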


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.588827
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.187263
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACA ACCTGACGAG CATCGAGGTC ATCCGCGCCG GCACGGCGCT GCGCGTGGGC 
CGCGCCGATG GCAGCGCGCG CACGGTTCGG CGCATCGACC TGGCCCGGGT GCTGGGCGCG
CCCAAGACCG ACGCCCCGCG CTTCGCGGCT GCCGAGATCG ACGGCGACCG CGTGCGCCTG
CGCTACGCCG ACGGCCACAG CGAGCTGCTG GCCGCGGCCT ATCTCGACGA GCCGCCCGAG
GACCGGCGCG CGGACCTGGT GGTCGAGGAC GCCGGGCTGC TGCTCACGGC CAACGGCCCG
GGCACCGGCG AAGAGGGGCT GGGACTGCTG CAACACGGCG CCGTCATCGC CAGCGGCGGC
CGGGTGCGCT GGGTCGGCCC CAGCGCCGAG GTCGGCCGCG CCGGCTTCGA CATCAGCGGC
GCCGAGCGCG TGCGCGCGGG CGGGCGCCTG GTGACGCCCG GGCTGGTCGA TTGCCACGTG
CACCCGCTGT TCGCCGGCAA CCGCGCCAGC GAGTTCGGCG CGCGCGCGGC CGGGCGCAGC
TACCAGGAGA TCGCCCAGGC GGGCGGCGGC ATCCAGGCCA CGGTGCAGCC GACCCGGGCG
GCGTCCTTCG ACACGCACAT CATCCAGACC GTGGCGCGCA TGAACCGCAT CCTGGCCGCG
GGCACCACGA CCTGCGAGGC CAAGAGCGGC TACGATCTCA CCGTCGCCGG CGAGCTGCGC
CTGCTGGCGA TCGCGCTGGC CGCGGACGCG CTGAGCCCGC TCGACCTGAG CCCGACGCTG
CTCGGCGCCC ACGCCCTGCC GCCCGAGTAC GCCGACGACC GCGCCGGCTA CGTGCGCGCG
GTGGCCGAGC AGATGGTGCC GCAGGTGGCG CGCGAGCGGC TGGCCGAGAC CGTCGACGTG
TACTGCGACG ACAACGCCTT CACGCTGGCC GAGACCCGGC AGATCCTCGA GGCCGCCAAG
CGCGAAGGTC TGGCGCTGCG CGTGCACGCC GGGCAGTTTG CCGACCTGGG CGCGGCCGAG
CTCATCGCCG AGCTCGGCGG CCTCAGCGCC GATCATCTCG AGCAGGTCTC GCCCGCCGGC
ATCTCGGCCA TGGCCGAGCG CGGGGTCGTG GCCGTGATGC TGCCGGGCGC CTGCGTGCAG
CTCGGGCTGC CGCAGCCGCC GGTCTCCGCG CTGCGCGAGG CCGGCGTGGC CATGGCCGTG
GCCACCGACA ACAACCCCGG CACCAGCCTG TGCGACAGCC TGCCGGTGCA GATGTGGCTG
GCGACCACGC ACTACCGCAT GTCGGTGCCC GAGGCCTGGC TCGGGGTCAC GCGCCACGCA
GCGCGCGCGC TGGGCCGCAA GGACATCGGC GTGCTGGCGC CGGGCGCCCG CGCCGACCTG
CTGCTGTGGA ACGCCGAGAC GCCGGCCGAG ATCCCCTATC ACTACGGCGC CAATCTGGTC
GATCGGGTGA TCAAGAACGG CCGCACCGTA CACGTGCGGC CCTCGACCCA GGCCGCCCAC
TGGGCCGGCA TGGCGTGA
 
Protein sequence
MSNNLTSIEV IRAGTALRVG RADGSARTVR RIDLARVLGA PKTDAPRFAA AEIDGDRVRL 
RYADGHSELL AAAYLDEPPE DRRADLVVED AGLLLTANGP GTGEEGLGLL QHGAVIASGG
RVRWVGPSAE VGRAGFDISG AERVRAGGRL VTPGLVDCHV HPLFAGNRAS EFGARAAGRS
YQEIAQAGGG IQATVQPTRA ASFDTHIIQT VARMNRILAA GTTTCEAKSG YDLTVAGELR
LLAIALAADA LSPLDLSPTL LGAHALPPEY ADDRAGYVRA VAEQMVPQVA RERLAETVDV
YCDDNAFTLA ETRQILEAAK REGLALRVHA GQFADLGAAE LIAELGGLSA DHLEQVSPAG
ISAMAERGVV AVMLPGACVQ LGLPQPPVSA LREAGVAMAV ATDNNPGTSL CDSLPVQMWL
ATTHYRMSVP EAWLGVTRHA ARALGRKDIG VLAPGARADL LLWNAETPAE IPYHYGANLV
DRVIKNGRTV HVRPSTQAAH WAGMA
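
The sequences above are laid out in blocks of ten characters, so the spaces and line breaks have to be removed before any downstream use. A minimal Python sketch of that step follows, together with a simple GC-fraction calculation. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in a contiguous DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above, copied verbatim.
first_line = "ATGAGCAACA ACCTGACGAG CATCGAGGTC ATCCGCGCCG GCACGGCGCT GCGCGTGGGC"
dna = join_blocks(first_line)
print(len(dna), dna[:3], round(gc_fraction(dna), 2))
```

Passing the complete gene sequence to the same helpers would allow a comparison with the Gene Length and GC content fields of the record.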