Gene Hoch_5176 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5176 
Symbol 
ID8547588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7126454 
End bp7128514 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content71% 
IMG OID646389853 
Productamidohydrolase 
Protein accessionYP_003269557 
Protein GI262198348 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.167618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.830849 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAGCT TATCCCGCCC CCTCCTCACC TGCCTCGCTG TGCTCGCGCT CGCCGTGGGC 
TGCGCGCCCA GATCCGGCTC CGCCGAGAAC CCAGATCCAG AGCAGGACCA GCAGCAGAAC
CCGCAGCGGC GCTACAGCGT GCTGCTGATG GGCACGCGCG TGGGCGAGCA GGTCGTGCGC
CAGGACGCCA GCGGCCACAG CCTGGTGACA TTTTCGTACA ACGACCGCGG CCGCGGCCCC
GACCTCAACG CCGAATTCTC GGTGTCGCCG GACGGGCTGC CGGAGCGCGT CGCCATCACC
GGCCGCGACT ATCTCGGCCA GGCGGTCGAC GAATTCTTCG AACGCCGCGA CGGTCGCGCC
GAGTGGCGCA GCAACGGCAA CTCGGACAGC CGGACGGTAT CCGCGCCGGG GTTTTACTGG
GCCATGGCCG GCGTCCCCGC GCAAGGCGCC ATCCTGGCCC GCGCGCTGCT GCGCAGCCCC
GAGCACAGCC TGCCCATGTT GCCCGGCGGC CGAGCCGCGA TCGAGCGACT CGGCGCCGAG
ACCCTGCGCG CCCCCGATGG TTCGCCGCGG CAGGTGGTGC ACTACGCCAT CACCGGCCTC
AGCTTCGGAC CCGAGCACAT CTGGCTCGAC GACGAGCAGC GCTTCTTCGC CTCGCTGTCG
AGCTGGGTCG CCATCATCGA GGAGGGCTAC GAGGACGATG CAGAGCGCCT GGCCGATATT
CAGGAGCAGG CCGATGCCGC GCTGCGACAG GAGCAGGCCG AGCGCCTGGC CGAGCGCCCC
GCGGGCGCGC TGGTGATTCG CGGCGTCGCC GTATTCGACC CCGAGACCCG CACCCTGCTG
CGCGATCGCG ACGTCATCGT CGAGGGCGAG CGCATCGCCG CGCTCGCGCC CGCCGGCAGC
GCAAAGCTCC CCGCGCAGGC CACCGTGATC GACGGCGTGG GCAAGACCCT GCTGCCCGGG
CTGTGGGACA TGCACGTGCA CCTCAACGAC ACCGACAGCC TGCTGCATCT GGCCCTGGGC
GTGACCTCGG TGCGCGACCT GGGCAACGAC ATCGAGTATC TCAGCCGCTA CCAGCAGGCC
TGGCAGAGCG GCCAGCGGCT GGGCCCGAGG TTGGCGGTCA AAGCCGGACT CATGGACGGC
CCCGGCCCCT ACGCGGGCCC CACCAAGGTG CTGGTGGCCA GCCGCGAGCA GGCGCGCGCG
GCCATCGATC GCTACGCCGA GCTCGGCTAC CCGCAGATCA AGATCTACAG CTCGCTGCGG
CCCGAGCTGC TGCCCGATAT CATCGATTAC GCCCACGCCC GCGGGCTGCG CGTGAGCGGA
CACATCCCGG CGTTCATGAG CGCGGCCCAG CTCGTCGCGC TCGGCCTCGA CGAGCTGCAG
CACATCAACT TCGTGGTGCT CAACTTCCTC TTCGACGAGG TCAAGGACAC GCGCACGCCC
GCTCGCTTCC AGGCCGTGGC CGAACACGCC CACAAGCTCG ATCTCGACAG CCCCGAGGTG
CAGGCGTTTA TCGACCTGCT GGTCGAGAAC CAGGTCGTGG TCGATCCCAC GGTGTCGATC
TTCGAGAGCA TGTTCAACGA CCGACCGGGC GAGATGTCCA CGGTGTTCGC GCCGGTGGCC
GATCGCCTGC CCGTGCAGGT GCGGCGCAAT CTGCTCGACG GCGGCCTGCC CGCCGACGAA
GCCACGCGCG CGCGCTACGG CGACTCCTTC GACACCCTGC TGGCGCTGGT GGCGCGGCTG
CATCGCGCCG GCGTGAGCAT CGTCGCCGGC ACCGACTCGC TGGCCGGCTT CGCCCTGCAC
CGCGAGCTCG AGAACTACGT GCGCGCCGGC ATCCCGGCGC CCGAGGTGCT GCGCATCGCC
ACCCTCGAGG CCGCGCGCCT GGCCGGCGCC GCCGACCAGC TCGGCACCAT CGCGCCGGGC
AAGCTCGCCG ACATGGTGCT GGTCGAGGGC GACCCCACCA GCGACATCCG CGCCATCCGC
GCGGTCGAAC TCACGGTCCA GCGGGGGACG ATCTTCCGCT CCGCGCGACT GCTCCAGAGC
ATGGGCATCG CGCCGCGCTG A
 
Protein sequence
MRSLSRPLLT CLAVLALAVG CAPRSGSAEN PDPEQDQQQN PQRRYSVLLM GTRVGEQVVR 
QDASGHSLVT FSYNDRGRGP DLNAEFSVSP DGLPERVAIT GRDYLGQAVD EFFERRDGRA
EWRSNGNSDS RTVSAPGFYW AMAGVPAQGA ILARALLRSP EHSLPMLPGG RAAIERLGAE
TLRAPDGSPR QVVHYAITGL SFGPEHIWLD DEQRFFASLS SWVAIIEEGY EDDAERLADI
QEQADAALRQ EQAERLAERP AGALVIRGVA VFDPETRTLL RDRDVIVEGE RIAALAPAGS
AKLPAQATVI DGVGKTLLPG LWDMHVHLND TDSLLHLALG VTSVRDLGND IEYLSRYQQA
WQSGQRLGPR LAVKAGLMDG PGPYAGPTKV LVASREQARA AIDRYAELGY PQIKIYSSLR
PELLPDIIDY AHARGLRVSG HIPAFMSAAQ LVALGLDELQ HINFVVLNFL FDEVKDTRTP
ARFQAVAEHA HKLDLDSPEV QAFIDLLVEN QVVVDPTVSI FESMFNDRPG EMSTVFAPVA
DRLPVQVRRN LLDGGLPADE ATRARYGDSF DTLLALVARL HRAGVSIVAG TDSLAGFALH
RELENYVRAG IPAPEVLRIA TLEAARLAGA ADQLGTIAPG KLADMVLVEG DPTSDIRAIR
AVELTVQRGT IFRSARLLQS MGIAPR