Gene Hoch_2108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2108 
Symbol 
ID8544494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2926366 
End bp2927952 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content64% 
IMG OID646386815 
ProductNitrilase/cyanide hydratase and apolipoprotein N- acyltransferase 
Protein accessionYP_003266546 
Protein GI262195337 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.943392 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAAT GGAAGGGCAT GAGCGAAACC AGTGAAAACA AGAACCTCAC CGTACGCACT 
GCCGTCATCG ACGACATCCC GCGCATCCGC GATCTCGTCC ACCGGGTCTA TGAGAAGGCC
GATCTCGGCA CCTACAGCGA AGCCATGCTG CGCGGCCAGA TCAACAGCTT CCCCGAGGGC
AAGTTCATCG TCGAGTACGG CGACGACATC GTCGGCTACT GCTCGACCTT CTTGATCGAC
GGGGAGACCG GTCTCAATCC CCACACCTGG ACCGAGATCA CGGGCGGCGG CTTTGGTTCA
CGCCACAACC CCGAGGGCGA TTACCTCTAC GGCATGGAAG TGTGCGTCGA CCCTGCGGCC
CGCGGCCTGC GCATCGGCCA GCGGCTGTAC AACGAGCGCA AAAAACTGTG CCGGGCCTGG
CGCCTGCGCG GCATCATCAT CGTCGGTCGC ATCCCGAGCA TGACGCGTCG GCTCAAGAAG
TACGACTCGC CCGAGGCCCT GGTCGAAGAC GTGGTGGCGA AGAAAACCCG CGACCAGGTG
CTCAGCTTTC AGCTCCGCAA CGGCTTCGAG TACGTGCGCC TGCTGCCCGG GTATCTGCCC
TCGGACCACG AATCGGCCGG CTACGGCGTC CAGCTCGTGT GGCACAACCC GCACGAGCCC
TCGGACGAGC GCGAGTACAA GCAGCGCAGC TCGGGTCACG TGCAGGATCT GGTGCGCGTG
GCCACGGTGC AGTACATGCA GCGCCGCGTC GAGTCCTTCG ACCACTTCGT CAAGCTGGTC
CGCTACTTCG TCGACGTAGT GTCCGACTAC CGCGCCGACT TCGTGGTCTT CCCCGAGCTG
TTCACGCTGC AGCTCCTGTC CATGGAGGAC GAGGAGCTCA AGCCGGCGGC CGCCATCGAG
CAGCTCACGC GCTACGTGCC CCGGCTCAAG GAGGTGCTGC GCGAGCTGGC GATGAAATAC
AACGTCAACA TCATCGGCGG CTCGCACCCC ACGCACACCG AGTCCGGCGC GGTGCAGAAC
ATCGCGTACG TGTGCCTGCG CGACGGCACC GTGCACGAGC AACCCAAGCT GCACCCCACG
CCCAGCGAGG TGCGCTGGTG GAATATCGAG GGCGGCGACA CGCTCAAGGC CATCGACACC
GACTGCGGCC CCATCGGCGT GCTCATCTGC TACGACTCCG AGTTCCCGGA GCTGGCCCGG
CACCTCATCG ATCAGGGCGC CAACATCCTG TTCGTGCCCT TCTGCACCGA CGAGCGCCAG
AGCTATCTAC GCGTGCGCTA TAGCTGCCAG GCCCGCGCGG TCGAAAACCA GTGCTACGTG
GTCATGTCCG GCAACGTTGG CAACCTGCCC AACGTGTCGA ATATGGACAT CCAGTACGCG
CAGAGCTGCA TCCTTACGCC CTGCGATTTC CCCTTCGCGC GCGACGGCAT CGCGGCCGAC
ACCACGCCCA ACGTCGAGAC CGTGGCCTTT GCCGACCTGC GGATGGAGTC GCTGCGCTAC
GCGCGCAACA GCGGCACGGT GCGCAACCTC AAGAACCGCC GCCACGATCT GTATCGGGTG
GTGTGGAACG AGAAGAAGCC GAGCTGA
 
Protein sequence
MKEWKGMSET SENKNLTVRT AVIDDIPRIR DLVHRVYEKA DLGTYSEAML RGQINSFPEG 
KFIVEYGDDI VGYCSTFLID GETGLNPHTW TEITGGGFGS RHNPEGDYLY GMEVCVDPAA
RGLRIGQRLY NERKKLCRAW RLRGIIIVGR IPSMTRRLKK YDSPEALVED VVAKKTRDQV
LSFQLRNGFE YVRLLPGYLP SDHESAGYGV QLVWHNPHEP SDEREYKQRS SGHVQDLVRV
ATVQYMQRRV ESFDHFVKLV RYFVDVVSDY RADFVVFPEL FTLQLLSMED EELKPAAAIE
QLTRYVPRLK EVLRELAMKY NVNIIGGSHP THTESGAVQN IAYVCLRDGT VHEQPKLHPT
PSEVRWWNIE GGDTLKAIDT DCGPIGVLIC YDSEFPELAR HLIDQGANIL FVPFCTDERQ
SYLRVRYSCQ ARAVENQCYV VMSGNVGNLP NVSNMDIQYA QSCILTPCDF PFARDGIAAD
TTPNVETVAF ADLRMESLRY ARNSGTVRNL KNRRHDLYRV VWNEKKPS