Gene Hoch_1853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1853 
Symbol 
ID8544235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2550987 
End bp2552450 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content75% 
IMG OID646386559 
Productformiminoglutamate deiminase 
Protein accessionYP_003266294 
Protein GI262195085 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02022] formiminoglutamate deiminase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.520971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGCC TGTTTGCCAG ACGCGCCCTG CTGCCCGAGG GCTGGGCGCG CGATGTCTGT 
GTGGAGATCG GAGAGGACGG TTGTGCCAGC GCGGTGCGCG CGGGCGCGGC GGCGCCGCCG
GATGCCGAGA CGGTGCCCGG GGTGCTGGTG CCCGGCATGC CCAACCTGCA CAGCCACGCG
TTTCAGCGCG CCATGGCCGG GCTGGGCGAG CGCGGCCACT TCGCCGCGCC GCCCACGGAG
GTGGGCGCGC CCGCGTCTGT GTCCGATGTC GTCCACGCGT CCGACGATGC CGCGGCCCGG
GCGGCGGCCG CGCGCGACAG CTTCTGGTCG TGGCGCGAGA CCATGTACGC GTTCGTGGCG
CAGCTCTCGC CCGACGACGT CCACGCCATC GCCAGCCAGC TCTTCGTGGA GATGCTGACG
GCCGGCTACA CCGGCGTGGC CGAGTTCCAC TATCTGCATC ACGCGCCCGA CGGCCGCCCC
TACGCGGATC CCGCGGCCAT GGCCGAGGCA GTGATCGCGG CCGCGCGCGA GACCGGGATC
GCGCTCACCC TGCTGCCGGT GCTGTATGTG CACGGCGGTT TCGGCGACCA GCCCGCGGGC
CCGGCGCAGC GCCGCTTCGT GCACGCGCCC GACGACTTCG CCGCGCTGCT GCAGCTCCTG
GCCACGCGCC ACGCCGGCGC GCCCGGGCTG CGTCTGGGCG TGGCCCCGCA CAGTCTGCGC
GCGGTGTCGC CGAGCGCGCT GGCCCGGGTC ATCGCGGCCG GTGACGCGCT CGGCGAGGAC
GCGCCGGTTC ACGTCCACGT GGCCGAACAA ATCCGCGAGG TCGAGGACTG CGTGGCCTGG
AGCGGTCGCC GGCCGGTGGC CTGGCTGCTC GACCACGCGC CGGTGGGGCC GCGCTGGTGC
CTGGTCCACG CCACGCACAT CGACGCCGAC GAGCTGCACG CGCTGGCGCG CTCGGGCGCG
GTCGCCGGGC TGTGTCCGAG CACCGAGGCC AATCTCGGCG ACGGGCTGTT TCCATTGGCG
GCCTATCTGG AGGCCGGCGG CGCGCTCGGC ATCGGCTCGG ACAGCAACGT GAGCGTGAGC
CCGCGCGCCG AGCTGCGGCT GCTCGAGTAC GGCCAGCGGC TGCGGGCGCA GGCGCGCAAT
ATCGCGGCCT CGCCGGCGCG ACCGGCGACC GGCGCGCGGC TCTACGCGGC CGCCCTGGAC
GGCGGTGCCC GGGCCCTGGG CCAGCCGATG GGCGCCATCG CGCCCGGCCA TCGCGCCGAC
TACCTCGCGC TCGACGACGA GCATCCGCTG TTAGTCGGAC GATCGGACGA TCTCGTGCTG
GACGCGCTGG TGTTCGCGGG CGAGCACAAT CCGCTGCGCC AGGTCATGGT CGGCGGCGCC
TGGGTGGTGC GCGACGGCGC GCACCCGGCC CAGGCCGAGG TCGCCGCGCG CTACCGGCGC
GTCATGCGCA AGCTGCTGGG CTGA
 
Protein sequence
MQRLFARRAL LPEGWARDVC VEIGEDGCAS AVRAGAAAPP DAETVPGVLV PGMPNLHSHA 
FQRAMAGLGE RGHFAAPPTE VGAPASVSDV VHASDDAAAR AAAARDSFWS WRETMYAFVA
QLSPDDVHAI ASQLFVEMLT AGYTGVAEFH YLHHAPDGRP YADPAAMAEA VIAAARETGI
ALTLLPVLYV HGGFGDQPAG PAQRRFVHAP DDFAALLQLL ATRHAGAPGL RLGVAPHSLR
AVSPSALARV IAAGDALGED APVHVHVAEQ IREVEDCVAW SGRRPVAWLL DHAPVGPRWC
LVHATHIDAD ELHALARSGA VAGLCPSTEA NLGDGLFPLA AYLEAGGALG IGSDSNVSVS
PRAELRLLEY GQRLRAQARN IAASPARPAT GARLYAAALD GGARALGQPM GAIAPGHRAD
YLALDDEHPL LVGRSDDLVL DALVFAGEHN PLRQVMVGGA WVVRDGAHPA QAEVAARYRR
VMRKLLG