Gene Hoch_3948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3948 
Symbol 
ID8546344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5446016 
End bp5447344 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content72% 
IMG OID646388620 
Productdihydroorotase, multifunctional complex type 
Protein accessionYP_003268340 
Protein GI262197131 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.121756 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0185231 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTCC TACTTCGTGG TGGTCGCGTC ATCGATACTT CTGGCGAGGC GGCAGGTGGT 
GGCTCCGGTC GCGCGGTGCT CGACGCCGCC TGTGACGTGC TCGTGCGCGA CGGACGCATC
GTCGAGATCG GCCGCGGCCT GGCCGCGCCC AGCGGCGTGC GCGAGCTCGA TCTCGCCGGC
AAGCTGGTGT GCGCCGGTCT GGTCGACCTG CACGTGCACT TCCGCGAGCC CGGCCACGAG
TACAAGGAAG ACATCGCCAG CGGCTCGGCC ACGGCCGCGG CCGGCGGCTT CACCACCGTG
TGTTGCATGC CCAACACCAA GCCGGTCAAC GACTGCCGCG CGGTCACCGA TCTCATCGTC
CGGCGCGCGC GCGAGGCCGG CCTGTGCCGC GTGCGCCCGG TCGGCGCCAT CTCGCGCGGG
CTGGCCGGCG AGGCCCTGGC CGAGATCGGC GAGATGCGCG ACGCGGGTAT CGTCGCGGTG
TCCGACGACG GCATGCCGGT GATGAACGCC GGCCTGATGC GCCGCGCGCT CGAGTACGCG
CGCACCTTCG ATTTGCCCGT GGTGCAGCAC GCCGAGGATC TCGACCTGGC CGAGGGCGGC
GCCATGAACG AGGGCGAGGT GGCCACCCGC ATCGGCGTGC GCAGCCAGCC CGCGCAGGCC
GAGTCGGTCA TGGTCGCGCG CGACATCGAG CTGGTGTCGT GGACCGGGGC CCGCTACCAC
GTCGCCCACA TCTCGGCCGC GCGCTCGGTC GATCTCGTGC GCGAGGCCAA GCGCCGCGGG
CTGCCGGTGA GCTGCGAGGT CACGCCGCAC CACTTCGCGC TCACCGACGA GGCCTGCGCC
AGCTACGACA CCCACGCCAA GTGCATGCCG CCGCTGCGCA CGCAGGCCGA TCTCGACGCC
ATCAAAGAGG GCATGGCCGA CGGCACCATC GACTGCATCG CCACCGACCA CGCGCCGCAC
TCCGAGGTCG AGAAAGAGAT CGAGTTCGAG CTGGCGGCGC CCGGCATGAT CGGCCTCGAG
ACCGCGGTGC CGCTCACCCT CGGCCTGGTG CGCGAGGGCG TCATCGACCT CGTGCGCGCG
GTGCACATGC TCACCGCGGC GCCGGCGCGG CTGTTCTCGA TGGACCGCGA GGGCGTGGGC
GCGCTGGCCG CCGGACGGGT GGCCGATCTG TGCGTCATCG ACCCCGAGCG CGAGCTGCAG
GTCGATCGCA CCGCCAGCCG CAGCAAGTCG TACAACACGC CCTTTCACGG CCAGGCGATG
CGCGGCGTCG CCGTGCTGAC CCTGCTCGGC GGCCGGGTGG TCTACGATCG CGAGGAGATG
CTGTCATGA
 
Protein sequence
MDLLLRGGRV IDTSGEAAGG GSGRAVLDAA CDVLVRDGRI VEIGRGLAAP SGVRELDLAG 
KLVCAGLVDL HVHFREPGHE YKEDIASGSA TAAAGGFTTV CCMPNTKPVN DCRAVTDLIV
RRAREAGLCR VRPVGAISRG LAGEALAEIG EMRDAGIVAV SDDGMPVMNA GLMRRALEYA
RTFDLPVVQH AEDLDLAEGG AMNEGEVATR IGVRSQPAQA ESVMVARDIE LVSWTGARYH
VAHISAARSV DLVREAKRRG LPVSCEVTPH HFALTDEACA SYDTHAKCMP PLRTQADLDA
IKEGMADGTI DCIATDHAPH SEVEKEIEFE LAAPGMIGLE TAVPLTLGLV REGVIDLVRA
VHMLTAAPAR LFSMDREGVG ALAAGRVADL CVIDPERELQ VDRTASRSKS YNTPFHGQAM
RGVAVLTLLG GRVVYDREEM LS