Gene Hlac_0019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0019 
Symbol 
ID7401367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp18147 
End bp19229 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content64% 
IMG OID643707073 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_002564695 
Protein GI222478458 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.146471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.113107 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTGC GTGACGAGAC CGAAACGATG GGCGCGTACG ATCTCTTGAA GCCGGCACTG 
TTCGGACTTC CCCCGGAGAC GGCACACGGG CTGACACACC GACTGTTACG TGCGGTGCAA
ACCACACCGG TGACTGAACA CCTCCACAGC CGGTTCTCCG TGAACGATCC TCGTCTGCGC
GTTGAGGCAT TCGGGAACGA GTTTCCCAAT CCCGTGGGCG TCGCGGCCGG CTTCGACAAG
AATGCTGAGG TCCCGCGCGG CCTCGCAGCG CTGGGGTTCG GTCACGTCGA GGTCGGCGGC
GTCACCGCTG AGCAACAGCC GGGGAATCCG CGACCGCGAC TGTTTCGGCT GCGCGAGGAC
GAAGCTCTCA TAAACCGGAT GGGGTTCAAC AACGAGGGTG CCGACATCGT CGGCGAACGG
CTCGATCGGG AGCCGCTGCC GGAGATTCCG GTTGGAATCA ACATCGGGAA GTCGAAGTCG
ACACCCCTCG CCGAGGCCCC CGAGGACTAT CTATATACCT ACGAGCGCGT GGCGGACGCC
GGCGACTACT TCGTTGTTAA CGTCTCCAGC CCGAACACGC CCGGTCTCCG CGAACTGCAG
AACCGCGCGG CGTTAGAGGA GATACTTGGC ACTCTTACGG ACGCGGGCGC CGATCCTCTC
CTTGTAAAAC TCTCCCCGGA CCTCCCGGAG CCAGCAGTCG AGGACGCGCT CGGAGTCGTC
GACGATCTCG GTCTCGACGG CGTCATCGCA ACCAACACCA CGACATCGCG TCCGAACTCT
CTAAAAAGTC CCCAGCAGGC TGAGCGCGGT GGACTCTCGG GGAAGCCGAT AGAGCCGATC
GCCACGGAGC GGGTCCGGTT CGTTGCCGAG CGCACCGACG TTCCGGTGAT CGGGGTCGGC
GGAATCTCGG ACGCGAAGGG TGCCTACGAG AAGATACGGG CGGGCGCGTC CCTCATCCAG
TTGTACACAG GGCTCGTCTA CGAGGGGCCG GGCCTCGCAC GCGACATCAA CGGGGGGGTC
CTCGATCTCC TCGATCGGGA CGGCTTCGAC TCGGTCGAGG CCGCTGTCGG CGCGGATCTA
TAG
 
Protein sequence
MPVRDETETM GAYDLLKPAL FGLPPETAHG LTHRLLRAVQ TTPVTEHLHS RFSVNDPRLR 
VEAFGNEFPN PVGVAAGFDK NAEVPRGLAA LGFGHVEVGG VTAEQQPGNP RPRLFRLRED
EALINRMGFN NEGADIVGER LDREPLPEIP VGINIGKSKS TPLAEAPEDY LYTYERVADA
GDYFVVNVSS PNTPGLRELQ NRAALEEILG TLTDAGADPL LVKLSPDLPE PAVEDALGVV
DDLGLDGVIA TNTTTSRPNS LKSPQQAERG GLSGKPIEPI ATERVRFVAE RTDVPVIGVG
GISDAKGAYE KIRAGASLIQ LYTGLVYEGP GLARDINGGV LDLLDRDGFD SVEAAVGADL