Gene Hlac_2077 details

Gene Information

Locus tag: Hlac_2077
Symbol:
ID: 7400597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2064441
End bp: 2065763
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 72%
IMG OID: 643709148
Product: dihydroorotase
Protein accession: YP_002566725
Protein GI: 222480488
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.133238
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCATCA CGGGTGCGGA GCTGGCCGAC GGGCGGGTCC GCGACGTTCG GATCCGAGAC 
GGGACCATCG ACGCGGTCGA ACCGACGAAC GCGGGACTCG ACGCCGACAC CGGCGAGCGC
GTCGTCGACG CGCGGGGACG CCACCTTCTC CCCGGCGCCG TCGACGTCCA CGTCCACTTC
CGCGAGCCGG GCGCGAGCCA CAAGGAGACG TGGACCTCGG GCTCGCGGGG CGCGGCCGCG
GGCGGCGTGA CGACGGTCGT CGACCAGCCG AACACCTCAC CCCCGACCGT CGACGGCGAC
GCCTTCGACG AGAAGGCCGC CCTCGCCGCC GACTCGCTCG TCGACTACGG GATCAACGGC
GGCGTCACCG CCGACTGGGA CCCCGAGAGC CTCTTCGAGC GCCCCCTGTT CGCGCTCGGC
GAGGTGTTCC TCGCGGATTC GACCGGCGAC ATGGGGATCG CACTGGACCT GTTCGAGGAG
GCGCTCGCCG AGGCCGCCGC CCGCGACGTT CCGGTCACGG TCCACGCGGA GGACGAGACC
CTGTTCGATG AGAGTGCGCT CGACGGCGAC CTCGGGGGGG TCGGCACCGC CGCGAACGCC
GACGCGTGGT CGGCCTACCG AACCGCCGAA GCGGAGACTG CGGCGATCGA GCGCGCCCTC
GACGCCGGCG CAGAGAGCGA CGCGCAGGTG CATATCGCGC ACACGTCGAC GCCCGAGGGG
ATCGACGCCG TGAGCGATAC CGACGCAACC TGTGAGGTGA CGCCGCACCA CCTCTTCCTG
TCGCGCGAGG ACGCGGGGCG GCTCGGCACC TTCGGGCGCA TGAACCCGCC GCTCCGCTCG
GAGGAGCGGC GCGCGGCCGT CTTCGAGCGG CTCCGCGACG GCGACGTCGA CGTGGTCGCC
ACCGACCACG CGCCCCACAC GGTCGCGGAG AAGCGACAGA GGCTCGTCGA CGCGCCCAGC
GGCGTTCCGG GCGTAGAGAC CCTCTATCCG CTTCTCTTGG AGTCCGTCCG CAAGGGGAAC
CTCTCGTTGG AGCGCGTTCG CGACGTGGTC GCCGCCAACC CGGCGTCGAT CTTCGAGATC
GAGGGGAAAG GGCGGATCGA ACCCGGCGCC GACGCCGATC TCGTCGTGGT CGATCTGACG
AACCCCCGCG AGATCGAGGC CGGCGCGCTC CACGGCGCGT CCGGCTGGAC GCCCTTTGAG
GGGTTACAGG GCGTCTTCCC GGAGCTGACG ACGGTCCGTG GCAAGATCGC CTACGAGCGC
GATCCGGTCA CCGGCGCGGA GTCGTTCGGC GAGACAGTCG GTCGAAACGT GCGAGAGTCG
TAA
 
Protein sequence
MLITGAELAD GRVRDVRIRD GTIDAVEPTN AGLDADTGER VVDARGRHLL PGAVDVHVHF 
REPGASHKET WTSGSRGAAA GGVTTVVDQP NTSPPTVDGD AFDEKAALAA DSLVDYGING
GVTADWDPES LFERPLFALG EVFLADSTGD MGIALDLFEE ALAEAAARDV PVTVHAEDET
LFDESALDGD LGGVGTAANA DAWSAYRTAE AETAAIERAL DAGAESDAQV HIAHTSTPEG
IDAVSDTDAT CEVTPHHLFL SREDAGRLGT FGRMNPPLRS EERRAAVFER LRDGDVDVVA
TDHAPHTVAE KRQRLVDAPS GVPGVETLYP LLLESVRKGN LSLERVRDVV AANPASIFEI
EGKGRIEPGA DADLVVVDLT NPREIEAGAL HGASGWTPFE GLQGVFPELT TVRGKIAYER
DPVTGAESFG ETVGRNVRES
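
Consistency check (sketch)

The fields in this record can be cross-checked against the sequences: a 1323 bp CDS corresponds to 441 codons, that is 440 amino acids plus one stop codon, and the gene sequence should translate to the listed protein. The following is a minimal Python sketch of such a check, not the database's own method. The helper names (`clean`, `gc_fraction`, `translate`, `lengths_consistent`) are hypothetical and introduced only for illustration. It assumes the sequence contains only A, C, G and T, and that the codon assignments of translation table 11 match the standard genetic code (the alternative start codons of table 11 do not matter here because this gene starts with ATG).

```python
from itertools import product

# Standard codon assignments, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). Assumed here to apply to table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def lengths_consistent(gene_bp: int, protein_aa: int) -> bool:
    """True if the CDS length equals three bases per residue plus a stop codon."""
    return gene_bp == 3 * (protein_aa + 1)


# First 60 bases of the gene sequence, copied from the record above.
first_block = "ATGCTCATCA CGGGTGCGGA GCTGGCCGAC GGGCGGGTCC GCGACGTTCG GATCCGAGAC"

print(lengths_consistent(1323, 440))
print(translate(first_block))
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence extends the check to the whole record; `gc_fraction` on the full gene sequence can be compared with the listed GC content.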