Gene Hlac_0001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0001 
Symbol 
ID7401478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp780 
End bp2453 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content61% 
IMG OID643707055 
Productorc1/cdc6 family replication initiation protein 
Protein accessionYP_002564677 
Protein GI222478440 
COG category[L] Replication, recombination and repair
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1474] Cdc6-related protein, AAA superfamily ATPase 
TIGRFAM ID[TIGR02928] orc1/cdc6 family replication initiation protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.810382 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAAG GAGAGAACAC ACCGCAGGCC GATGAATCGA AAGACAACAG CGGGATAACC 
GGCGATCGGT CCGGGACTGA CCAGGTTGAC GAAACCGACG GATCCGACGA AACCGACGCT
AACCTCGACG ATCCGCCATC ACCCGGCTCG ACGAACTCTG ATTTAAACAC GGATCTACAG
TCGGATCCCG AAACTGACGT TTCCACGGAC ATCGATGTCG GAAGCGGCGG ACGAGACCGC
TCCTCTCCGG ATGTCGACTT CGACGGTGTC GTCCTCGACG ACGACGACGA CAACCAGGGT
CTGTTCGACG ATCTGCTCTC CGGAGAGCCG ATATTCGAGA ATAAAGAGGT CCTCCGTCCC
TCCTACACTC CCCACGAGCT TCCCCACCGA AACGACCAGA TCAATCGGAT GGCGACGATC
CTCGTCTCCG CGCTGCGCGG GGAAACGCCC TCTAATATCC TCATTTACGG CAAGACGGGA
ACGGGGAAGA CGGCCTCCGC GAAGTTCGTC TCCCAAGAGC TTGAGTCCAC CTCACAGAAA
TACGACGTAC CCTGCGAGGT CGAGTACATT AACTGCGAGG TGACGGACAC GCAGTACCGC
GTCCTCGCGC AGCTCGCGAA CACCTTTATC GAGAAGAACC AGGCGGTCAT CGCGGACCAA
CTGGAGCGGT GTCGCGAACT CCGCTCTGCC GCCGCCGACG CTCCAGCCGC CCTCGCCGAC
ACCGAGTTCG CAACGCTCGA CGACCTCGAC GCGCGAATCG ACGAGCTCGA AACCGATGCC
GAAGAGATGG AGGAGGTCCC CATGACTGGC TGGCCCACCG ACCGGGTCTA CTCGACCTTC
TTCGAGGCAG TCGACTACCA CGAGCGCGTG GTTGTTATCA TGCTCGACGA GATCGACAAG
CTTGTCGAGA AGAGCGGGGA CGACACCCTC TATAACCTCT CTCGGATGAA CTCGGAACTC
AACAGGTCCC GGATCTCGAT CATGGGGATC TCGAACGATC TGAAATTCAC CGATTTCCTC
GACCCCCGTG TCAAGTCGAG CCTTGGCGAG GAAGAGATCG TCTTCCCGCC CTACGACGCG
AACCAGCTCC GCGACATCCT CCAGCACCGC GCCGATATTT CGTTCAAGCA GGACGCGCTC
ACGGACGACG TGATCCCCCT CTGTGCGGCG TTCGCCGCTC AGGAACACGG CGACGCCCGT
CGCGCGCTCG ATCTACTCCG TACTGCGGGC GAACTCGCCG AGCGCTCGCA GGCTGAGATC
GTCGCCGAGA AACACGTCCG GCAGGCGCAG GACAAGATCG AACTCGACCG CGTCGTCGAG
GTTGTCCGCA CCCTCCCGAC CCAGAGCAAG ATCGTGCTGT TCGCGGTCAT CCTCTTGGAG
AAGAACGGCG TGCACAACAT CAACACTGGC GAGGTATTCA ACATCTACAA ACGCCTCTGC
GAGGAGATCG ACGCCGACGT GCTCACCCAG CGCCGCGTCA CCGACCTCAT CAGCGAACTC
GACATGCTCG GGATCGTCAA CGCCGTCGTC GTCTCGAAGG GGCGCTACGG CCGGACCAAG
GAGATGGGCC TGTCGGTTCC CGTCGAGGAG ACCGAGGCCG TCTTGCTGTC CGACTCCCGA
CTCGGCGACA TCGAGAACGC GCAGCCGTTC GTCCAGGCCC GATTCGACAA CTGA
 
Protein sequence
MDEGENTPQA DESKDNSGIT GDRSGTDQVD ETDGSDETDA NLDDPPSPGS TNSDLNTDLQ 
SDPETDVSTD IDVGSGGRDR SSPDVDFDGV VLDDDDDNQG LFDDLLSGEP IFENKEVLRP
SYTPHELPHR NDQINRMATI LVSALRGETP SNILIYGKTG TGKTASAKFV SQELESTSQK
YDVPCEVEYI NCEVTDTQYR VLAQLANTFI EKNQAVIADQ LERCRELRSA AADAPAALAD
TEFATLDDLD ARIDELETDA EEMEEVPMTG WPTDRVYSTF FEAVDYHERV VVIMLDEIDK
LVEKSGDDTL YNLSRMNSEL NRSRISIMGI SNDLKFTDFL DPRVKSSLGE EEIVFPPYDA
NQLRDILQHR ADISFKQDAL TDDVIPLCAA FAAQEHGDAR RALDLLRTAG ELAERSQAEI
VAEKHVRQAQ DKIELDRVVE VVRTLPTQSK IVLFAVILLE KNGVHNINTG EVFNIYKRLC
EEIDADVLTQ RRVTDLISEL DMLGIVNAVV VSKGRYGRTK EMGLSVPVEE TEAVLLSDSR
LGDIENAQPF VQARFDN