Gene Hlac_1085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1085 
Symbol 
ID7400157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1085512 
End bp1086711 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content58% 
IMG OID643708151 
Productorc1/cdc6 family replication initiation protein 
Protein accessionYP_002565750 
Protein GI222479513 
COG category[L] Replication, recombination and repair
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1474] Cdc6-related protein, AAA superfamily ATPase 
TIGRFAM ID[TIGR02928] orc1/cdc6 family replication initiation protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.666942 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGGC CATTCAGTGA TATCGAGCGT TCGATTTTTG TCTCGAAGGA AGTTCTCTCC 
GAAGATCATC AGCCTGATCA GATTCTCGAA CGCGACGAGG AGATCGATCA ATACCGCCAC
GCGCTTCAAG ATGTTCTCTT CGGTCGCACC CCACAGAACG TCATGCTGTA CGGGAAGGCC
GGGCTCGGCA AAACCGCTGT CACGACGTAT ATGATGGAGG CGCTTCAAGA CGAGGTCACG
AAGCGACCGG ACGCCGACGA CGTACACGTA CACGAATTGA ACTGTAACGG AAAGTCTCTC
TACACTGTCG TTCGCACTCT GGTCAACGGA CTGTTACCCG AGCATGCAAG CGAGTTCCCG
AAACGTGGTC TTGGAACGGC TGACGCCTTC GAGGAACTCT ACACTCAACT TGACCGAATC
GGCGGAACTC ACCTCGTCGT CTTCGACGAG ATTGATCACT TGGACGATGT CGACACCCTC
CTGTATGAAC TCCCGCGAGC GCGATCGATC GGTCACATCA CGAACTCGAA GGTCGGAGTC
ATCGGAATCA GTAACAACTA CACGTTTCGG CAGTCGCTCT CGCCGAAGGT GAAAGACACG
CTGATGGAGA CAGAGATATC GTTCAGCCCG TACGATGCGA GCGAGCTCCG TACAATTCTC
GCGGACCGTG CCGATCGGGC GTTCGTAGAA GGTACCTGTG ACGACTCGGC CATCGCGAGG
GCGGCGGCGA TCGCGGCCAA GGATCGCGGA AACGCGCGCC AAGCGATAGA TCTCCTCCGT
GTCGGCGGCG AAGTCGCCAC ACGGGGTGAC GACGAACGGG TCGACGACTC ACACATCGTC
AAAGCCCAAG AACTCGTGCA GCGGGGACGA TTGCGGAACC GCATTCGAGA TCAGACACAG
CACGCACAGC TCCTGCTCGA AACCGCGGCG TACATCGAAC AACAAGGGGA GTCACCGGCA
CGGTCGAGAA CGATCAAGGA CCGATACGAG GCGGTCGCCG AATCACACGC TGTGGATCCA
CTTACGACCC TTAAGAGCAT CCAGAACCAT CTCTCTGACC TCCACATGCT CGGGTTTCTG
CAGCGGAGAG ACCGAAATCA CGGCGAAGGC GGCGGTCGGT ACTACGAGTA CCAACTCGAC
CTCGATCCGC AGATCGTCGT CGAAATCCGA CAGGAGGCCG AAGCCAAACC CTCCCCATAA
 
Protein sequence
MAGPFSDIER SIFVSKEVLS EDHQPDQILE RDEEIDQYRH ALQDVLFGRT PQNVMLYGKA 
GLGKTAVTTY MMEALQDEVT KRPDADDVHV HELNCNGKSL YTVVRTLVNG LLPEHASEFP
KRGLGTADAF EELYTQLDRI GGTHLVVFDE IDHLDDVDTL LYELPRARSI GHITNSKVGV
IGISNNYTFR QSLSPKVKDT LMETEISFSP YDASELRTIL ADRADRAFVE GTCDDSAIAR
AAAIAAKDRG NARQAIDLLR VGGEVATRGD DERVDDSHIV KAQELVQRGR LRNRIRDQTQ
HAQLLLETAA YIEQQGESPA RSRTIKDRYE AVAESHAVDP LTTLKSIQNH LSDLHMLGFL
QRRDRNHGEG GGRYYEYQLD LDPQIVVEIR QEAEAKPSP