Gene Hlac_0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0003 
Symbol 
ID7399446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp3728 
End bp5494 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content65% 
IMG OID643707057 
ProductDNA polymerase II small subunit 
Protein accessionYP_002564679 
Protein GI222478442 
COG category[L] Replication, recombination and repair 
COG ID[COG1311] Archaeal DNA polymerase II, small subunit/DNA polymerase delta, subunit B 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.626198 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCTGG AGTCGAACGC CCGGATCGTC AAGGAGCTCG CCAGGCGCGG ATACAACGCT 
GAACGCGAAG CGATCACCCT CTTGGCGAGC GCGCCCGACT CCGCCGCTGC CGTCGAATCG
GTCGTCGACC GCGCCCCGGA CGACGCCCTC CGCATCACCA CCGACCACGT CCGCGCCGTC
ACGAACGACC AGTCAGTTTC TGGCGGTGAT TCAGCCGTTT CACCCCCCGA CTCGGCTGCG
TCCGGCCCCG GCTCGGCTGC GTCCGGCCCC GGCTCGGCTG CGTCCGGCCC CGGCTCGACC
GCCTCTGTCG CCGATCCAAC CACAGAATCA CCTCCAACCC GAAACAGCGA TCAACCGCCT
TCTACGACCT CGACGGAGGG ATCTCCAGTC GAAATGGAGG GGTCTTCTTT GGACCACGTT
CCCGGCTCCG ATCGCGATGC CGAGAACCCG GAGACCGATA TCGATACCGA AATCGACCGC
GACTCCGACA GTAATGCCGA CAGCAGCGTC GATCGCAGCC CCGATCGCAG CCCCGACAGC
ACCTCTCACC ACGATCCCGA TATCCGAGAG CTGGAAGTCG GTAACGACAT GACCGGTCAG
AGTACTGGGA CCGGGGAGTA CAGCGACTTC GTTCGGACGT TTCGCGACCG GTACGAGCGG
CTCTCGAAGG TCCTTCGCGG CCGCGTCAAC CACCGCCCTG CCGAGGCGAT CGCGGAGATG
CCCGGCGGGA GCGACGCCGC GATGATCGGC CTCGTCAACG ATGTCCGGTC CACCAAATCC
GGCCACTGGC TCGTCGAACT AGAAGACACG ACCGGAACCT TCCCCGCGCT GATCATGAAA
GACAAGGGGC TCGCCGACCT CGTCGACGAG ATCATGATGG ACGAGTGCCT CGCTATCGAG
GGGACGCTCG CCGATGACTC CGGAATCTTA TTCGCCGACT CCCTCCACTT CCCCGACGTT
CCCCGGACTC ACCGGACGGG GGCGGCCGAC CGTCACGTGC AGGCCGCGCT GATCTCCGAT
ATCCACGTCG GCAGCGACGA GTTCATGGTC GACGCATGGA GTAGCTTCAC CGATTGGCTC
CACACGCCCG AGGCCGAACC GGTGGAGTAC CTGCTGCTCG CCGGCGACAT GGTCGAGGGC
GTCGGCGTCT ACCCCGATCA GGACGAGGAA CTGGAGATCG TCGACATCTA CGACCAGTAC
GAGGCGTTCG CGGAGTACCT CAAGGAGGTG CCGGCTGACA CCGAAATCGT GATGATCCCC
GGCAACCACG ACGCGGTCCG ACTCGCGGAG CCCCAGCCCG GGTTCAACGA CGAGATCCGC
TCCATCATGG ATGTCCACGA CGCGCAGATC GTCTCGAACC CCGCGACCGT CACCGTCGAG
GGCGTCGACG TGTTGATGTA CCACGGCGTC TCCCTCGACG AGGTAATCGC GGAGCTCCCC
GAGGAGAAGG CGAGCTACGA CGAACCCCAC AAGGCGATGT ACCAGCTTTT AAAGAAGCGT
CACGTCGCGC CGCAGTTCGG CGGCCACACC CGCGTTGCCC CGGAGGAGCG CGACTACCTC
GTCATCGAGG ACGTGCCCGA CGTGTTCCAC ACCGGTCACG TCCACAAGCT CGGCTGGGGG
AAGTACCACA ACGTGCTCGC CGTCAACTCC GGTTGCTGGC AAGCGCAGAC CGACTTCCAG
AAGTCCGTCA ACATCAATCC CGACTCCGGC TACGCGCCCA TCCTCGACCT GGACACTCTC
GACATGACCG TCAGGAAATT CGCGTGA
 
Protein sequence
MPLESNARIV KELARRGYNA EREAITLLAS APDSAAAVES VVDRAPDDAL RITTDHVRAV 
TNDQSVSGGD SAVSPPDSAA SGPGSAASGP GSAASGPGST ASVADPTTES PPTRNSDQPP
STTSTEGSPV EMEGSSLDHV PGSDRDAENP ETDIDTEIDR DSDSNADSSV DRSPDRSPDS
TSHHDPDIRE LEVGNDMTGQ STGTGEYSDF VRTFRDRYER LSKVLRGRVN HRPAEAIAEM
PGGSDAAMIG LVNDVRSTKS GHWLVELEDT TGTFPALIMK DKGLADLVDE IMMDECLAIE
GTLADDSGIL FADSLHFPDV PRTHRTGAAD RHVQAALISD IHVGSDEFMV DAWSSFTDWL
HTPEAEPVEY LLLAGDMVEG VGVYPDQDEE LEIVDIYDQY EAFAEYLKEV PADTEIVMIP
GNHDAVRLAE PQPGFNDEIR SIMDVHDAQI VSNPATVTVE GVDVLMYHGV SLDEVIAELP
EEKASYDEPH KAMYQLLKKR HVAPQFGGHT RVAPEERDYL VIEDVPDVFH TGHVHKLGWG
KYHNVLAVNS GCWQAQTDFQ KSVNINPDSG YAPILDLDTL DMTVRKFA