Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0003 |
Symbol | |
ID | 7399446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 3728 |
End bp | 5494 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643707057 |
Product | DNA polymerase II small subunit |
Protein accession | YP_002564679 |
Protein GI | 222478442 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1311] Archaeal DNA polymerase II, small subunit/DNA polymerase delta, subunit B |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.626198 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGCTGG AGTCGAACGC CCGGATCGTC AAGGAGCTCG CCAGGCGCGG ATACAACGCT GAACGCGAAG CGATCACCCT CTTGGCGAGC GCGCCCGACT CCGCCGCTGC CGTCGAATCG GTCGTCGACC GCGCCCCGGA CGACGCCCTC CGCATCACCA CCGACCACGT CCGCGCCGTC ACGAACGACC AGTCAGTTTC TGGCGGTGAT TCAGCCGTTT CACCCCCCGA CTCGGCTGCG TCCGGCCCCG GCTCGGCTGC GTCCGGCCCC GGCTCGGCTG CGTCCGGCCC CGGCTCGACC GCCTCTGTCG CCGATCCAAC CACAGAATCA CCTCCAACCC GAAACAGCGA TCAACCGCCT TCTACGACCT CGACGGAGGG ATCTCCAGTC GAAATGGAGG GGTCTTCTTT GGACCACGTT CCCGGCTCCG ATCGCGATGC CGAGAACCCG GAGACCGATA TCGATACCGA AATCGACCGC GACTCCGACA GTAATGCCGA CAGCAGCGTC GATCGCAGCC CCGATCGCAG CCCCGACAGC ACCTCTCACC ACGATCCCGA TATCCGAGAG CTGGAAGTCG GTAACGACAT GACCGGTCAG AGTACTGGGA CCGGGGAGTA CAGCGACTTC GTTCGGACGT TTCGCGACCG GTACGAGCGG CTCTCGAAGG TCCTTCGCGG CCGCGTCAAC CACCGCCCTG CCGAGGCGAT CGCGGAGATG CCCGGCGGGA GCGACGCCGC GATGATCGGC CTCGTCAACG ATGTCCGGTC CACCAAATCC GGCCACTGGC TCGTCGAACT AGAAGACACG ACCGGAACCT TCCCCGCGCT GATCATGAAA GACAAGGGGC TCGCCGACCT CGTCGACGAG ATCATGATGG ACGAGTGCCT CGCTATCGAG GGGACGCTCG CCGATGACTC CGGAATCTTA TTCGCCGACT CCCTCCACTT CCCCGACGTT CCCCGGACTC ACCGGACGGG GGCGGCCGAC CGTCACGTGC AGGCCGCGCT GATCTCCGAT ATCCACGTCG GCAGCGACGA GTTCATGGTC GACGCATGGA GTAGCTTCAC CGATTGGCTC CACACGCCCG AGGCCGAACC GGTGGAGTAC CTGCTGCTCG CCGGCGACAT GGTCGAGGGC GTCGGCGTCT ACCCCGATCA GGACGAGGAA CTGGAGATCG TCGACATCTA CGACCAGTAC GAGGCGTTCG CGGAGTACCT CAAGGAGGTG CCGGCTGACA CCGAAATCGT GATGATCCCC GGCAACCACG ACGCGGTCCG ACTCGCGGAG CCCCAGCCCG GGTTCAACGA CGAGATCCGC TCCATCATGG ATGTCCACGA CGCGCAGATC GTCTCGAACC CCGCGACCGT CACCGTCGAG GGCGTCGACG TGTTGATGTA CCACGGCGTC TCCCTCGACG AGGTAATCGC GGAGCTCCCC GAGGAGAAGG CGAGCTACGA CGAACCCCAC AAGGCGATGT ACCAGCTTTT AAAGAAGCGT CACGTCGCGC CGCAGTTCGG CGGCCACACC CGCGTTGCCC CGGAGGAGCG CGACTACCTC GTCATCGAGG ACGTGCCCGA CGTGTTCCAC ACCGGTCACG TCCACAAGCT CGGCTGGGGG AAGTACCACA ACGTGCTCGC CGTCAACTCC GGTTGCTGGC AAGCGCAGAC CGACTTCCAG AAGTCCGTCA ACATCAATCC CGACTCCGGC TACGCGCCCA TCCTCGACCT GGACACTCTC GACATGACCG TCAGGAAATT CGCGTGA
|
Protein sequence | MPLESNARIV KELARRGYNA EREAITLLAS APDSAAAVES VVDRAPDDAL RITTDHVRAV TNDQSVSGGD SAVSPPDSAA SGPGSAASGP GSAASGPGST ASVADPTTES PPTRNSDQPP STTSTEGSPV EMEGSSLDHV PGSDRDAENP ETDIDTEIDR DSDSNADSSV DRSPDRSPDS TSHHDPDIRE LEVGNDMTGQ STGTGEYSDF VRTFRDRYER LSKVLRGRVN HRPAEAIAEM PGGSDAAMIG LVNDVRSTKS GHWLVELEDT TGTFPALIMK DKGLADLVDE IMMDECLAIE GTLADDSGIL FADSLHFPDV PRTHRTGAAD RHVQAALISD IHVGSDEFMV DAWSSFTDWL HTPEAEPVEY LLLAGDMVEG VGVYPDQDEE LEIVDIYDQY EAFAEYLKEV PADTEIVMIP GNHDAVRLAE PQPGFNDEIR SIMDVHDAQI VSNPATVTVE GVDVLMYHGV SLDEVIAELP EEKASYDEPH KAMYQLLKKR HVAPQFGGHT RVAPEERDYL VIEDVPDVFH TGHVHKLGWG KYHNVLAVNS GCWQAQTDFQ KSVNINPDSG YAPILDLDTL DMTVRKFA
|
| |