Gene Information |
Locus tag | Hlac_2466 |
Symbol | |
ID | 7401518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2444145 |
End bp | 2445575 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643709538 |
Product | UMUC domain protein DNA-repair protein |
Protein accession | YP_002567109 |
Protein GI | 222480872 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair |
TIGRFAM ID | |
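The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus the stop codon, assuming an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2444145, 2445575)
protein = expected_protein_length_aa(length)
print(length, protein)  # 1431 476
```

Both results agree with the Gene Length (1431 bp) and Protein Length (476 aa) fields.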
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.666562 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGGCG GAACGCTCCC CGGGACCGAC GCGGACGACG ACGCGGGGAG CGACCGCATC GTGTTTCACG TCGATATGGA CTGCTTTTAC GCCTCCTGCG AGCGCCTGCG GCGCCCCGAA CTGGCCGACG AGCCCCTCGT CGTCGGCATG GGCTACGAGG CGGGAGAGGC GATCGGCGCG GTCGCGACCG CGAGCTACGA GGCCCGCGCG TTCGGCGTCG AGAGCGCGAT GCCGATCTCG GAGGCGCTCG ATCTGCTCCC CAGGCGGGTG AACGCCGACC CCGACGACCC GGACGCGCCC GACCCCGACG AGGCGGGTCG GTACCTCCCC GTCGACCTCG ACTTTTATAA GGAGGTCGCG AGCGACGTGA AGACCGTCCT CCGCGACTGC GCCGACGTGC TCCGCGAGGT GAGCGTCGAC GAGGCGTACC TCGACGTCAC CGACCGGACG GCGTGGGATG CCGCCGGCGG GGAAAGCACC GCGACCGGCC CCGCCGAGAC CCGGACGCTG GCGGAGGGGT ACGCCAGACA CATCAAAGAG CGCATCGAGC GCGAGGCCGG CGTGCCCGCG AGCGTCGGCG TCGCGCCGAA CATGTCGACC GCGAAGATCG CCAGCGACGC CGACAAGCCG GACGGGCTGG TCGTCGTGCC ACCGGGCGAG GTGGAGTCCT TCCTCGCGTC GCTGCCGACC GCCGAGATCC ACGGCGTCGG CCCGGTGACG GAGCGGACGC TGGCGGAGCT TGGGATCGAG ACCGCCGGCG ACCTCGCCGC CGCCGATCCC GACCGGCTCG CCGACGAACT CGGCGACCGC GGTCGGGAGC TGTACCGGCG GGCGCGGGGC GACGACGACC GAGAGGTCAC GCCCACGGGA CTCCCGAAGA GTCTCTCGCG GGAGTCGTCG CTGTCGGCGA CCGCCGACGA GGAGCGCAAG CGCGAGACGG TGCGCGCGCT CGCCGCCGAC GTGGCCCGTC GTGCCCGCGA GCGGGGCTGT CTGTACCGCA CCATCGGGAT CAAGGCGGTC GAACCTCCCT ACGAGGTGAA CACTCGCGCC CGGAGCCTCC CCGGTCCGGT CGACGACCCC GACCTCGTGG AGTCGGTGGC GCTCGACCTC CTCGGCGAGT TCGCCGGTGA CCGAGTCCGT AAGCTCGGGG TGCGCGTCTC GAAACTCGAC TTCGCCGAGA GCGATCAGGC GACGCTCGGC GGGTTCGACG CGAGTGGGGG CGGCAGGGGC ACCGCGGGAG GGGACAGCGG AGCCGACCGC GACCGCGAGT CCGAGGCCCA GTCAGTCGCC ACCGACGGCG AGGGCGGGAA GCTCACCGAC TGGGTGGGAG GCGAGCCGAA CGCCGGGGAC GCGGGCGAGG CGGACGAGAC CGGAGACCGA CCGACGAGCG AGACCGGCGA CGGACAGGCA TCGCTCGGCG ACTGGTCGTG A
|
Protein sequence | MAGGTLPGTD ADDDAGSDRI VFHVDMDCFY ASCERLRRPE LADEPLVVGM GYEAGEAIGA VATASYEARA FGVESAMPIS EALDLLPRRV NADPDDPDAP DPDEAGRYLP VDLDFYKEVA SDVKTVLRDC ADVLREVSVD EAYLDVTDRT AWDAAGGEST ATGPAETRTL AEGYARHIKE RIEREAGVPA SVGVAPNMST AKIASDADKP DGLVVVPPGE VESFLASLPT AEIHGVGPVT ERTLAELGIE TAGDLAAADP DRLADELGDR GRELYRRARG DDDREVTPTG LPKSLSRESS LSATADEERK RETVRALAAD VARRARERGC LYRTIGIKAV EPPYEVNTRA RSLPGPVDDP DLVESVALDL LGEFAGDRVR KLGVRVSKLD FAESDQATLG GFDASGGGRG TAGGDSGADR DRESEAQSVA TDGEGGKLTD WVGGEPNAGD AGEADETGDR PTSETGDGQA SLGDWS
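The sequences above are displayed in space-separated blocks of ten residues. A minimal sketch of how such a string could be joined and its GC fraction computed is shown here; the helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two display blocks of the gene sequence in this record.
first_blocks = "ATGGCGGGCG GAACGCTCCC"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_fraction(first_blocks))          # 0.75
```

Applied to the complete gene sequence, the same function gives a value that can be compared with the reported GC content of 73%.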