Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2024 |
Symbol | |
ID | 7402043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2017097 |
End bp | 2018605 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643709095 |
Product | protein of unknown function UPF0027 |
Protein accession | YP_002566672 |
Protein GI | 222480435 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.724572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.445146 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACGC GCGAGTTCGA CGGGATCCGA CTGGAGAAAG TGCGGGAGCA CGTCTGGGAG ATCCCCCGCG AGGGCGACAT GAACGTCCCC GCGCGGGTGC TCGCCAGCGA GAGCCTGCTA GCGGAGATCG GCGAGGACAA AACCCTCCAA CAGCTAAAAA ACGCCACGCA CCTGCCCGGA ATGGTCGAGC CCGCCCTCTG TATGCCCGAC GGCCATCAGG GGTACGGGTT CCCGGTCGGC GGCGTCGGCG CGATCGACGC CCGAACCGGC TGTATCTCGC CCGGAGCGGT CGGCTATGAC ATAAATTGCG GCGTCAGAAT GGTGAAAACT AATCTTACCT ACGACGACGT GCGCGGCCGC GAGCCGGAAC TCGTCAACGC GCTTTTCGAG GCGATCCCCT CCGGGCTCGG CGGCGGCGGC GTCATCGAGG GCGACGCCGA CGCGATCGAG GGCGCCCTAG AACGGGGCGT CGAGTGGGCC GTCGAAGAGG GGTACGGAAT CGAAAGCGAC CTCGCGCGCT GTGAGGACGA GGGGCGACGG CCCGACGCCC GCCCCGAGTA CGTCTCCCAG AAGGCGATGG ACCGAGGACG CAACCAGATG GGGTCGCTCG GCTCGGGGAA CCACTTCCTC GAGGTGCAGC GCGTCACGGA CGTGTTCCGC GAGGAGGTCG CCGACGAGTA CGGGCTCGAA GAAGACGGAA TCGTCGTGTT GATCCACTGC GGGAGCCGCG GACTCGGCCA CCAGACTTGC AACGACTACC TCCGGCAGAT CGAGAAGAAA CACGGCGACC TGCTCGCCGA GCTGCCCGAC AAAGAGCTCG CGGCCGCGCC CGCCGGCTCC GAGCTGGCAG ACGAGTACTA CGGTGCGATG GGCGCGTGCA TCAACTTCGC ATGGGTGAAC CGCCAGCTGA TCACCCACCA AGCCCGCAAA ACGTTCGGCG AGGTGTTCGA CGCCGACCCG ATCGAGGACC TCGAGATGGA ACTGCTGTAC GACGTGGCAC ACAACATCGC CAAGAAGGAG ACCCACGAGG TCGGCGTCGA CGCCGACGGA CTGCCCGCGG TCGGCGACGA GGCGGTCGAC CGTGCGGATC GGGAGCTGTA CGTCCACCGC AAGGGCGCGA CCCGCGCGTT CCCGGCCGGC CACGAGGACG TACCCGAAGT CTACCGCGAC GTGGGCCAGC CCGTGATCAT CCCCGGCAGC ATGGGCGCCG GGTCGTACGT GCTCCGCGGC GGCGACGAGT CGATGGGCGT CTCCTTCGGC TCCACCGCCC ACGGCGCCGG CCGGCTGATG AGCCGGACGC AGGCGAAACA GGAGTTCTGG GGCGAGGACG TGCAAGACGA CCTCGAAGAC GGCCAGCAGA TCTACGTGAA AGCGCGGTCC GGCGCTACCA TCGCCGAGGA GGCGCCGGGC GTGTACAAGG ACATCGACGA GGTGATCCGC GTCAGCGACG AACTCGGCAT CGGCGACAAG GTCGCGCGGA CGTTCCCCGT CTGTAACATC AAGGGGTGA
|
Protein sequence | MTTREFDGIR LEKVREHVWE IPREGDMNVP ARVLASESLL AEIGEDKTLQ QLKNATHLPG MVEPALCMPD GHQGYGFPVG GVGAIDARTG CISPGAVGYD INCGVRMVKT NLTYDDVRGR EPELVNALFE AIPSGLGGGG VIEGDADAIE GALERGVEWA VEEGYGIESD LARCEDEGRR PDARPEYVSQ KAMDRGRNQM GSLGSGNHFL EVQRVTDVFR EEVADEYGLE EDGIVVLIHC GSRGLGHQTC NDYLRQIEKK HGDLLAELPD KELAAAPAGS ELADEYYGAM GACINFAWVN RQLITHQARK TFGEVFDADP IEDLEMELLY DVAHNIAKKE THEVGVDADG LPAVGDEAVD RADRELYVHR KGATRAFPAG HEDVPEVYRD VGQPVIIPGS MGAGSYVLRG GDESMGVSFG STAHGAGRLM SRTQAKQEFW GEDVQDDLED GQQIYVKARS GATIAEEAPG VYKDIDEVIR VSDELGIGDK VARTFPVCNI KG
|
| |