Gene Hlac_2024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2024 
Symbol 
ID7402043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2017097 
End bp2018605 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content67% 
IMG OID643709095 
Productprotein of unknown function UPF0027 
Protein accessionYP_002566672 
Protein GI222480435 
COG category[S] Function unknown 
COG ID[COG1690] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.724572 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.445146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGC GCGAGTTCGA CGGGATCCGA CTGGAGAAAG TGCGGGAGCA CGTCTGGGAG 
ATCCCCCGCG AGGGCGACAT GAACGTCCCC GCGCGGGTGC TCGCCAGCGA GAGCCTGCTA
GCGGAGATCG GCGAGGACAA AACCCTCCAA CAGCTAAAAA ACGCCACGCA CCTGCCCGGA
ATGGTCGAGC CCGCCCTCTG TATGCCCGAC GGCCATCAGG GGTACGGGTT CCCGGTCGGC
GGCGTCGGCG CGATCGACGC CCGAACCGGC TGTATCTCGC CCGGAGCGGT CGGCTATGAC
ATAAATTGCG GCGTCAGAAT GGTGAAAACT AATCTTACCT ACGACGACGT GCGCGGCCGC
GAGCCGGAAC TCGTCAACGC GCTTTTCGAG GCGATCCCCT CCGGGCTCGG CGGCGGCGGC
GTCATCGAGG GCGACGCCGA CGCGATCGAG GGCGCCCTAG AACGGGGCGT CGAGTGGGCC
GTCGAAGAGG GGTACGGAAT CGAAAGCGAC CTCGCGCGCT GTGAGGACGA GGGGCGACGG
CCCGACGCCC GCCCCGAGTA CGTCTCCCAG AAGGCGATGG ACCGAGGACG CAACCAGATG
GGGTCGCTCG GCTCGGGGAA CCACTTCCTC GAGGTGCAGC GCGTCACGGA CGTGTTCCGC
GAGGAGGTCG CCGACGAGTA CGGGCTCGAA GAAGACGGAA TCGTCGTGTT GATCCACTGC
GGGAGCCGCG GACTCGGCCA CCAGACTTGC AACGACTACC TCCGGCAGAT CGAGAAGAAA
CACGGCGACC TGCTCGCCGA GCTGCCCGAC AAAGAGCTCG CGGCCGCGCC CGCCGGCTCC
GAGCTGGCAG ACGAGTACTA CGGTGCGATG GGCGCGTGCA TCAACTTCGC ATGGGTGAAC
CGCCAGCTGA TCACCCACCA AGCCCGCAAA ACGTTCGGCG AGGTGTTCGA CGCCGACCCG
ATCGAGGACC TCGAGATGGA ACTGCTGTAC GACGTGGCAC ACAACATCGC CAAGAAGGAG
ACCCACGAGG TCGGCGTCGA CGCCGACGGA CTGCCCGCGG TCGGCGACGA GGCGGTCGAC
CGTGCGGATC GGGAGCTGTA CGTCCACCGC AAGGGCGCGA CCCGCGCGTT CCCGGCCGGC
CACGAGGACG TACCCGAAGT CTACCGCGAC GTGGGCCAGC CCGTGATCAT CCCCGGCAGC
ATGGGCGCCG GGTCGTACGT GCTCCGCGGC GGCGACGAGT CGATGGGCGT CTCCTTCGGC
TCCACCGCCC ACGGCGCCGG CCGGCTGATG AGCCGGACGC AGGCGAAACA GGAGTTCTGG
GGCGAGGACG TGCAAGACGA CCTCGAAGAC GGCCAGCAGA TCTACGTGAA AGCGCGGTCC
GGCGCTACCA TCGCCGAGGA GGCGCCGGGC GTGTACAAGG ACATCGACGA GGTGATCCGC
GTCAGCGACG AACTCGGCAT CGGCGACAAG GTCGCGCGGA CGTTCCCCGT CTGTAACATC
AAGGGGTGA
 
Protein sequence
MTTREFDGIR LEKVREHVWE IPREGDMNVP ARVLASESLL AEIGEDKTLQ QLKNATHLPG 
MVEPALCMPD GHQGYGFPVG GVGAIDARTG CISPGAVGYD INCGVRMVKT NLTYDDVRGR
EPELVNALFE AIPSGLGGGG VIEGDADAIE GALERGVEWA VEEGYGIESD LARCEDEGRR
PDARPEYVSQ KAMDRGRNQM GSLGSGNHFL EVQRVTDVFR EEVADEYGLE EDGIVVLIHC
GSRGLGHQTC NDYLRQIEKK HGDLLAELPD KELAAAPAGS ELADEYYGAM GACINFAWVN
RQLITHQARK TFGEVFDADP IEDLEMELLY DVAHNIAKKE THEVGVDADG LPAVGDEAVD
RADRELYVHR KGATRAFPAG HEDVPEVYRD VGQPVIIPGS MGAGSYVLRG GDESMGVSFG
STAHGAGRLM SRTQAKQEFW GEDVQDDLED GQQIYVKARS GATIAEEAPG VYKDIDEVIR
VSDELGIGDK VARTFPVCNI KG