Gene Hlac_3173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3173 
Symbol 
ID7399302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp403254 
End bp404507 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content62% 
IMG OID643706973 
Producthypothetical protein 
Protein accessionYP_002564595 
Protein GI222476074 
COG category[S] Function unknown 
COG ID[COG4983] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATATG AGCCACCACT TGCTGTCTCG GAGTGCCCGG AGACGTTACG CGAACGCGAG 
CAGTGGGTGT GCTGGCGGGA AGAAACACGC GACGGTAAAC CGACGAAAGT ACCGGTGACG
CCAGGGACAG GAGGATTCGC GTCGTCGACA GACCCCGAGA CGTGGGATGC CTTCGAGACA
GCACTCGAAT ACACCGAGAC GGAGCACGCT GATGGTGTCG GGTTCGTATT CACTGACGAC
GATCCCATCG TCGGCGTTGA CCTGGACGAC TGCCGCGATC CCGAAACGGG CGACGTCGAC
GACGCCGCAC AAGACATCAT CAAGCGACTC GACTCCTATA CGGAGGTATC GCCGTCCGGT
ACCGGCTATC ACGTCCTGAT CACCGGCGAA CTTCCCGAAG GACGGAACCG TCGCGGGAGC
GTCGAACTGT ACGACACGGC ACGTTTTTTC ACCGTCACTG GCGACCACGT CGATGAGACT
CTCGGTCGCG TTGCACGTCG ACAGGACGCG CTCACAGCGA TTCACCGCGA GTACGTCCAG
GACACCGAGC GTGACACAGC ATCCGAGTCC GAGCCGGGGA ATGGCACTGA CGACCAGTCA
ACGGCGACCG GGACAGCCGA CGTCGACGTT GATCTCGAGG ATGAGGACCT CCTCGAGAAA
GCGCGAAACG CGTCGAACGG CGAGAAGTTC GAGCGGCTCT GGAACGGGAA TACGGTCGGC
TACGACAGTC AGTCCGAGGC CGATATGGCC CTGTGTTGTC TGCTGGCGTT CTGGACCGGT
GGCGACCGGA CGCAGATGAA GCAGCTGTTC CGGCAGTCGG GATTGCTTCG GGAGAAGTGG
GACGAGGTCC ACTACGCTGA CGGGTCGACG TACGGGGAGA AGACCATCGA GCGAGCGATT
GCGACCACGT CGGAGTTCTA CGACCCGGAC GCCGGCGACG ATACCGCGGA CGACACCCCC
GGCGGATCGT CTCCAGACGT CGGCGCTGCT GACTCGGAAC GGAGTCGCGC GTATCTAGCC
GAGAAGAATC GGCTATTGAG CGAGCGCGTC GACGAACTCG AGGCGACACT CACGGAGAAA
ACCGAGCGCA TCGACGCTCT CGAAGCGGAG ATCGAGCGAC TCACTGACGA ACTCGCTACC
CGTGGCCGGG AAGAAGAGTC CCAGGGCGAG CACGTCTCTA CTGCGAATGA GAACGGTGCT
GAGTCAGAGT CATCCTCTAT GTTGAGTCGA TTATTCGGCG GTCGGTTCGA GTAG
 
Protein sequence
MEYEPPLAVS ECPETLRERE QWVCWREETR DGKPTKVPVT PGTGGFASST DPETWDAFET 
ALEYTETEHA DGVGFVFTDD DPIVGVDLDD CRDPETGDVD DAAQDIIKRL DSYTEVSPSG
TGYHVLITGE LPEGRNRRGS VELYDTARFF TVTGDHVDET LGRVARRQDA LTAIHREYVQ
DTERDTASES EPGNGTDDQS TATGTADVDV DLEDEDLLEK ARNASNGEKF ERLWNGNTVG
YDSQSEADMA LCCLLAFWTG GDRTQMKQLF RQSGLLREKW DEVHYADGST YGEKTIERAI
ATTSEFYDPD AGDDTADDTP GGSSPDVGAA DSERSRAYLA EKNRLLSERV DELEATLTEK
TERIDALEAE IERLTDELAT RGREEESQGE HVSTANENGA ESESSSMLSR LFGGRFE