Gene Hlac_3233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3233 
Symbol 
ID7399357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp493168 
End bp494625 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content57% 
IMG OID643707028 
Producthelix-hairpin-helix motif protein 
Protein accessionYP_002564650 
Protein GI222476129 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGTTG ATCTTGGTGT ACGTCTACTT ACTCTCCGAT GTGATGCGCT GGCTGAAGCG 
TCGACGACTA CCATCACTGA TCTGGTCGAC TACTTCGAAG CAGATCTCAT CTATATTATT
GAAGAAAAAC TGGACATGCG AATCGTGAGC ACAGTCGAGC GTACTGCTTC TTGTTCAGTA
GTATACACTC GAAGTAGCGC GGTTCACACC GAGACTGTTG ACGGGGTCAC CGTGTCTATT
GTCAGCTCGC TCGACTTCAT AGGTGGGGCC TCCACCGCCA GCGGGCGAGA GATTCCAGAG
GATGTCAACT ACGTCATCTG TGATGAGATT CAGACAAGCG CCGACTCGGT CACATTAGAT
GTCTCACTCG ATGGTCTCGA CCACCTCGCT CGCTTCCAGA ATCGAACCGA CCGAGAAGTA
ACGTTCCTCA CCGGGGCCAT GGAGGCCAGT TACGATTTCG TATGGAAGGC CGACGTCGAC
GACGAGAGCG TTCGCCTCCC CATCCGCGGA CTTGCTCCAA CCCGCCGGCA AGGGGCACCC
GAACTCGCCT GCATTTCGCT TGATTCGGGT GGCCGCATCG CTGTGTCCAC GACACCGGCG
GATAAATTCG GTCTGCAATC TCTTTCGGGT GTGGGCAAGG GAACCGCCCC AAAGCTCGCT
CGAAATGGGT ACGAGACGCG TGACGACGTC GCAGCCGCGA CGGAACAAGA GCTTCGTGAG
GTGCAAGGTA TCGGTGAGTC GAAAGCTCAG AGCATCCGCC AGAGCGCCCA CGCATTATCC
GAGGGATGCG TCATCCGTCT CACGGACGAA ACTGTCCCGG CAGCAGAGTA TAGTCCGCTG
TTCATCGATA TCGAGACCGA CGGCCTTAAT CCGAGTATCA TCTGGCTCAT CGGCGTCTAT
GACCCTGAAA CAGACGAGTA CGTCGACTTC ATCGATACAG AACCGTCGCG AGAAAACCCA
GGCAAAGCCA CCCGAGAGTT TGTTGCGTGG CTAGCCAGCA AGTACGATCG CACGTCCCTC
ATCGCGTGGA ACGGCCACAA CTTCGACTTC AAACACCTCA GCCGATTCAT CCGAGGACAC
GCACCGGAGT ACGCAGACTA CTGGTCGAAC TCCGTATTCG AGTACGACCT GTTCGATTGG
GCTGTGCGAA AAGACAACGC CATCCTCCCC GGTCGGACGA ACCGGGTTGA AGATGTCGCT
GAGGCTCTCG GGCATGGGCG CGACTCGGCT GCTGCCGCCG TGGATGGGAA ATCACTGGCG
AAGACCATCC AACGGCTTCT CGTGTCTCCA GAGCGCGCTC GAGACCTAGA CTGGGAGGCC
GCTCGGGCAT ACTGTGAGGC AGACGTCCGT GAGCTGGCTG CAGTCTACGA ATCGATTGCT
GAGGCGACGC CCGGCCACAA GCGTGCCAGC GTTCCTGCTG ATGAAAACAC CACGCAGACC
GGACTCATGG ACTTCTAA
 
Protein sequence
MSVDLGVRLL TLRCDALAEA STTTITDLVD YFEADLIYII EEKLDMRIVS TVERTASCSV 
VYTRSSAVHT ETVDGVTVSI VSSLDFIGGA STASGREIPE DVNYVICDEI QTSADSVTLD
VSLDGLDHLA RFQNRTDREV TFLTGAMEAS YDFVWKADVD DESVRLPIRG LAPTRRQGAP
ELACISLDSG GRIAVSTTPA DKFGLQSLSG VGKGTAPKLA RNGYETRDDV AAATEQELRE
VQGIGESKAQ SIRQSAHALS EGCVIRLTDE TVPAAEYSPL FIDIETDGLN PSIIWLIGVY
DPETDEYVDF IDTEPSRENP GKATREFVAW LASKYDRTSL IAWNGHNFDF KHLSRFIRGH
APEYADYWSN SVFEYDLFDW AVRKDNAILP GRTNRVEDVA EALGHGRDSA AAAVDGKSLA
KTIQRLLVSP ERARDLDWEA ARAYCEADVR ELAAVYESIA EATPGHKRAS VPADENTTQT
GLMDF