Gene Hlac_3423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3423 
Symbol 
ID7402271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp170674 
End bp171945 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content49% 
IMG OID643709966 
Producthypothetical protein 
Protein accessionYP_002567532 
Protein GI222481296 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACCCCT CAGAAATAGA TGGCAAAGCG ATTCAGTCGT TCTCTGCCTA CAGCCCCAGT 
ACTTTGTATG TTACAGTATT CGACAACACT GTGCCTGAGA CCACAATAGA GGCCCACCTG
CAAGAAATTG AGGAGCGACT GGCTGATGTT AACCGACCAC CAGCAACCAC ATTAGACATT
CTTGGGGAAG CCAAACGCGA ACGATACTGG GAGAACCTGC TTGTCTATTT TCTTGACCCC
GAAAACCCTC ATGGATTCGG AACAGATGTT CTTAGAGTAT TCCTACAAGC ACTTGCTGAA
CACGAGGAGA CTGTGCTTCC ACTTCAACAG TCCAAGCTCG GAGAGGTCAA GGTTCAATCG
CAGGTTCCTA CCGGCAAGGG GCCTTTTGAT ATCTTCTTGT GGAGCAAAGA TGCCTGGTAC
GTCGTTATCG AGTTGAAAGT CGCTGCAGCT GAAACGAGAA CTCAAACAAA ACGATATGCC
CAGGCCTCAA AGCTGGGCGA CCTCAACGTG AGCCGACACG ATGGGACGAG TGAGTACGTT
TACCTCGCCC CCCGAAGTGC AGGTGCATCG ACATCTGAGA CATTCGTCGA TGTATCATGG
GAGCACATCG TCCCCTATCT CGAAGATGTA CTGACGACAA GTCATGGCCA ATATCCATCG
AAAAGTCACG CCCAGCTCGC TGATTACCTC GACACAATAA GACAGACACT CAATATGGAC
GATTTCACCA CCATCTCAGA AGAGACGAAA CTGTACACCG AATACTCCGA TACGATTGAT
CGACTCGTTA AGGCCTACAA AAACGATAAA GCCAAGATTT TCAATCACCT TCAGACGGCT
TTTCTCGATG CACTAGACGG CCCCAAAAAA GACTGGACAG TAAACAATCG ACCGAAGACG
TACATCAACT TCGCCAAAAT AAACTGGGAG AACGTGGCGG GTAATGTCCG AATCGAATAT
GAACCCCATG TCCATCTCAA CCGCGATCAT CCAGAAATTC GGCTCCGCCT CGATATCGAA
AATTCAGGAA ATCAGCAAAT AAGAGAAGAG TTCAGCGAGA AACTAGGTCA GGAAGACTGG
GAAGCATTAG AAGACGCCGA CTGGGAAGTC GTTGATGGTA GCTACGCGTA TCTCGCAAAA
TCAGTTCCGT TCGATACGGA ACATCCAGCG GACTCAATTC GTCGTACTAT CCAAGAACTC
AATGGTCTCC GTGCAATCGT TGAGCCATAT ATCGACGGAA TCGTGCAAGA ACATCAGAAT
AGCACCCACT AG
 
Protein sequence
MNPSEIDGKA IQSFSAYSPS TLYVTVFDNT VPETTIEAHL QEIEERLADV NRPPATTLDI 
LGEAKRERYW ENLLVYFLDP ENPHGFGTDV LRVFLQALAE HEETVLPLQQ SKLGEVKVQS
QVPTGKGPFD IFLWSKDAWY VVIELKVAAA ETRTQTKRYA QASKLGDLNV SRHDGTSEYV
YLAPRSAGAS TSETFVDVSW EHIVPYLEDV LTTSHGQYPS KSHAQLADYL DTIRQTLNMD
DFTTISEETK LYTEYSDTID RLVKAYKNDK AKIFNHLQTA FLDALDGPKK DWTVNNRPKT
YINFAKINWE NVAGNVRIEY EPHVHLNRDH PEIRLRLDIE NSGNQQIREE FSEKLGQEDW
EALEDADWEV VDGSYAYLAK SVPFDTEHPA DSIRRTIQEL NGLRAIVEPY IDGIVQEHQN
STH