Gene Hlac_3354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3354 
Symbol 
ID7402209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp110272 
End bp112239 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content47% 
IMG OID643709905 
Producthypothetical protein 
Protein accessionYP_002567471 
Protein GI222481235 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCAACT ACGAAGACTA TCTAGAGCAC TTCTTTGACA ACGCAGAGGC CTCTATCCGA 
GAAGGAAAAG GCCAGGAGCT CTCAGACAAT CTTTCTCACA TCGCAGAGCT GATTCATGAA
CTAATCGACA AGGAGGTGGA TTCAAAAAGA CAGTTCCGGT CGAACTACGC TTTCTGCAGA
CGACAGTACA TCCAGCTGTA CGATACAGTG CTTGAGAACG GAGCTGACGA AGATCTAAGA
ACAAGCATTA TCGACTCGAT ATCAGCTATC ACCAATTACT CCCGCCAAGC AAATGATATA
GAGGCCTTCG ACCAACTGCT TAACTCCATC ACAAGCTGCT ATAGACAGTC ATACCCCCAA
CCCGGATTTG ACGACGCTGT AGGGAATATT TTCGAGAGAT ACAGTCATAT CCAGCACGGA
GTAACACAGA ACTTCGAAGA CGTAGACAGT GTCGACAGAT TAGCCAAAAG CCAGGAAATC
ATCGACACCC TTCTCCAGTA CTACCGTGAA CTCTGGCGGT GGTCTATTGA AAACGGATGT
AAGGAGTCTA TCAAAAGACT GCATCACAAT CTGGATGACG TAAGAGCATT TGAACGAGCC
CAATACTTGC CGATAGGAAC CCCAGAGAGC GAATACAACG AAGACTTCCT TGATCAAAAG
CAGGAGATCG CTAACACCTT CCGGAAGCGA ATCCAGATAC AGAAATTTGC CGGCTACTCC
TGGGGATACA ACCTCTACGT AAAGGGGATA ATATCTGAGG AAGAGTTTAT CGAGGAACTG
CTTCAGAAAT ACGCCGAGCA GAACTTTTCC TCCATTAGCT CTCTTACAGA GACCTACTTC
GAGATTCAAT CCATTCTCGG TGAGGTACCT TACTGGGAAG AGTGGGAGAC AAACAGACAA
CTACAGCAGT CACTCGGCCC AGTCATGACT TCTATGGGGA CAAACAGTTG GATTCCCTCT
TTCTACCTAG CGTTTTCACT GTATCTCTTC GACGAAGACA CTCAAGAGAA CTTCTCAAAC
TCAACTCCTG AAGAACTGCC TTTCCCCACA GGAAGCAAGG AACGTATTGA GATCAACAGT
CTACGGGATG CGATTGAAGG ATTTGAAGAC GACTATCCTC TGGACTTCCT TCTAGACGAC
CAGACGGACA TAAATGACAG AATCGAGAAA CTGTCTGAAA TCCTCAATCG AGCGCATTCC
TACGCTGAGA AACAGGATAT AATGCGGATG CGGAATCATC CCATCGAATC AGAGTACGTC
GACTCTTGGG AAAAAGAGGT CAACGACCAA TTCGATAGCT CATGTCTGTT GAGACAAGCG
CTGAAAGAAA TCGGGTTGCT GAAGCAGAAA CCGTTCCCCC CGGATCTTGA CGGTATCAAG
GTCTCAGTTG GATATCCGCG TAAGCGAAAC TTCGTGCCGG AAGAAGCAGT ACACAAATCT
CCAACAGGCA ACTTCCGAAG TATCTTAGAC GATTATCGAG AGTATGTTTT GAGGCGGCTG
ACCTTAGAGG AGCACACCGT AGACACTGTT GACGAGTTAC TCGATGAAAT CGAGGATCAA
GTGGAGAGAC GAGATCCGTC AGTGATTCTC TTCAAGACTG GAGAGCACCG CAGAAAGCTT
TTAGAAGATG ACCGGTTCAC ACATGGAAGC GATTTCCCTA ACTCCCATCA CACGTTCTTG
GACATCCCTG TTCTCACTGA ACCCACTGAA ACGTACAACG CTCTCCTGCT TCTAGAGAAC
GAAAGTCACG GTGTAGAGTT CGTTGAAGAC GATATCGTAT TCAATCTGGA AGCTACGCCA
GGAGAGGAAG CCGAGGTAAT TGATATGCCG AACAAGCCTC TGGAGTCAAT TCCATATACT
AACGCACCTC ATGACTTTGT AGAGATGGAA GTCCGGCTCC GAGGATACAT ACAGACTGAG
GAGCTCGATG GGGTACGCTT CCAAATCAAC TCTGAGGTTC CAGAATAG
 
Protein sequence
MPNYEDYLEH FFDNAEASIR EGKGQELSDN LSHIAELIHE LIDKEVDSKR QFRSNYAFCR 
RQYIQLYDTV LENGADEDLR TSIIDSISAI TNYSRQANDI EAFDQLLNSI TSCYRQSYPQ
PGFDDAVGNI FERYSHIQHG VTQNFEDVDS VDRLAKSQEI IDTLLQYYRE LWRWSIENGC
KESIKRLHHN LDDVRAFERA QYLPIGTPES EYNEDFLDQK QEIANTFRKR IQIQKFAGYS
WGYNLYVKGI ISEEEFIEEL LQKYAEQNFS SISSLTETYF EIQSILGEVP YWEEWETNRQ
LQQSLGPVMT SMGTNSWIPS FYLAFSLYLF DEDTQENFSN STPEELPFPT GSKERIEINS
LRDAIEGFED DYPLDFLLDD QTDINDRIEK LSEILNRAHS YAEKQDIMRM RNHPIESEYV
DSWEKEVNDQ FDSSCLLRQA LKEIGLLKQK PFPPDLDGIK VSVGYPRKRN FVPEEAVHKS
PTGNFRSILD DYREYVLRRL TLEEHTVDTV DELLDEIEDQ VERRDPSVIL FKTGEHRRKL
LEDDRFTHGS DFPNSHHTFL DIPVLTEPTE TYNALLLLEN ESHGVEFVED DIVFNLEATP
GEEAEVIDMP NKPLESIPYT NAPHDFVEME VRLRGYIQTE ELDGVRFQIN SEVPE