Gene Hlac_3082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3082 
Symbol 
ID7399053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp336864 
End bp338222 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content60% 
IMG OID643706886 
Productprotein of unknown function DUF790 
Protein accessionYP_002564508 
Protein GI222475987 
COG category[S] Function unknown 
COG ID[COG3372] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGACCG CGAACCTCGC CCGGTCACGC ACGACCGACG AAGAAGTCAA ACCGCTGTTC 
ATCGATCCCG ACGAGGAGCG CTACCAACAG ACTGCTCGAG AACTCATCCA GCTGTTCGAG
GCCCATCTCG GTGAGCCGAA AGGCGACCTC GAGGACGCGA TTGACGAGCT GACCATCGCG
GATACCGACT ACAAGATCGT CCAAGGGCTG GCGAAACTCC TGAAAGACGA GTGTGAGTTC
GAGGTCGTCG CCTCCGTCGA ACCGCGTGAG ATCCGCCGGC GACTCTTCGA GAAAGCCAAC
GAGCGCTATC CGATCGTCCG CCAGCCGACG CTGGGCGAGG ACACACAGAA GCTGGAGGTG
TACAGCGCGG TCGCCGACGA CCTCGGGGTG TCGTTGGAAG AGTGCTATCG CGGGATGTAC
GCCGATCTCG AAGACAACAA ACGACTCGTC CGAATCGGAA CGCGGACGGC CGACCAGTAC
GCCAGTGATG ACGATACGTC GACGTCGACG ACCAACCTGA CCGGCAGCAG CGACGCGGAG
TATGAACACA CGGGTCTCAC CGTGGACTGG TTGGTGACCC GGTACAACCT CGCGCTCGCC
CAGGCGGTGC TCTACGACGC CACAGAAATG CGGATTCGGG TGTGGGACCA CTTCGGGACG
GTGTTCAGTT ACGTGAAGCT GTTCGGGTTG ATGCATCGCA TCTATCCGAT CGACAGCGAC
GGTGAACGCG TCGCGAACAC GGACCAAGCC GCCGGCTACG AGGCCGTACT GGACGGCCCG
GCATCGCTAT TCTCAAAGTC GCAGAAGTAC GGGATTCGCA TGGCGAACTT CCTGCCGGCA
TTGCCCCTCT GTGACCGCTG GGAGATGGTT GGTGAGATCC TCGTCGACGA GACGACCGGC
GAGACCCGAC AGTTCGCGCT CGACCCCACG GAGGATCTCG ATTCACACTA CAGCGCGGGC
GACCAGTTCG ATAGCGACGT CGAGCGGACG CTCGCCGATA AATGGGAGCG AGCGAATACG
GACTGGAAGT TGGTGCGGGA AGACGATGTC TTCGACCTAG GTGCTGAGGT GATGATTCCC
GACTTCGCGA TCGAACATCC CGATGGCAGG CGTGCGATCC TCGAGATTGT CGGCTTCTGG
ACGCCCGAAT ATCTGGACGC GAAACTGGAG AAGATTCGAA AGGTGGAGGC CGACAATTTC
GTGCTGGCTG TCTCGGAGCA ACTGGATTGT GCGAGCGAGG AGTTCGGGAG CGCCGCCGAT
CGAGTGCTGT GGTTCAAAAC GGGAATTCAC GTCTACGATG TAGTCGATTT AGTTGAGCAA
TACGCGACAG GGATGTCACA GAGTGAAGAG CAGGCTTGA
 
Protein sequence
MLTANLARSR TTDEEVKPLF IDPDEERYQQ TARELIQLFE AHLGEPKGDL EDAIDELTIA 
DTDYKIVQGL AKLLKDECEF EVVASVEPRE IRRRLFEKAN ERYPIVRQPT LGEDTQKLEV
YSAVADDLGV SLEECYRGMY ADLEDNKRLV RIGTRTADQY ASDDDTSTST TNLTGSSDAE
YEHTGLTVDW LVTRYNLALA QAVLYDATEM RIRVWDHFGT VFSYVKLFGL MHRIYPIDSD
GERVANTDQA AGYEAVLDGP ASLFSKSQKY GIRMANFLPA LPLCDRWEMV GEILVDETTG
ETRQFALDPT EDLDSHYSAG DQFDSDVERT LADKWERANT DWKLVREDDV FDLGAEVMIP
DFAIEHPDGR RAILEIVGFW TPEYLDAKLE KIRKVEADNF VLAVSEQLDC ASEEFGSAAD
RVLWFKTGIH VYDVVDLVEQ YATGMSQSEE QA