Gene Hlac_3340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3340 
Symbol 
ID7402196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp92604 
End bp94025 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content55% 
IMG OID643709892 
Producthypothetical protein 
Protein accessionYP_002567458 
Protein GI222481222 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAC AACCAGAATA TGGGACTCTT AGAATTACCA CACGAGGCAG CGAGCAGGCT 
TCTACTTCGA AATCTGTTCT CAGAACCTCA TCCAGACAAC AGCCGGGGCA TAACCCAATG
AGTCACCCTG AGTCCGATCA GGAAAGCGAG CTCGAACCAC AGCAGCCTTC GTTCGACGAT
GTCCGGTGGT CGGCCTGTTC ACTCGAAGAC TTCACACAGC TCTACTGGGA GCAGGTTGCT
CCGTGTCTCG AAGCAGAAGG CCTTGATCCT ACGGCGGAGA AACCAACCCA CCAGTGGTTC
AGTGATCACG GTGTGCGGTC ATTTCTCGCG GCCTTTCGTC GACACCACGA CCGATCTTTC
GGAGAGTTCT GGAGTGAAGA TCTCGGACTT GGTGACGACG ATGACGGCTA CACTTGGGCA
ACTTCCGATG AGCAAACAGT CGACGCACTC GAGCGATTCT TGGATCGTCG ACAGTCGCGG
TACGGTCTTT CGACGTCTTC TGTCGACACC CTCCGAACGC GGCTGAACCT CTATGTCCGG
GCGTACTCTG AGGCAAACGA CACGGATGAT CTCCTCTCGC CAATTCAACG TGATCGAGAC
GCACCCGCAT ACGAAGCTGT CGATGCATGC TATGGTGCAT TTGACTGGCT GAATGAGGGG
GCCGAACGCG AGTACAGTGC TCAGACCCTC CAACGGGTGC GACGCATCGT CGACGCTTGG
TATCAGCATC TGGTCGGTCG ACGAATCGCT TCGATGAATC CCGCCAGCGG ATTGTATGAA
GAATTCAAGT GGGAAACCAA AGACTCGCCG ACCCCATCAC TGTCAGCGGC CCATATTCGC
CAGCTGATGG AGATGGAAAC GACCTCACGA GACCAACTAT TGGTGGTTGC CCTCGCTGGG
TGGGGACTCC GAGCAGGCGA GGTCGCGGCA CTCCACATTT CGCAGTTCAA TCGCGATGTT
CCCGACGACG ACGTCCCCCA TATCGCATTC GAGAGCCGTA AGAACGGTCC TGGAGAAGTA
TCGGTACTGT TCGGTCTAGA TATCCTGGAC TCCCGAATTG ATGAACTTGG AGAAGATGAG
ACGTGGGACG GATACTTGTT CCCCTCACCG CAGGGCCAAA TCCCACACGT AACGCGGGAC
ACAATCCGTA ATTGGTTCCA AAAGCTTGCT TCAGAAGCCG ATCTTCCAGA TCGGATCGAA
GGCGAGCGTC CGAGTCCGCA GCTCTGTCGA CGGTTCTGGT ATGATACCTA TACTGCAGTT
CTCGAAGGAG TCCTCGAAGG CGTCGAAGAA ATAGCTGCAG AGCAGGGTAG TAGCGATCCA
CAGGTCGTTA TGCAGAATTA CCTCTCCGAC TCACGATCTC GCCAGTTACG TCGCGAATTC
ATGCGTGAGC AACTGATGGG AATCTTCAGG GGTGAGAGTT AG
 
Protein sequence
MTEQPEYGTL RITTRGSEQA STSKSVLRTS SRQQPGHNPM SHPESDQESE LEPQQPSFDD 
VRWSACSLED FTQLYWEQVA PCLEAEGLDP TAEKPTHQWF SDHGVRSFLA AFRRHHDRSF
GEFWSEDLGL GDDDDGYTWA TSDEQTVDAL ERFLDRRQSR YGLSTSSVDT LRTRLNLYVR
AYSEANDTDD LLSPIQRDRD APAYEAVDAC YGAFDWLNEG AEREYSAQTL QRVRRIVDAW
YQHLVGRRIA SMNPASGLYE EFKWETKDSP TPSLSAAHIR QLMEMETTSR DQLLVVALAG
WGLRAGEVAA LHISQFNRDV PDDDVPHIAF ESRKNGPGEV SVLFGLDILD SRIDELGEDE
TWDGYLFPSP QGQIPHVTRD TIRNWFQKLA SEADLPDRIE GERPSPQLCR RFWYDTYTAV
LEGVLEGVEE IAAEQGSSDP QVVMQNYLSD SRSRQLRREF MREQLMGIFR GES