Gene Hlac_0030 details


Gene Information

Locus tag: Hlac_0030
Symbol:
ID: 7401383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 32182
End bp: 33564
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 69%
IMG OID: 643707089
Product: hypothetical protein
Protein accession: YP_002564706
Protein GI: 222478469
COG category: [S] Function unknown
COG ID: [COG2718] Uncharacterized conserved protein
TIGRFAM ID:
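
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 32182
END_BP = 33564
GENE_LENGTH_BP = 1383
PROTEIN_LENGTH_AA = 460


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, the span 32182 to 33564 gives 1383 bp, and 1383 bp corresponds to 461 codons, that is 460 amino acids plus the stop codon.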


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0419378
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGACTGA CCGAGGACCG CGAGCGGTTC CGCGAGGTGG GTGAGAAGCG CCGCGAGGAT 
CTCGCGGAGT TCATCAGCCA CGGCGATCTG GGCGGCTCCG ACCCCGACCG AGTTCGTATC
CCGGTGAAGA TCGTCGATCT CCCCGAGTTC AAGTACGACC GCCGCGAGGC CGGCGGCGTC
GGGCAGGGGG AGAGCGGACA GCCCCAGCCC GGCCAGCCGG TCGAACCAAA GTCCGACGGC
GATGAGGAGG GGGACCCGGG CGAGGAAGGC GGCGAGCACG AGTACTACGA GATGGACCCG
GAGGAGTTCG CGCGCGAGCT CGACGAGGAG CTGGGACTCG ACCTCGAACC GAAGGGGAAA
CGAGTCGTTG AGGAGATGGA AGGCGATTAC ACCGACACTG CCCGCGCCGG CCCGCGGGGG
ACCCTCGACG TCGACGAGTT CTTCAAGCGC GGGCTGAAGC GCCACCTCGC GACCGACTTC
GACGAGGATT ACGTCCGCGA AGGACTCCAC GTCGCCGGCG CGGACGTCGA CGACGTGTTC
GCATGGGCGC GCAACCAAGG GATTCCCGTC TCGCGGGCGT GGATCGCCGA CGCGGCTGCG
ACCGCGGCGG ACGAGCACGG CGAGCCCGTC GAAGCGCTCG ACCGGTGGGT GAGCTTCGAC
GCGCTCGACG AGGATATCGA CCGCGAGCCC GCCACTCATC GGATTCGCCG GGAGGGGCTC
GACAGCGTCC CCTTCCGCCG AGAAGACGAG CGCTTCCGTC ATCCCGAGGT GATCGAGAAG
CGCGAGCGCA ACGTCGTCGT GGTGAACGTC CGTGACGTGT CCGGCTCGAT GCGCGAGACG
AAACGCGAGC TGGTCGAGCG GGTGTTCACC CCGATGGACT GGTACCTGAC CGGGAAGTAC
GACGCCGCGG AGTTCCGCTA CGTCGTCCAC GATGCAGAAG CGTGGGAGGT CGACCGCGGC
GAGTTCTTCG GGATCCAGTC GGGCGGAGGC ACCCGGATCT CCAGCGCCTA CGAGCTGGTC
GCGGAGATCC TCGAAGAGTA CCCCTACAGC GAGTGGAACC GCTACGTGTT CGCCGCCGGC
GACTCCGAGA ACGCCGGCAG CGACACGACC GAGTCGGTGA TGCCGCTGAT GGAGTCGATC
GACGCCAACC TCCACGCGTA CGTGGAGACG CAGCCGGGCG GCGCCGCGCC GAACGCCCGT
CACGCCGACG AGGTGGAAGA GGCGCTCGGC AACACGGGCA ACATCGCCGT CGCCCGCGTC
TCGGAGCCGG GCGACGTCAC CGACGCCATC TACGAGATCC TCAGTACCGA AGCCGAAGGC
GACGGCGCCG CGACCGCCGG CGAGGCGGCA AGCGGGCTGA CCGACGGAGG TGAGCGCCGA
TGA
 
Protein sequence
MGLTEDRERF REVGEKRRED LAEFISHGDL GGSDPDRVRI PVKIVDLPEF KYDRREAGGV 
GQGESGQPQP GQPVEPKSDG DEEGDPGEEG GEHEYYEMDP EEFARELDEE LGLDLEPKGK
RVVEEMEGDY TDTARAGPRG TLDVDEFFKR GLKRHLATDF DEDYVREGLH VAGADVDDVF
AWARNQGIPV SRAWIADAAA TAADEHGEPV EALDRWVSFD ALDEDIDREP ATHRIRREGL
DSVPFRREDE RFRHPEVIEK RERNVVVVNV RDVSGSMRET KRELVERVFT PMDWYLTGKY
DAAEFRYVVH DAEAWEVDRG EFFGIQSGGG TRISSAYELV AEILEEYPYS EWNRYVFAAG
DSENAGSDTT ESVMPLMESI DANLHAYVET QPGGAAPNAR HADEVEEALG NTGNIAVARV
SEPGDVTDAI YEILSTEAEG DGAATAGEAA SGLTDGGERR
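
The sequences above are printed in space-separated blocks of ten residues. The following is a minimal sketch of how such text could be cleaned and how a GC percentage could be computed from it. The helper names `clean` and `gc_percent` are hypothetical, and the excerpt is only the first two blocks of the gene sequence. To compare against the reported GC content, paste the full gene sequence in place of the excerpt.

```python
def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna_text: str) -> float:
    """Percentage of G and C bases in a DNA sequence (0.0 for empty input)."""
    dna = clean(dna_text)
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a short excerpt.
    excerpt = "GTGGGACTGA CCGAGGACCG"
    print(len(clean(excerpt)))
    print(gc_percent(excerpt))
```

The same `clean` helper can be applied to the protein sequence to count residues for comparison with the Protein Length field.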