Gene Hlac_1889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1889 
Symbol 
ID7400083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1889553 
End bp1890947 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content69% 
IMG OID643708960 
Productprotein of unknown function DUF21 
Protein accessionYP_002566537 
Protein GI222480300 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.483019 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.437772 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGATA TCGCGATCTC CGCCACTCGG GTCGTCGCGG CGCTCGTGTT GGTCGTGTTG 
AACGGCTTCT TCGTCGCCTC CGAGTTCGCC TTCGTCCGCG TCCGCTCCAC CTCGGTCGAG
CAGCTGGTCG AGGAGGGGCG CCCCGGCTCG GGAGCGCTGC AGGACGTGAT GGGGAGCCTC
GACGACTACC TCGCGGCGAC GCAGCTCGGG ATCACGCTCG CCTCGCTCGG GCTCGGGTGG
GTCGGCGAGC CCGCGATTGT GGCGCTGATC GAGCCGGCGC TCGGACCGCT GCTCCCCCCG
AACCTCGTCC ACATCGTCGC GTTCGCGATC GGGTTCAGCA TCATCACGTT CCTCCACGTC
GTGTTCGGTG AGCTGGCGCC GAAGACGATC GCCATCGCGC AGGCCGAGCG GGTCGCGATG
ATGTTGGCGC CCCCGATGAA GCTCTCCTAC TACCTGTTCT CGCCCGGCAT CGTCGTGTTC
AACGGGGCGG CTAACGCCTT CACGCGCATG CTCGGCGTCC CGCCGGCCTC GGAGACGGAC
GAGACGATGA AGGAGCGCGA GATCCGCCGC GTGCTCGCGC GCTCCGGCGA GGCCGGTCAC
GTCGCAGACG TCGAAGTCGA GATGATCGAC GCCGTCTTCG AACTCGACGA CACCGTGGTC
CGGGAGGCGA TGGTCCCCCG ACCGGACGTG ACGAGCATCC CGGCCGGCGC GGACCTCGCC
GCGATCCGCA CGACCGTACT CGACGCGGGT CACACCCGGT ACCCGGTAGT AGCGGCTGAC
GACGCAGACC GCGTGGTCGG CTTCGTTGAC GCGAAAGACG TGTTGCGCGC GGGTGAAGCG
GGTGACGAGT CGGTCACGGC TGCCGATCTC GCACGCGACC TCGTGATCGT TCCGGAGACC
ACGTCGCTGA GCGATCTGCT CGTGCAGTTC CGCGATGAAC GCCGTCAGAT GGCCGCCGTC
GTCGACGAGT GGGGCGCCTT CGAGGGGATA GTCACCGTCG AAGACACGGT CGAGACGCTC
GTCGGCGACC TCCGCGACGG CTTCGACGCC GCGGGCGGCG ACCACGCGGT TCGGAAGACC
GGGGCGGGGG CCTACGAGGC CGACGGGTCG GTCTCGCTAT CCGTCGTCAA CGACGCGCTC
GGCACCGACT TCGACGGCGA CGGGTTCGAG ACGCTCGGCG GACTCGTGCT CGATCGACTC
GGTCGCACCT CGGAAACCGG GGATACGATC GCGGCCGGCG ACTACCTCTT CGAGGTCACG
GCGGTCGACG GCGCCCGCAT CTCGACAGTC CGGATCGAGG AGGTCGACGA AGGCGACGAA
GTCGACGGCG AGGACGAGGT CGACGGAGCC GGTGACGGGG CCGATGACGA ATCCGGCGGG
GCGGACGGCG CCTGA
 
Protein sequence
MIDIAISATR VVAALVLVVL NGFFVASEFA FVRVRSTSVE QLVEEGRPGS GALQDVMGSL 
DDYLAATQLG ITLASLGLGW VGEPAIVALI EPALGPLLPP NLVHIVAFAI GFSIITFLHV
VFGELAPKTI AIAQAERVAM MLAPPMKLSY YLFSPGIVVF NGAANAFTRM LGVPPASETD
ETMKEREIRR VLARSGEAGH VADVEVEMID AVFELDDTVV REAMVPRPDV TSIPAGADLA
AIRTTVLDAG HTRYPVVAAD DADRVVGFVD AKDVLRAGEA GDESVTAADL ARDLVIVPET
TSLSDLLVQF RDERRQMAAV VDEWGAFEGI VTVEDTVETL VGDLRDGFDA AGGDHAVRKT
GAGAYEADGS VSLSVVNDAL GTDFDGDGFE TLGGLVLDRL GRTSETGDTI AAGDYLFEVT
AVDGARISTV RIEEVDEGDE VDGEDEVDGA GDGADDESGG ADGA