Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1889 |
Symbol | |
ID | 7400083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1889553 |
End bp | 1890947 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643708960 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002566537 |
Protein GI | 222480300 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.483019 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.437772 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAGATA TCGCGATCTC CGCCACTCGG GTCGTCGCGG CGCTCGTGTT GGTCGTGTTG AACGGCTTCT TCGTCGCCTC CGAGTTCGCC TTCGTCCGCG TCCGCTCCAC CTCGGTCGAG CAGCTGGTCG AGGAGGGGCG CCCCGGCTCG GGAGCGCTGC AGGACGTGAT GGGGAGCCTC GACGACTACC TCGCGGCGAC GCAGCTCGGG ATCACGCTCG CCTCGCTCGG GCTCGGGTGG GTCGGCGAGC CCGCGATTGT GGCGCTGATC GAGCCGGCGC TCGGACCGCT GCTCCCCCCG AACCTCGTCC ACATCGTCGC GTTCGCGATC GGGTTCAGCA TCATCACGTT CCTCCACGTC GTGTTCGGTG AGCTGGCGCC GAAGACGATC GCCATCGCGC AGGCCGAGCG GGTCGCGATG ATGTTGGCGC CCCCGATGAA GCTCTCCTAC TACCTGTTCT CGCCCGGCAT CGTCGTGTTC AACGGGGCGG CTAACGCCTT CACGCGCATG CTCGGCGTCC CGCCGGCCTC GGAGACGGAC GAGACGATGA AGGAGCGCGA GATCCGCCGC GTGCTCGCGC GCTCCGGCGA GGCCGGTCAC GTCGCAGACG TCGAAGTCGA GATGATCGAC GCCGTCTTCG AACTCGACGA CACCGTGGTC CGGGAGGCGA TGGTCCCCCG ACCGGACGTG ACGAGCATCC CGGCCGGCGC GGACCTCGCC GCGATCCGCA CGACCGTACT CGACGCGGGT CACACCCGGT ACCCGGTAGT AGCGGCTGAC GACGCAGACC GCGTGGTCGG CTTCGTTGAC GCGAAAGACG TGTTGCGCGC GGGTGAAGCG GGTGACGAGT CGGTCACGGC TGCCGATCTC GCACGCGACC TCGTGATCGT TCCGGAGACC ACGTCGCTGA GCGATCTGCT CGTGCAGTTC CGCGATGAAC GCCGTCAGAT GGCCGCCGTC GTCGACGAGT GGGGCGCCTT CGAGGGGATA GTCACCGTCG AAGACACGGT CGAGACGCTC GTCGGCGACC TCCGCGACGG CTTCGACGCC GCGGGCGGCG ACCACGCGGT TCGGAAGACC GGGGCGGGGG CCTACGAGGC CGACGGGTCG GTCTCGCTAT CCGTCGTCAA CGACGCGCTC GGCACCGACT TCGACGGCGA CGGGTTCGAG ACGCTCGGCG GACTCGTGCT CGATCGACTC GGTCGCACCT CGGAAACCGG GGATACGATC GCGGCCGGCG ACTACCTCTT CGAGGTCACG GCGGTCGACG GCGCCCGCAT CTCGACAGTC CGGATCGAGG AGGTCGACGA AGGCGACGAA GTCGACGGCG AGGACGAGGT CGACGGAGCC GGTGACGGGG CCGATGACGA ATCCGGCGGG GCGGACGGCG CCTGA
|
Protein sequence | MIDIAISATR VVAALVLVVL NGFFVASEFA FVRVRSTSVE QLVEEGRPGS GALQDVMGSL DDYLAATQLG ITLASLGLGW VGEPAIVALI EPALGPLLPP NLVHIVAFAI GFSIITFLHV VFGELAPKTI AIAQAERVAM MLAPPMKLSY YLFSPGIVVF NGAANAFTRM LGVPPASETD ETMKEREIRR VLARSGEAGH VADVEVEMID AVFELDDTVV REAMVPRPDV TSIPAGADLA AIRTTVLDAG HTRYPVVAAD DADRVVGFVD AKDVLRAGEA GDESVTAADL ARDLVIVPET TSLSDLLVQF RDERRQMAAV VDEWGAFEGI VTVEDTVETL VGDLRDGFDA AGGDHAVRKT GAGAYEADGS VSLSVVNDAL GTDFDGDGFE TLGGLVLDRL GRTSETGDTI AAGDYLFEVT AVDGARISTV RIEEVDEGDE VDGEDEVDGA GDGADDESGG ADGA
|
| |