Gene Information |
Locus tag | Htur_0049 |
Symbol | |
ID | 8740612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 54517 |
End bp | 55920 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 646510612 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003401623 |
Protein GI | 284163344 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
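
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical helpers introduced for illustration and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 54517
end_bp = 55920
reported_gene_length = 1404      # "Gene Length" field, in bp
reported_protein_length = 467    # "Protein Length" field, in aa


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon.

    Assumes the CDS is unspliced and terminated by one stop codon.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions both comparisons agree with the reported fields: 55920 - 54517 + 1 = 1404 bp, and 1404 / 3 - 1 = 467 aa.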
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTTAT CTCTGTCGCC CGCGGTCGTA GCCGCCGCAT ACGAGGTCCC AGTCGTGGGT CTCGAGTTCG ACGAGTCGAT GGTGACGATT CTCGGCAGCG TCGCCATACT CCTGCTTATC GCGCTCTCTG CGTTCTTCTC CTCATCGGAG ATCGCGATGT TCAACCTGCC GAAACACCGC CTCGAGGGGA TGGTCGAGGA CGGCGTCCCA GGCGCCAAAC TGGTCAAGTC CCTCAAGGAC GATCCCCACC GGCTTCTCGT GACGATCCTC GTCGGTAACA ACATCGTCAA CATCGCGATG TCCTCGATCG CGACGGCGAT CCTGTCGCTG CACTTCGGCG GACTGGTCGG CGTGTTTCTG GCGACGTTCG GGATCACCGC GCTCGTCCTC CTGTTCGGCG AGAGCGTTCC CAAGTCCTAC GCCGTCGAGA ACGCCGCACC GTGGTCGATC CGAATCGCCA GACCGCTGAA GGCGACGGAG TACTTCCTGT TCCCGCTGAT CGTCCTCTTC GACTATCTCA CTCGACAGGT CAACAAGCTC ATCGGCTCGA CCGGTGCGAT CGAGTCGCCC TACGTCACCC GCGACGAGAT CCAGGAGATG ATCGAATCCG GCGAGCGCGA GGGCGTCTTG GAGGAGGAAG AACACGAGAT GCTCCAGCGG ATATTCCGCT TTAACAACAC TATCGTCAAG GAGGTCATGA CCCCCCGCCT CGACATGACC GCGGTCCCCA AGGACGCCGG CATCGACGAG GCCATCGAAA CCTGTATCCA GAGCGGCCAC GCCCGCGTGC CGGTCTACGA GGGCAGCCTC GACAACGTCC TCGGCGTCGT CCACATTCGC GACCTCGTCC GCGATCTCAA CTACGGCGAG ACGGAGGCCG ACGACCTCGA ACTCGAGGAC CTCATCCAGC CGACGTTACA CGTCCCCGAG TCGAAGAACG TCGACGAACT GCTGACCGAG ATGCGGGAAA ACCGGATGCA CATGGCAATC GTTATCGACG AGTTCGGCAC CACCGAGGGG CTGGTCACCG TCGAGGACAT GATCGAGGAA ATCGTCGGCG AGATCTTGAA ATCCGGCGAG GACGAACCGA TCGAACAGCT CGACGACCGC ACCGTCATCG TCCGCGGCGA GGTCAACATC GAGGACGTCA ACGAGGCCTT AGAGATCGAC CTCCCCGAGG GCGAGGAGTT CGAGACCATC GCCGGTTTCA TCTTCAACCG CGCGGGCCGG CTCGTCGAGG AGGGCGAGGA GATCACCTAC GACGGCGTCC GTATCACCGT CGAGACCGTC GAGAACACCC GCATCATGAA AGCCAGACTG CGAAAACTCG AGCAGCCGAC CGAATCCCTC GAGGAGGCGC CGGAGGAAGC CGAGTCCGAC GAGGAGTCGG TCCCCTCGGA GTAG
|
Protein sequence | MALSLSPAVV AAAYEVPVVG LEFDESMVTI LGSVAILLLI ALSAFFSSSE IAMFNLPKHR LEGMVEDGVP GAKLVKSLKD DPHRLLVTIL VGNNIVNIAM SSIATAILSL HFGGLVGVFL ATFGITALVL LFGESVPKSY AVENAAPWSI RIARPLKATE YFLFPLIVLF DYLTRQVNKL IGSTGAIESP YVTRDEIQEM IESGEREGVL EEEEHEMLQR IFRFNNTIVK EVMTPRLDMT AVPKDAGIDE AIETCIQSGH ARVPVYEGSL DNVLGVVHIR DLVRDLNYGE TEADDLELED LIQPTLHVPE SKNVDELLTE MRENRMHMAI VIDEFGTTEG LVTVEDMIEE IVGEILKSGE DEPIEQLDDR TVIVRGEVNI EDVNEALEID LPEGEEFETI AGFIFNRAGR LVEEGEEITY DGVRITVETV ENTRIMKARL RKLEQPTESL EEAPEEAESD EESVPSE