Gene Information |
Locus tag | Hlac_1804 |
Symbol | |
ID | 7399678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1819692 |
End bp | 1820495 |
Gene Length | 804 bp |
Protein Length | 267 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643708871 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_002566453 |
Protein GI | 222480216 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
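The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1819692, 1820495)
print(length, protein_length_aa(length))
```

Under those assumptions, the result agrees with the Gene Length (804 bp) and Protein Length (267 aa) fields of this record.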
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATACC GCACGGAGAC GGACATAAGC CGTCGGACGT ACCTGAAGCT GACCGGCGGG GCCAGCGCCG TCGGGCTGAC CGGCGTCGCC GGCTGTCTCG GCGACGGGGG GGACAGTACC ACGATCACTC CGGGCACCGC ACCCGGCTTC CCGCCGTTCG AGATGCAAGA AGACGGCGAG CTGGTCGGCT TCGACATCGA TCTGCTGGAG GCCGTCGTGG ACGAGACTGA GTACGAACTG GGCGAGTGGG CCACCTTCGA GTTCGACTCG CTCATCCCCG CGCTCACGCA GAACGAGGAG ATCGACGTGA TCGCCGCCGC GATGACGATC ACCGAGGACC GCCAGGAGAC GATCGCCTTC TCTGACCCCT ACTGGGAGTC GGATCAGGCG ATCCTCGTGC GCGAAGGCGG CGACTTCCAG CCGTCGGCGT GGGAGGACTT CGAGGGCGTC AGTGTCGGCG CGCAGTCCGG GACGACCGGC GCCGATCAGG TCCAGTCGAA CCTCGTCGAT CCCGGGCTCG TCGCCGAGGA CGACTACCGC ACGTACGGAA GCTACGTCCT CGCGGTCGAG GACCTCGAAA ACGAGAACAT CGACGCCGTC GTCATCGACC TCCCGGTCGC GGAGACGTTC GCGGCCAACC GCGACGTAGA AATCGCGTTT ATCGAAGAGA CCGGCGAGCA GTTCGGGTTC GGCCTCCGGC AGGGCGAGTC GGAGTTCCAG TCGGCGCTGA ACGACGGGCT CGCGACCGTC CGCGACGACG GGACGTACAG CGAGATCACG AACACCTGGT TCGGACAGGA GTAG
Protein sequence | MSYRTETDIS RRTYLKLTGG ASAVGLTGVA GCLGDGGDST TITPGTAPGF PPFEMQEDGE LVGFDIDLLE AVVDETEYEL GEWATFEFDS LIPALTQNEE IDVIAAAMTI TEDRQETIAF SDPYWESDQA ILVREGGDFQ PSAWEDFEGV SVGAQSGTTG ADQVQSNLVD PGLVAEDDYR TYGSYVLAVE DLENENIDAV VIDLPVAETF AANRDVEIAF IEETGEQFGF GLRQGESEFQ SALNDGLATV RDDGTYSEIT NTWFGQE
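The gene and protein sequences above are shown in space-separated blocks of ten characters. The following is a minimal Python sketch, not the database's own method, of how such a sequence could be cleaned, translated, and summarized by GC fraction. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is written out in the NCBI TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard table and differs only in its permitted start codons, which this sketch does not model.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon (not included)."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
print(translate("ATGTCATACC GCACGGAGAC GGACATAAGC"))
```

Applied to the first three blocks of the gene sequence, the sketch reproduces the first block of the protein sequence (MSYRTETDIS). Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.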