Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2482 |
Symbol | |
ID | 7401534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2459733 |
End bp | 2460935 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643709554 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002567125 |
Protein GI | 222480888 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0758562 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGACA GCGAATCCGA TCGCGAGAGC GGACGTGTGG GGGTATCGCG GCGACAGTTC TTGGAGGTGA CGGGCGCAAC GGGGGCAGCC GTCGGGCTCG CCGGCTGTTC CGGCGGTGGC GGCGGCGGCG ACGGTCCGAT CCAGATCACG ATGGATGCCG AGTGGGGAGG CATCTCTGAC GCCCTCACTC AGAGCCTGTA CGACGCGGGT CTCGACGAGT CGATCGAAAT CGAGATCCTG CCGGGCGACT TCGAGTCGGG GGCGCGCCGG TCGGAGTTCA CGTCGGCGCT CGACGCCGGG CGGGCGAGCC CGGACATCTT CATGATGGAC TCAGGGTGGA CGATCCCGTT TATCGCGCGC GGTCAGCTCG TGAACCTGAG CGACGAGCTC TCCTCCGAGA CGCTCGATTA CGTCCAGAAC GACTACCTGC CGAGCGCGGT AAACACCGCG AGCGACCCAG AGAGCGGCGA CCTGTTCGGA CTGCCGCTGT TCCCGGACTA CCCGGTGATG CACTATCGAA AGGACCTGGT CGAGGACGCC GGCTACGACC CGGACGGCGA GAACTGGGCG ACCGAGCCGA TGAGCTGGCA GGAGTTCGCC GAGATGGCCG CCGACGTGTG GGAGCAGAAC GGCGGCCCCG GTGGCGACTT CGATTACGGA TTCACGACTC AGGGCGACAA CTACGTCGGG CTCGCCTGCT GTACGTTCAA CGAGACGATG ACTTCCTTCG GTGGCGCGTA CTTCGGCGAC CACGAGAACC TCTTCGGCCC GATCGGCGAT CGGCCGATCA CGGTCAACGA GGAGCCCGTT CACGACACGA TCCGCATGAT GCGGTCGTTC ATGGAGGGGC CCGACGCCGA GTACGCTCAC CCGGACTTCC CGCAGATTTC GACGACAGAT CTGCTCTCGT TCACCGAGGA GCCGTCCCGT GAGCCGTTCA CGTCCGGGAA CGCGATTTTC CACCGGAACT GGCCGTACGC GATCCCGCTC AACCTCGACT CCGAGGAGTT CAGCGCGGAG GATTACGACG TGATGCCGCT TCCGTACGGC ATCGAGGCAG GCGAGGGCGA GTACGAGGGC ACCGGCGGCG CCGCGGCGCC GCGGCGGCGC TCGGCGGCTG GCACCTCACG ATCAACCCGA ACACCCCGCG GCTCGACGAC TGCGTTCAGG TGCTCGAGGC GTTCGCCAAC GAGGAGGTCA TGA
|
Protein sequence | MVDSESDRES GRVGVSRRQF LEVTGATGAA VGLAGCSGGG GGGDGPIQIT MDAEWGGISD ALTQSLYDAG LDESIEIEIL PGDFESGARR SEFTSALDAG RASPDIFMMD SGWTIPFIAR GQLVNLSDEL SSETLDYVQN DYLPSAVNTA SDPESGDLFG LPLFPDYPVM HYRKDLVEDA GYDPDGENWA TEPMSWQEFA EMAADVWEQN GGPGGDFDYG FTTQGDNYVG LACCTFNETM TSFGGAYFGD HENLFGPIGD RPITVNEEPV HDTIRMMRSF MEGPDAEYAH PDFPQISTTD LLSFTEEPSR EPFTSGNAIF HRNWPYAIPL NLDSEEFSAE DYDVMPLPYG IEAGEGEYEG TGGAAAPRRR SAAGTSRSTR TPRGSTTAFR CSRRSPTRRS
|
| |