Gene Information |
Locus tag | Htur_1813 |
Symbol | |
ID | 8742407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 1887698 |
End bp | 1888933 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646512391 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003403371 |
Protein GI | 284165092 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
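The coordinate and length fields above agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the reported 1236 bp) and that the reported protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming it ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1887698, 1888933)
print(length)                  # 1236
print(protein_length(length))  # 411
```

Both values match the Gene Length and Protein Length rows of the record.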
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTTAAGG GTGCAGGGAT AGCAGGGACG GCGAGTCTCG CGGCTCTCGC GGGCTGTACC GGCGGTGGCG GCAGTACGCT CGAGGTCCTC CACGGATGGA CCGGCGGCGA CGGCGCCGAG GCCGCCGACG CGCTCTTCTC GGCGTTCGAA GAGGAGCACT CGGACGTCGA CTACAACGAG AAGCCGATCG GTGGCGGTGG GAACACGACG CTCGACCAGA CGGTCGCCAA CCGCCTCCAG GGCGGCGACC CGCCGAGTTC GTTCGCCGGC TGGCCGGGTG CGAACTTAGA GCAGTACGAG GACGCCGTCG GCGACATCGA GTCGGAGGTC TGGGACGAGG CTGGCCTGAA GGACGCCCAC GTCCAGGAAG CGGTCGAACT CTGCCGGCAC AACGACGGCT TCTCGGCGGT CCCGCTCGGC TCCCACCGCC TGAACGACCT CTTTTACAAC GTCGAGGTCG TCGAGAGCGC CGGCGTCGAT CCGAGCTCGA TCGACAGCGC CGACGCGCTG ATCGACGCGC TGGACGCCGT GGAGTCGGAG ACCGACGCGA CGCCGTTCGC GTTCTCGCTC GCGCCGTGGT GTATCCTCCA GACGTGGGCG CAGACGATGC TCGGCGAACA CGGCTACGAG GCCTACATGA ACTTCATCGA GGGCAACGGC GACGAGAGCG CCGTCCGCGA CACCTTCGAG AAGCTCGAGC AACTCCTCGG CTACATTAAC AACGACGCAG CCTCCGTCGA CTTCACCGAG GTCAATCAGG ACATCATGAG CGGCGACGCC GCGTTCATCC ACCAGGGCAA CTGGGCCGCC GGCGCGTACA TCTCGGGCGA CCAGGATATC GAGTACGGCA CCGACTGGGA CGCGATCCGA TACCCCGGGA CGGAGGACTA CTACACCCTC CACATCGACT CGTTCATCTA CCCGAGCGAC AATCCGACGC CCGACGACAC CGCGACTTGG CTGCAGTTCG TCGGCTCCGA GACGGCACAG GTCGCGTTCA ACCAGTACAA GGGGTCGATC CCGACGCGGA CGGAGGTGTC CACCGACGAG TTCAACGCCT ATCTCACGGA CACGATCGAG GACTTCGATA ACGCCTCGGA GAAGCCGCCA ACGCTCGCAC ACGGACTCGC CGTGGATCCG AGCACACAGG CCGACCTCGA GGACGTCCTC AACAACAGCT TCGCGGACCC CTACGACGTC GACGGTGCGA CGAGCGGGTT CATGGACGCC GTCTAA
Protein sequence | MLKGAGIAGT ASLAALAGCT GGGGSTLEVL HGWTGGDGAE AADALFSAFE EEHSDVDYNE KPIGGGGNTT LDQTVANRLQ GGDPPSSFAG WPGANLEQYE DAVGDIESEV WDEAGLKDAH VQEAVELCRH NDGFSAVPLG SHRLNDLFYN VEVVESAGVD PSSIDSADAL IDALDAVESE TDATPFAFSL APWCILQTWA QTMLGEHGYE AYMNFIEGNG DESAVRDTFE KLEQLLGYIN NDAASVDFTE VNQDIMSGDA AFIHQGNWAA GAYISGDQDI EYGTDWDAIR YPGTEDYYTL HIDSFIYPSD NPTPDDTATW LQFVGSETAQ VAFNQYKGSI PTRTEVSTDE FNAYLTDTIE DFDNASEKPP TLAHGLAVDP STQADLEDVL NNSFADPYDV DGATSGFMDA V
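The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The sketch below is one hedged way to do that and to compute a GC fraction that could be compared against the reported GC content. The helper names are hypothetical, and the demo uses only the first two display blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two display blocks of the gene sequence, used only as a short demo.
prefix = clean_sequence("ATGCTTAAGG GTGCAGGGAT")
print(len(prefix), gc_fraction(prefix))
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` gives a value to check against the GC content row.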