Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_1855 |
Symbol | |
ID | 8742449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 1929602 |
End bp | 1930927 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646512433 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003403413 |
Protein GI | 284165134 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCAG ACTCGGGTGT CGAGGGGATC ATACACAGCG ACGGGGACGG GCCCTCGGTC CAGCAGGCGC TCTGGGACGC CGGTCTCGAC GACGACATCA GCCTCGAGAT TCAGACCGTC GTCAGCGACT CCGCGTCGCG GATGCAGACG GCCCAGTCGG CCCTCGAGGC GGGTCGCGCC CCGCCGGACA TCCACATGAT GGACAGCGGC TGGACGGTCC CGTTCGTTCT GCGCAATCAG ACGGTCAACC TGACCGAAGA GCTCCCCGAG GAGACGGTCT CGTTCGTCAA CGAGAACTAC CTCGACGCGA TCCTCGAGAC GGCTCGCCAC CCCGAGTCCG GCGACCTCCA CGGGCTGCCG CTGTTCCCGG ACCTCGGGTT CACTCTCTAC AGACAGGACC TGATCGAGGA CGCCGGCTAC GACACCAGTA GTTGGGGGAC GGACCCGCCG CAGTGGGAAG AGTTCGCGAA CGCGGTCAGC GACGCGAGAG ACCAGGCCGA CCTCAACTAC GGGTACACGA CGCAGGCGGC CGCCTACGAG GGGCTGTCCT GCTGTACGTT CAACGAGGTG ATGACGAGCT GGGGCGGAGC GTACTACGGC GGCGTGGACA ACCTCTTCAC CGCGGGCGAC CGCCCGGTCA CCGTCAACGA ACAGCCGGTC ATCGACGCGA TTCGAATGAT GCGCTCGTTC ATCGAGGGCG AGAATCAGAA CACTCTCGAC GGCTACGCCC AGATCAGTCC GTCGCCGATC GTCCAGTGGA CCGAACAGGA GTCGCTCAGC CCGTTCGACG CCGGCAACGC CGTCTCGAAC CGGAACTGGT CGTTCGCGAT CGCCCAGACC GGCGCGGAGG AGGCCTTCGG TGAGGACCTC GGCGTCACGA CGAGTCCGGT CGGGGTCCCC GAGGAGGAAG CCGAGTTCGA AGGCACCGGC GGCACCGCCG CGGCGCTCGG CGGCTGGAAC CTGGTCGTGA GCCCGTTCTC GGATCGCAAG GAGGAAGCGC TGCAGGTCCT CGAGGCGTTC GCCAACGAAG AGGTGATGCT CACGATCTTC GAACTCGGGG GATACCTCCC GCCGAATCTC GACCTGGTCG CGGAGGCCAG CCCGGACGAC GTCGGCCCGG TCGCCCGCTA CGGCGACGTC GTGCAGGCGG CCAGCGACAA CGCGATTCCG CGGCCGGCGA CCGACCTCTG GCCCGAGCAG TCGGCGCTGA TCTATCAGTC GGTCAATTCG GCCTACCGCG GCGCAAATGC GCCAGAGGCG GCGATGAACG ATCTCGCGGA AGAACTCCAG CAGAGCGAAT CGGAGGTGCA AACGAATGGC AACTGA
|
Protein sequence | MTADSGVEGI IHSDGDGPSV QQALWDAGLD DDISLEIQTV VSDSASRMQT AQSALEAGRA PPDIHMMDSG WTVPFVLRNQ TVNLTEELPE ETVSFVNENY LDAILETARH PESGDLHGLP LFPDLGFTLY RQDLIEDAGY DTSSWGTDPP QWEEFANAVS DARDQADLNY GYTTQAAAYE GLSCCTFNEV MTSWGGAYYG GVDNLFTAGD RPVTVNEQPV IDAIRMMRSF IEGENQNTLD GYAQISPSPI VQWTEQESLS PFDAGNAVSN RNWSFAIAQT GAEEAFGEDL GVTTSPVGVP EEEAEFEGTG GTAAALGGWN LVVSPFSDRK EEALQVLEAF ANEEVMLTIF ELGGYLPPNL DLVAEASPDD VGPVARYGDV VQAASDNAIP RPATDLWPEQ SALIYQSVNS AYRGANAPEA AMNDLAEELQ QSESEVQTNG N
|
| |