Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2649 |
Symbol | |
ID | 8743262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 2718809 |
End bp | 2720011 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646513237 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003404198 |
Protein GI | 284165919 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATGA AACGCAGACC AGTACTCAAG GGGATCGGCG GTACCGTCGC GGGACTCTCG CTCGCGGGTT GCATGGGCTA CTTCACCGGG GACGACACGT CGCCGCTGTG GCACGAGTTC ACCGATTCTG AGGAGCGCAC CTTCGAGAGT CACCTCGAGA CGTTCACCGA GGAGACCGAC CACGACCTCG AGGCCTCCGG CGTCTCGAAC ATGCAAGATC AGTTAGAGAC CGCCCTCCCG GCGGGCGACG GGCCGATGAG TTTCACCTGG GCACACGACT GGATCGGCGC CCAGCATGAA GACGAGACCC TCTATGACGC ATCTGACTCG ATCGACGTCG ACCTCGAGGG AACGTACTCG GAGGCGGCGG CCAACGCGGT TCAGTGGAAG GACAACGTGT ACGGACTCCC CTACGCCGCG GAAACGGTGA CGCTGATGTA CAACAAGGAT ATGGTCGAGG AACCGCCGGA GACGATCCCC GAGATGATCG AGATCATGGA GTCGTACGAC GGCGACGACC AGTACGGCAT CGGCTATCCG GGGGACGCGT ACCACTTCAG CGCCTACCTG CAAGGGTTCG GCGGCGTGCT CTACGACGAG GACGCCGACG AACTGGGAAT CGACGACGAT GCGGTCGTCG AGGGGCTCGA ACTCGTCCGG GACAGCATCT ACGAGTACAG TCCGAACGAC CTGAACAAGG ACCCGAACCT CTCGGTCTTC CAGAACGGAA ACGCGCCGTT CGTCGTGACC GGCCCGTGGA ACCTCGGCGG GCTCCGCGAT GCGGGCATCG ACGTCGGGGT CGCGCCGCTG CCTGCGCCCG AGGGCGGAGA ACCGACGCCG TTTACGGGCG TTCAGATGTG GTACTTCACG TCTCGCCTCG AGGACGCGGA GGACGACGTC CACGACGCGG TGCTCGACTG GGCGGAGTGG TACACCACGA CCGAGGACGT CGCCACGACC AACGCACAGG ACCACGCGAT GATTCCCGTC CTCGACTCGG TCGTCGGGAG CGACGACCTC GGTTCGGACG TCGACGCGTT CAGCCAGAGC GTGGGCATGG GGATGTCGAT GCCCGCGAGC GAGAAGATGG ACGCCGTCTG GGACCCGCTC GAGTCCGCGA TCGACGTCGT GCTCGGCTCG GGCGGCGACG CCCGAGAGGA ACTGGAGTCG GCCGCCGAGC AGATCCGAGG CTCCTGGGAG TAA
|
Protein sequence | MPMKRRPVLK GIGGTVAGLS LAGCMGYFTG DDTSPLWHEF TDSEERTFES HLETFTEETD HDLEASGVSN MQDQLETALP AGDGPMSFTW AHDWIGAQHE DETLYDASDS IDVDLEGTYS EAAANAVQWK DNVYGLPYAA ETVTLMYNKD MVEEPPETIP EMIEIMESYD GDDQYGIGYP GDAYHFSAYL QGFGGVLYDE DADELGIDDD AVVEGLELVR DSIYEYSPND LNKDPNLSVF QNGNAPFVVT GPWNLGGLRD AGIDVGVAPL PAPEGGEPTP FTGVQMWYFT SRLEDAEDDV HDAVLDWAEW YTTTEDVATT NAQDHAMIPV LDSVVGSDDL GSDVDAFSQS VGMGMSMPAS EKMDAVWDPL ESAIDVVLGS GGDAREELES AAEQIRGSWE
|
| |