Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4502 |
Symbol | |
ID | 8745131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | - |
Start bp | 102894 |
End bp | 104177 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646515039 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003405986 |
Protein GI | 284172604 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGAGA ATGACAAACT GACTAATGGA CTCGATAGAC GAACGTATCT CAGTGGGATT GCGGCGGGTG CTGTGGCCGG ATTAGCGGGC TGTACCGATA GCGAAGACGA CGGACTCGAA GTGCTCCACG GCTGGACCGG CGGTGACGGG GCGGCTGCGA TCGAGACGCT CACCGAGGCG TTCAAGGAGC AACACTCCGA TATCGACGGA ACCTTCGAAG CCGTCGGCGG CGACGGGAAC GTCGAACTGA ATACGGCCGT CCTGCAGCGG TTGACGAACG AGAACCCGAT GAGTTCGTTC GCCAACTGGC CGGGCAACAA TCTCAAGCGG TACGAAGGCG TCCTCATGGA TCTCGAGGAG GACGTCTGGG AGGCCGACGG ACTGAAGGAC AACATCCAGG AGCGTGCCGT CGAACTCTGT ACGTACAACG ACAAGATGCC GGCGGTCCCG GTCGGCTCAC ACCGGATGAA CAACCTGTTC TACAACACGG CCGCCTTCGA CGAGGCGGGG ATCAATGCGG AGGACCTCGG GAGTATGTCC GACCTCATGG ACGCCCTCGA GACGATCGAT CAGGACACCG ATTACATCCC GTTCGCTCAC GGGATGCGGT CAGCGTTCCT CGGCTTGCAG ACGTGGGTTC AGATCCTCTC GAGCCAGTCC GGAGTCGACG CCTACATGGA CTTTATCGAG GGCAACGGCG ACAGGGACGC GGTGATCGAC GCGCTGGAAA CGCTCCAGGA GATCCAGGAG AACTATATCT CCGACGACGC ATCGTCGATC GGCTACACGC AGGCCGCGCA GAAACTGATC GCCGGCGAGG CCGCGTGTAT TCACGGCGGA AACTGGCAGT ACGGGATGTA CCGATCCGAC GAGTACGACG TCGAGTTCGG CGAGGACTGG GACTGGATCC CCTTCCCCGG AACCGAGGGG ACGTACTTCT ACCACCTCGA CGCCTTCATC GCCCCCGGTG ACAACCCGAG TCCGGAGGAT ACGATCGAGT GGCAGAAATT CGTCGGGACG GCGGAGGCAC AGATCGAGTT CAACAATCTC AAGGGATCGG TACCGCTTCG AACGGACATC GACTCGAGCG AGTTGACGGA CTTCCTGGCG ATGACGTACG AGGACCTCCT CAATTCTGAG CGGTATCCGC CGACGCTCGC ACACGGCCTC GCCGTCGAAC CCCAACAGCA GAACGATTGT GAGGGTGCGA TCGGCGATCA CTTCATGGGT CCGTACGATG CCGATGCAGC CGCGGACGCC CTGCTCAATG CCGTCTCCGA ATAA
|
Protein sequence | MVENDKLTNG LDRRTYLSGI AAGAVAGLAG CTDSEDDGLE VLHGWTGGDG AAAIETLTEA FKEQHSDIDG TFEAVGGDGN VELNTAVLQR LTNENPMSSF ANWPGNNLKR YEGVLMDLEE DVWEADGLKD NIQERAVELC TYNDKMPAVP VGSHRMNNLF YNTAAFDEAG INAEDLGSMS DLMDALETID QDTDYIPFAH GMRSAFLGLQ TWVQILSSQS GVDAYMDFIE GNGDRDAVID ALETLQEIQE NYISDDASSI GYTQAAQKLI AGEAACIHGG NWQYGMYRSD EYDVEFGEDW DWIPFPGTEG TYFYHLDAFI APGDNPSPED TIEWQKFVGT AEAQIEFNNL KGSVPLRTDI DSSELTDFLA MTYEDLLNSE RYPPTLAHGL AVEPQQQNDC EGAIGDHFMG PYDADAAADA LLNAVSE
|
| |