Gene Information |
Locus tag | Htur_4399 |
Symbol | |
ID | 8745027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 667896 |
End bp | 669263 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646514936 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003405883 |
Protein GI | 284167605 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
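The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal check, not the database's own method. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions agree with the values in this record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(667896, 669263)
print(length)                  # 1368
print(protein_length(length))  # 455
```

Both results match the Gene Length and Protein Length fields of the record.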
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0609802 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAACTA GTGGCAGGGC AACGAATCGG AGTGATAGGT CCGAGAATGG CGATTCCGCA GCGATCACGA AGCACAGTTC GCGACGTCGA TTTCTGGCCG GCGTCGGCGG TGCGGCGGCC GTAACCGGGC TCGCAGGTTG CGTCGGTGGT GGCGGTAACG ATAGCGACGT CAGCATCATC GCCGTCGAGG GCGAAGGCCG ACTCGTGGAG AATTTGATCG ACGACTACGT CAGGGACGAG ACGGATCTCT CCATCGACGT CACGTTATTC CCGTACGCGA ACCTCTACGA ACGCGTCAGC AGCGTCCTCA CGACCGGCGG AACGGGGTAC GACGCCATCC TCATGGACGA CACGTGGTTC CCGCAGTTCG CGGCGAACCT CGATCCGCTC GAGCAGTGGC TTCCCGACGG GCTGCCCACG GAGCAACTCA TCGACACGAC GGTCGACATC ACCACGTGGC CGACGCCCGG CGCCCCGAAA GTTCCGTCCG CCGAGGATAT GGACGAGAAG ATCCGCGGGC AGGTCGTCGT GGGGAACACG CAGATGTTCG TCTACAACAC CGCCTACTAC GAGGAGGTCG GTGAAGAGGA GCCGAAGACG TGGGACGATG TGTTACGCGC CGGGCAGAGC ATCGACGAGG AGATCGCGGA CACGAACGGG TACGTGATCC GCGGTCAGCG CGGCAACCCG GCGAACACGA ACTTCATGAG CATCGGGTGG TCGAACCTCG GAGACATGTT CGACGAGGAC TGGCGGTACC AGTGGGACTC CAGCGAGGGC GAGGACGTTG TCAGTTTCTT CGTCGACGAT CTGCGATCGA TCTCGCCGGA CGGCGTCGGA TCGTTCAACA GCGATCAGGT GCTGAATCGG ATCGGCGAGG GCTCGGCCGC CCAGGGGATG GCGTGGCCGG CCGCGGCGTC GACGCTGCTC GACGACGACA CCGCAGAAGC CGACAATCTG GAGTTCATTC CGATCCCGGA AGGCGAGGTA CAGCAGGCGC CGATGCAGGG CAACTGGCTG CTCGGTATCA ACTCGAACAT CTCCGACGAC CGGAAGGAAG ACGCCGGCAC GGTCATCCAG TCCATCATCT CCAAGGAGGC ACAGGACCGC TACGTGGAAC TCGGCGGCGT CCCCTTCCGC CACGACACCT TCGAGGACAA CATGGACGCC GAGCCGTGGT ACGAAGCGCT GTACGAGAGC CTGCAAAACG CCAAACCGCG GCCGCGGACG CCCCTCTGGA ACGAGATCGA CGTGACCCAA GGGGAGTACC TCAACAGCGC ACTGACCGGC GACATGAGCC CGGCCGAGGT CGTGAGCGAA ACCAAGAACG AGGTCGAGTC GATCCTCGAA AACGCGGGAT ACTACTAG
|
Protein sequence | MPTSGRATNR SDRSENGDSA AITKHSSRRR FLAGVGGAAA VTGLAGCVGG GGNDSDVSII AVEGEGRLVE NLIDDYVRDE TDLSIDVTLF PYANLYERVS SVLTTGGTGY DAILMDDTWF PQFAANLDPL EQWLPDGLPT EQLIDTTVDI TTWPTPGAPK VPSAEDMDEK IRGQVVVGNT QMFVYNTAYY EEVGEEEPKT WDDVLRAGQS IDEEIADTNG YVIRGQRGNP ANTNFMSIGW SNLGDMFDED WRYQWDSSEG EDVVSFFVDD LRSISPDGVG SFNSDQVLNR IGEGSAAQGM AWPAAASTLL DDDTAEADNL EFIPIPEGEV QQAPMQGNWL LGINSNISDD RKEDAGTVIQ SIISKEAQDR YVELGGVPFR HDTFEDNMDA EPWYEALYES LQNAKPRPRT PLWNEIDVTQ GEYLNSALTG DMSPAEVVSE TKNEVESILE NAGYY
|
| |
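The sequences above are printed in space-separated blocks of 10, so they need to be cleaned before use. The sketch below strips the spacing, computes the GC fraction, and translates a coding sequence. It is an illustration under stated assumptions rather than the pipeline that produced this record: the helper names are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons (table 11 differs in the start codons it permits, which this sketch does not model).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGCCAACTA GTGGCAGGGC AACGAATCGG"
print(translate(prefix))  # MPTSGRATNR
```

The translated prefix agrees with the first ten residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.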