Gene Information |
Locus tag | Htur_4488 |
Symbol | |
ID | 8745117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 82550 |
End bp | 83854 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 646515025 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003405972 |
Protein GI | 284172590 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
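
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. Both are assumptions about how this record reports coordinates, and the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 82550, End bp 83854
length = gene_length(82550, 83854)
print(length)                  # 1305, matching "Gene Length"
print(protein_length(length))  # 434, matching "Protein Length"
```

Under these assumptions the reported gene length (1305 bp) and protein length (434 aa) agree with the coordinates.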
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.395657 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAACA ACCGAATATC GTCGAAATCA CGTCGTAGAT TCATCAAGAC TGCCGGTGTC GCGGGCACCG CTTCACTCGC CGGCTGTATC GGTGGCGGCT CTAGCGGCTA CACTATTGCG TTCTGGGAGC TGTTCAGCGG TGGCGAGGGA CCCGTGATGG AGGACATCGT TCAAAAGTTC AACGAGGAAC AGCCACTCGA CGTGGACGAG GAGGTGACGA TCGACCGTCA GCGAACGCCG TGGGACCAGT ACTACAATAA CCTGTACACT ACGCTCGCGG GCGGTAGCGG ACCCGATCTG GCAGTTATGC ATGCAGCATA TCTGCGTGCG TGGGACGATA CGATCGTGCC GATGGACAAC TATATCGATA CCGGGGAAAT CGAAGGCGAT TACCTCGATA ATCACTGGGA CCTCGTGTCG GTCGAAGGAG AGACGCGAGC GCTTCCGATG GACCTGCACC CGGTTGGGAT GTACTACAAT AAGGCGATCT TCGAGGAAGC GGGCCTCGAC CCCGAGTCCC CGCCGACGAA CTGGGAGGAG TTCCAGGCCG CGGGGAACGC GATCGCCGAA GAAACCGATA AATGGGCGTT CAGCCAGACC CCCTACAACG ACGGGTTCGG CTCGTGGCGA ACGTGGAGTA CGCTCGTCAA GCAACAGGGC GGAGAGCTCT TCGACGACGA CTGGAATCCG ACGTTCGACG GTCAGGCGGG CCAGGACACC TCCGAACTGT TCTGGAATAT GACCGGCGAT ATGGAGTGGT CCCCCCAGAC AACCGAAGCG GACTGGGGCG CGAACGCCTT CGAGAACGGA AACCTGGGCA TGACGATGAA CGGGACGTGG TACGTGGCCA CCCTAGAGGA GTCGGACATC GACTGGGGAT TCTTCAAGCC GACCATCGCT CCCAACAGGA CCCAGGATCG GGTCTGGACG GACGGTCACA CGATCGTGTT ACCCCGGAGC CAGGACCGCG GCGACGACAA ATCCGAAATC GCGGCGAAGG TCGCCCACTG GTTGACGACC GAGAACCCCG AGTGGGGTGC GCGAGCGGGT CACCTTCCCG CCGCCGCGGA CATCAGGGAG TCCGACGTGT TCCAAAACGC GTCGTTCTAC GACAAAACCC TCAGCAAATA CCTCGAGATG GCCGAAGAGG ACATGTACTT CTACCATCCG AAGGTGCCGA ACGGCGACCC CAACTCGGAG AGTTGGTACC AGTGGCTGCT CGACATGTGG GGCCACAACT ACGAGAGTCC CCAAGAGGCA CTCAACGACG GTGTCTCGAT GATCAGTAAC GGCCTCGAGG AGTAA
|
Protein sequence | MTNNRISSKS RRRFIKTAGV AGTASLAGCI GGGSSGYTIA FWELFSGGEG PVMEDIVQKF NEEQPLDVDE EVTIDRQRTP WDQYYNNLYT TLAGGSGPDL AVMHAAYLRA WDDTIVPMDN YIDTGEIEGD YLDNHWDLVS VEGETRALPM DLHPVGMYYN KAIFEEAGLD PESPPTNWEE FQAAGNAIAE ETDKWAFSQT PYNDGFGSWR TWSTLVKQQG GELFDDDWNP TFDGQAGQDT SELFWNMTGD MEWSPQTTEA DWGANAFENG NLGMTMNGTW YVATLEESDI DWGFFKPTIA PNRTQDRVWT DGHTIVLPRS QDRGDDKSEI AAKVAHWLTT ENPEWGARAG HLPAAADIRE SDVFQNASFY DKTLSKYLEM AEEDMYFYHP KVPNGDPNSE SWYQWLLDMW GHNYESPQEA LNDGVSMISN GLEE
|
| |
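
As a rough illustration of how the gene and protein sequences relate, the sketch below translates a DNA string codon by codon and computes its GC percentage. It uses the standard codon assignments, which translation table 11 shares for sense and stop codons (the tables differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the 10-base spacing used in the record is simply stripped before processing.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence from the record
print(translate("ATGACGAACA ACCGAATATC GTCGAAATCA"))  # MTNNRISSKS
# Last 15 bases, ending in the TAA stop codon
print(translate("GGCCTCGAGG AGTAA"))                  # GLEE
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record, and `gc_percent` on the same string can be compared against the reported GC content.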