Gene Information |
Locus tag | Htur_1456 |
Symbol | |
ID | 8742047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 1513870 |
End bp | 1515468 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646512032 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003403015 |
Protein GI | 284164736 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
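The coordinate and length fields above can be cross-checked against one another. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The record does not state either convention explicitly, but both are consistent with its values. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the Start bp and End bp rows of this record.
    length = gene_length(1513870, 1515468)
    print(length, protein_length(length))
```

With the values from this record, the two functions return 1599 bp and 532 aa, matching the Gene Length and Protein Length rows.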
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGCTG ATCAGAGGGA CAAGCGAACG CACAGAACGA CGCGACGCGA CCTGCTCGTC GCTCTCGGCG GTGCGAGCAC GGCAGCGCTG GCGGGCTGTT CGACGACGCT GGGAACGGAC TCGGGCTCGA CGCTCCGCGT CGGAACGCTC CGTCCGCCGC TCTCGCTCGA TCCGATCACC GCGCGGGCGA TCGGCTCGGA GCAGGCGATC GATCGGATTT TCGAGGGGCT CTACGGCTAC GGCGAGGGAA CCGATATCGT TCCCGCGATC GCGGCCGGCG AGCCGGAAAT CGCCGACAAC GATCGAGAAG TCGTCGTCGA ACTCGACGAC GGCGCGCGAT TTCAGAACGA CCGAGCGGTG ACCGCCGAGG ACGTGGTCTA CTCCTACACC GCGCCGCTCG AGGAGGACGC GCCGACGGAA TGGCTGGCGA GCCCGTTCGA CTCGGTCGAG TCCGACGGCG AGCACACCGT TCGGTTCACG CTGGCGGAGC CGTACCCGGC GCTCGAGCAC GCGCTGACAC ATCCGATCGT GCCCCGACAG GAGCGCGAGG ACGACAGGGA GGCGTTCGCC ACGAACCCGA TCGGCGCCGG CCCGTTCGAG GTCGCGTCGT TCAGCGCGGA GAAGAAGACC ACGCTCCGTC GCTGGGACGA CTACTGGGGC GAGACTCCAC CCGCGATCGA TCGGTTCACG ATGGTCTACG TCGAGTTCCC GGTGACTCAG CTGACCAGCC TCCGGACGAA TCGCAACGAT CTGATCGAGC CGGTCTCACC GCTGATCGTC GATCACGTCA GCGACGTCGC GAACGCGTCG GTGAAGCGCC AGCAGGGATA CACGTCGTTT TACTTCGGCT TCAACTGCAA CGAGGGGCCG ACGACCGACC CCCGAGTGCG AGAGGCGATC AGCTACTGCA TCGACCTCGA GAAGGCGGTC TCCGAGTTCG TCGAGCCGAT GGGCCAGCGT CAGTACAGCC CGCTGCCGCC GCAGGTCGCC GAGGAGTGGA ACATGCCGAC CGACGAGTGG GCCGAACTCG CGAACGAGCA GAACCCCGAA CGCGCCCGTG ACCTCTTTCG CGAGGCCGAC GCGGCCAGCG GTCAGCTTCG CATCCTGACC TCGACGGATC CGAAACACAA AGAGTTCGGC GAGGCGCTCG CCGGCGGCCT CCGGGATGCC AGCCACGGCG CGCTCACCAT CTCGACGTCC GAAACGAAAT TCCTCGAGCG GCACGTCACC GGCTCCGAGC GCGACTACTC GGTGTTCGTC GGGGAGATCA CGGGGACGCC CGATCCGGAC ACCCATCTCT ACCCGACGTT CCACGAGAAC ATGACCGGCG TGACGAACGG GACCTTCTAC CGCGAGGACG CGGTCATGGA ACGGCTCGCG TCGGCGCGAA CGACGACGGA TCGCGAGCAG CGGCGCGATC TCTACGAGAC GGCGATCACC CGATTGCTCG AGGATCGCGT CTGCCTGCCG ATCTGCTCGT TCGAGAACAG CTTCGCCGTG GATGCGGGCG TCGAGAACTT TCGCGTCCAC CCGATCGCGC GGGTCAATCC CCGGCTCGTG TGGGAGGACG GCGTCGTGAC AGTGGGGTCG GAATCATGA
|
Protein sequence | MMADQRDKRT HRTTRRDLLV ALGGASTAAL AGCSTTLGTD SGSTLRVGTL RPPLSLDPIT ARAIGSEQAI DRIFEGLYGY GEGTDIVPAI AAGEPEIADN DREVVVELDD GARFQNDRAV TAEDVVYSYT APLEEDAPTE WLASPFDSVE SDGEHTVRFT LAEPYPALEH ALTHPIVPRQ EREDDREAFA TNPIGAGPFE VASFSAEKKT TLRRWDDYWG ETPPAIDRFT MVYVEFPVTQ LTSLRTNRND LIEPVSPLIV DHVSDVANAS VKRQQGYTSF YFGFNCNEGP TTDPRVREAI SYCIDLEKAV SEFVEPMGQR QYSPLPPQVA EEWNMPTDEW AELANEQNPE RARDLFREAD AASGQLRILT STDPKHKEFG EALAGGLRDA SHGALTISTS ETKFLERHVT GSERDYSVFV GEITGTPDPD THLYPTFHEN MTGVTNGTFY REDAVMERLA SARTTTDREQ RRDLYETAIT RLLEDRVCLP ICSFENSFAV DAGVENFRVH PIARVNPRLV WEDGVVTVGS ES
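The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction comparable to the GC content row, and confirm that a CDS begins with a start codon and ends with a stop codon. The helper names are hypothetical. The stop codons are those of translation table 11, and the start codons are only a common subset of the starts that table allows, so treat the start check as an assumption.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(
    seq: str,
    starts=("ATG", "GTG", "TTG"),   # assumed subset of table 11 start codons
    stops=("TAA", "TAG", "TGA"),    # stop codons in translation table 11
) -> bool:
    """True if seq is a whole number of codons framed by a start and a stop."""
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


if __name__ == "__main__":
    # First two blocks of the gene sequence, used here only as a small demo.
    demo = clean_sequence("ATGATGGCTG ATCAGAGGGA")
    print(len(demo), gc_fraction(demo))
```

Passing the full Gene sequence value through `clean_sequence` should give a string whose length equals the Gene Length row, which is a quick way to detect truncation when the record is copied.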