Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4181 |
Symbol | |
ID | 8744809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 449626 |
End bp | 450864 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646514729 |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_003405676 |
Protein GI | 284167398 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGAGA GTAGCAACCG GATAGCCACT GATCGGGCGT CTCAGTCGTC GATATCGAGA CGCCAGTATC TAGCGAGCGG TGCAGCAGTA GGTGTCGGAG CCGCACTCGG TGGCTGTCTC GGTAGCGATG ACGGATTGAC TGTGGGATAC GCGCTGCCGT TTACCGGTAC CTACGCGATG CTCGGCGAGA GTATCGTCAA CGGGTTCGAG CTGTACCTCG AGCAGAGCGA CGGAGAGATC GACGGTCAGG AAGTCGAGAC TGTCGAGCGC GACACCGAGG CTGACACGAA CCGTGGTGTC GACGTGACCC GAGAGCTCAT GGTCGAAGAA CAGGCCGATG CGATCGTTGG GCCGGTCTCG AGCAGCGTCG CGATCGCCAT GATGCAGACG ATCGAAAACG AGGCAAGCGC GATCTGGCTG AACGCGAACG CGGGCGATTA CCGCGTCGTG GAGGAGGGCT GTCTGAACTA CCACTTCCGC ACGTCGTTTA ACGATTGGCA GACGAGCGCA CCACTCGCGG AATGGGTGTA CGATAACGTC GCAGACAACG TCTGTCTGGC CTACGCCGAC TACGCGTTCG GGCAGAACTC GAAGAACTTC TTCCAGAAGG CGTTCGAGGA GGCCGGTGGC GAGGTCGTCG ACGAAGTCGG TGCTCCGTTG GGGACTGACG ACTACTCGAC GTACTTGAGC GACATCGAAA GCAGCGGTGC CGACGCGGTG TTTTCATTCT TCGCGGGGAG CGACGCCGTA AACTACATTT CGGACTTCGC CGACTACGGT CTCGACGCGG AGATGACTCA GGTCGGGAGC GGGTTCCTGC TGTCGGAGGA CACGCTTCCC GCACAGGGCG AAGCGGCGTT GGGTATGTAC TCACTGCTCC ACTACACGCC GACCCAGGAG ACCGACCGGA ACCAAGAGTT CATCGGGAAC TATCGCGAGG CGTACGATAG CTCGGCGAAT GTCTATGCCT GTCAAGGGTA CGATTCCGCG CAGGCGTTCG CCGCCGCCGT CGCAGAAGCG GGCAGTTCGG ACCCCGACGA GATGGCCAAC GCGCTCGGCG GAATGGAGCT CGACAGCCCG CGGGGCAGTT TCCGGTTCAA CTCCGAGACG CACGAGGCGA TTCAGAACAT GTACGTCCGA GAGGTCGTCG AAAGCGACGG GGACCAACCC GTCGAGAATC AAGTCGTCGA GACGATCGAC TCGGTCGAAA GCCCTAACTG GGGATGCAAC CTTGAGTAG
|
Protein sequence | MGESSNRIAT DRASQSSISR RQYLASGAAV GVGAALGGCL GSDDGLTVGY ALPFTGTYAM LGESIVNGFE LYLEQSDGEI DGQEVETVER DTEADTNRGV DVTRELMVEE QADAIVGPVS SSVAIAMMQT IENEASAIWL NANAGDYRVV EEGCLNYHFR TSFNDWQTSA PLAEWVYDNV ADNVCLAYAD YAFGQNSKNF FQKAFEEAGG EVVDEVGAPL GTDDYSTYLS DIESSGADAV FSFFAGSDAV NYISDFADYG LDAEMTQVGS GFLLSEDTLP AQGEAALGMY SLLHYTPTQE TDRNQEFIGN YREAYDSSAN VYACQGYDSA QAFAAAVAEA GSSDPDEMAN ALGGMELDSP RGSFRFNSET HEAIQNMYVR EVVESDGDQP VENQVVETID SVESPNWGCN LE
|
| |