Gene Information |
Locus tag | Htur_4448 |
Symbol | |
ID | 8745077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 30891 |
End bp | 32705 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 646514985 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003405932 |
Protein GI | 284172550 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
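The coordinate and length fields above are consistent with one another, and the relationship can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the reported 1815 bp) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Htur_4448.
length = gene_length_bp(30891, 32705)
print(length, protein_length_aa(length))
```

For this record the sketch gives 1815 bp and 604 aa, matching the Gene Length and Protein Length fields.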
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGACG GTAATAGCTG TCGGAATGGC GATCACAGTC GAAGAGATGT CCTAAAATAC GGCGCTGTGG GTGGCACGGC TCTCGTCGCT GGCTGTCTCG GTGGGGGAAG TAGCACGGAT CGGTTCCGGG TGTTCGATCC GCAGTCGAGC GGGACGCTCC CGTCGGAGCG CCACTGTAAT CCGTTCAACC CGACCCAACG CGGGACGTGG CATCCGGGGG CACTCATCTT CGACCGTCCC GCCATCCACA GTCCGGCGGA AGACGAGGTC TATCCCCTGG TCGCGACCGA CTGGGAGATG GTCGACGATA CCACGCTGGA GTTCACGTTC AGCGACGAGT GGACCTGGCA CAACGGCGAT CAACTCGTCG CCGACGACTG GGTTATGCAG CTCCAGATGG CGCTCGCGAT TCTCGAGCAC CAGGCCGAGG ACGGCGACCG TCCGCACCAG TTCATCGAGT CCGCCGAAGC GCCCGACGAG CAGACGGTAC AGATCAGCCT CCACGATCCG CTCTCGGAGG CGGTCGCGGT CCAGAACGCT ATCGCGGACC TCGTCGGCGA CGAGAGCCGC GGCATCTTCA CCAAACACGA CGACGACCAG TGGAGCGAGT GGCACAGCCA GCTCATGGAG GCCGACGACT CCGAGATGGA ATCGCTCCTC GAGGAGCTCA CATCGGAGGG ATATCCGTTA CTCGAGGACG CGATCGGTAA CGGCCCGTTC GAGGTCGCCG ACATCGGTGA CAACGTAATG GTCTTCGAGA AGTACGAGGA CCATCCGAAC GCTGACAACA TCAACTTCAG CGAGTACTCG GTCCACCTCT ACGAGAACAA CAACCCAACC CAACCGTACG CTAACGGCGA GGTCGACGCC GCACACACGC AGTTCCCAGT CGAAGACGAC GTCAAAAGCC AACTCCCCGG GGGACACACG CTCATCAAGG AGAGCTTCTC GACCAACAAG CTGTTCTCGT TCAACTGCGG ACACGACGTT TCCTACGACA CGTACCTCTC GAACGCGAAC GTCCGCAAGG CGGTCTGCCA CGTCTTCGAC CGCCAGCAGG TCACCGAGGT CCTCGAAGGC GTCAACCGGA TGTTCGACTG GCCGTCGTGT CGCGTCCCCG GAAACGTGCT GGACAGCGGT TCCCACGACG CCGCGGAGTG GATTCAGGAC TTCACCGAGT ACGGCCAGAA CGACACCGAA CGCGCCGCTG AACTCCTCGA ACGGGAGGGA TTCCAGCGAG ACGACGGTGC GTGGTATACG CCGGACGGTG ACCGGTTCGA GATCGACGTC CTGGGTGGGA CCAAACGGAA GGACTTCGGC GTTCTCAAGG ACAATCTGAA CGAGTTCGGT ATCGCGACGA ATCAGGAAGA AGTCGACGAC GCGACGCTCT CCGAGCGCCG ACAGAACGGC GAGTTCGACA TCGTGCCCGA CGGCTCCTCG GCCAACGGCG TCCGGGCGAT GTGGGCGCTG GATCTCGTGC CGGGTTGGCT CAGTCAGATC ACGCATTTCG ATCCCAACGC GGAGATTCCG ATGCCCGTCG GCGACCCCGA GGGCTCGAGC GGGACGAAGA CGTTCAACGT CGAGGAACAC ATTCGGGAGT GGCAGGTCAC CGACGACGAT CAGTACCACA AGGAACTGAT GTGGTGGTGG AACCAGACCG TTCCACAGAT GGAAACGATG TATCAGCCCG ACGCCGGCGC CTACAACGCC GACAACTGGG AACTCGACGC GCCCGACGGC GTCATCGACG GCACCGAGGA CGCGCTCTAC CTCATCCCGA AGATGGACGA GGCGAGCATG 
GAGTACACGG GCTGA
|
Protein sequence | MEDGNSCRNG DHSRRDVLKY GAVGGTALVA GCLGGGSSTD RFRVFDPQSS GTLPSERHCN PFNPTQRGTW HPGALIFDRP AIHSPAEDEV YPLVATDWEM VDDTTLEFTF SDEWTWHNGD QLVADDWVMQ LQMALAILEH QAEDGDRPHQ FIESAEAPDE QTVQISLHDP LSEAVAVQNA IADLVGDESR GIFTKHDDDQ WSEWHSQLME ADDSEMESLL EELTSEGYPL LEDAIGNGPF EVADIGDNVM VFEKYEDHPN ADNINFSEYS VHLYENNNPT QPYANGEVDA AHTQFPVEDD VKSQLPGGHT LIKESFSTNK LFSFNCGHDV SYDTYLSNAN VRKAVCHVFD RQQVTEVLEG VNRMFDWPSC RVPGNVLDSG SHDAAEWIQD FTEYGQNDTE RAAELLEREG FQRDDGAWYT PDGDRFEIDV LGGTKRKDFG VLKDNLNEFG IATNQEEVDD ATLSERRQNG EFDIVPDGSS ANGVRAMWAL DLVPGWLSQI THFDPNAEIP MPVGDPEGSS GTKTFNVEEH IREWQVTDDD QYHKELMWWW NQTVPQMETM YQPDAGAYNA DNWELDAPDG VIDGTEDALY LIPKMDEASM EYTG
|
| |
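The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the display format could be normalized and how a GC fraction (compare the GC content field) and the start and stop codons could be checked. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is an assumption based on translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed stop codons for translation table 11


def clean_sequence(displayed: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """True if the CDS begins with ATG and ends with an assumed stop codon."""
    return seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# The first two blocks of the gene sequence, as displayed in the record.
fragment = clean_sequence("ATGGAAGACG GTAATAGCTG")
print(len(fragment), gc_fraction(fragment))
```

Passing the full gene sequence through `clean_sequence` first lets the same functions be applied to the whole 1815 bp record, including the part that wraps onto a second line.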