Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4449 |
Symbol | |
ID | 8745078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 32951 |
End bp | 34765 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646514986 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003405933 |
Protein GI | 284172551 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGAGG GTAACAACTG GCGGAACAGC GGTCCTAGCC GACGAGATGT CCTGAAATAC AGCGCTGTGG GTGGCACAGC CCTCGTTGCC GGATGTCTCG GTGGGAGCAG TAGCACAGAT CGATTCCGAG TGTTCGACCC GGAGACGAGC GGGACGCTGC CGTCCCAGCG TCACTGCAAC CCCTACAACC CGACCCAGCG CGGGACGTGG CACCCAGGGG CGCTCATCTT CGACCGACCC GTGATGTACA GTCCGGCGGA GGACGCGGTC TATCCCCTGA TCGCGACCGA CTGGGAGATG GCCGACGACA CCACGCTGGA GATGACGTTC AGCGAGGACT GGACCTGGCA CAACGGCGAC GACCTCACCG CCGAGGACTG GGTCATGCAG CTGCAGATGT CCCTCGCGAT CCTCGAGTAC CAGGCGGAGG ACGGGGCGCG ACCACACCAG TTCATCGAGT CCGCCGAGGC GCCCGACGAG TACACGGCGC AGATCAACTT CCACGACCCG CTCTCGGAGA CAGTCGCGGT CCAGAACGCC ACCGCCGATC TCGTGGGTGA CGAGAGTCGC GGCATCTTCA CCAATTCCTC GGACGACCAG TGGACCGACT GGCACGAGCA GTTGCAGAAC GCCGACGACT CCGGGATGGA GTCCATCCTC GAGGAGATCA CGTCGGAGGG GTATCCGAAC CTCGAGGACG CGAACGGGAA CGGTCCGTTC CAGGTCGCCG ATATCGGGGA CAACGTGATG GTCTTCGAGA AGTACGAGGA CCACCCGAAC GCCGACAACA TCAACTTCAG CGAGTACTCG ATGCACCTCT ACGAGAACAA CAACCCGACC CAGCCGTACG CAAACGGCGA GGTCGACGCC GCGCACACCC AGTTCCCCGT CGAGGACGAC GTCAAGAGCC AGCTCCCCAG CGGGCACACG CTCATCAAGG AGAGCTTCTC GACCAACAAA CTGTTCACGT TCAACTGCGG CCACGACGTC TCTTACGACA CGCCCTTCTC GAGCGTGAAC GTCCGCAAGG CGGTCTGTCA CGTCTTCGAC CGCCAGCAGG TCACCGAGGT CCTCGAAGGC GTCAACCGGA TGTTCGACTG GGCGCCGTGT CGCGTTCCCG GAAACGTGCT GGACAGCGGT ACTCACGACG CCGCGGAGTG GGTCCAGGAC TTCCCCAAGT ACGGCCAGAA CGACACCGAG CGCGCTACCG AACTCCTCGA GGAGGAGGGC TACACGCTCG AGGACGGGCA GTGGTACACG CCGGACGGAG AGGAGTTCGA GATCAACATC ATGAACGGCT CCGAGCGGAA GGACTTCGGC GTTCTCAAAC AGAACCTGAA CGACTTCGGC ATCAAGACGA ACCAGGAGCA GGTCGACGAC GCGACGTTCG ACGAGCGCCG ACAGAACGGC GAGTACGACA TGATGCCCGA CGGTTCCTCG GCCAACGGCG TCCGCGCGAT GTGGGCGCTC GATCTCGTGC CGAACTGGAT CCAGTCGATC TCGCACTTCG ACCCCAACGC GGAGATCCCG ATGCCCGTCG GCGACCCGGA GGGCTCCGAG GGGACGAAGG AGATCAACGT CGAGGAGCAC ATCCGTCAGT GGCAGGTCAC CGACGATGAT CAGTACCACA AGGAACTGAT GTGGTGGTGG AACCAGACCC TCCCGGAGAT GGAAGTGATG TTCCAGCCCG ACGCCGGCGC CTACAACGCC GACAACTGGG AACTCGACGC GCCGGACGGC ATCATCGACG GCACCGAGGA CGCGCTCTAC CTCATCCCAA AGACGGACGA GGCGAGCATG GAGTACATCG GGTAA
|
Protein sequence | MPEGNNWRNS GPSRRDVLKY SAVGGTALVA GCLGGSSSTD RFRVFDPETS GTLPSQRHCN PYNPTQRGTW HPGALIFDRP VMYSPAEDAV YPLIATDWEM ADDTTLEMTF SEDWTWHNGD DLTAEDWVMQ LQMSLAILEY QAEDGARPHQ FIESAEAPDE YTAQINFHDP LSETVAVQNA TADLVGDESR GIFTNSSDDQ WTDWHEQLQN ADDSGMESIL EEITSEGYPN LEDANGNGPF QVADIGDNVM VFEKYEDHPN ADNINFSEYS MHLYENNNPT QPYANGEVDA AHTQFPVEDD VKSQLPSGHT LIKESFSTNK LFTFNCGHDV SYDTPFSSVN VRKAVCHVFD RQQVTEVLEG VNRMFDWAPC RVPGNVLDSG THDAAEWVQD FPKYGQNDTE RATELLEEEG YTLEDGQWYT PDGEEFEINI MNGSERKDFG VLKQNLNDFG IKTNQEQVDD ATFDERRQNG EYDMMPDGSS ANGVRAMWAL DLVPNWIQSI SHFDPNAEIP MPVGDPEGSE GTKEINVEEH IRQWQVTDDD QYHKELMWWW NQTLPEMEVM FQPDAGAYNA DNWELDAPDG IIDGTEDALY LIPKTDEASM EYIG
|
| |