Gene Information |
Locus tag | Huta_2585 |
Symbol | |
ID | 8384890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2649797 |
End bp | 2651608 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644973660 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003131480 |
Protein GI | 257053647 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
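The start and end coordinates above agree with the reported gene and protein lengths. This holds if the coordinates are 1-based and inclusive and the last codon is a stop codon; the record states neither, so both are assumptions. A minimal Python sketch of that arithmetic follows (the function names are hypothetical, chosen for illustration):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(2649797, 2651608)
print(length, protein_length(length))
```

With the values from this record, the sketch gives 1812 bp and 603 aa, matching the Gene Length and Protein Length rows.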
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAGTG AACAGTCTTC AGGGAGTTTC GAGGACGTAG TGAATCGCCG CAACTTCATG CGGCTGGCAG GCGCGACCGG CGCGACCATG CTTGCCGGGT GTCAAGGTGA GGAGACAGAC ACGGAAACGG AGAGCGGTGG AAACGGTGAC ACCGAGACGG ACAGCGGTGG CGACACCGAC GTCTACGACG TCACGGTCGA ATCCGCTGCC GCCCGTGGGA TGGACCTGCT CGACTGGAAC GCCCAGTTCG CCGGGTGGCC GAACATCTGG GGGCGCTGGC TCGCTTACGA GCGCTATGCC CAGTACAACA TGACGGAAAA CGAGTGGATC CCCCGGCTCA TCCAGGACTG GTCGGTCGAC GGGACGACTG TCACCCTCAA CATCCGTGAG GATCACACCT ACGCGAACGG CGACCAGGTG ACGGCCGACG ACATCAAGGC GAACACCGTG ATGAACCTCG CGACGGGGGC CGCGTTCGCG GATGTCTTCG ACTCGTTCAA CGCACCTGAC GACAAGACGC TCGAGATCGA GACCACAAAG GAAGTCAATC CGGATATCCT CGAGTTCACG CTCCTCTCCC AGCTCCAGCA GGCGAAGATG GCCGAGCCCT ACGACGAGCT CTATCAGCGC TACTGGGAGG ACGAGGAGGA GGGCGTCTCC AGCGACATTC AGGGTCGTGA GCCCAATTCG CCCGACTACG TCTCCGGGAT CTTCGGGCAG CAGAGCAAGG ACGACGAGCA GTACATGATG AACCGTAACC CCGAGCACCC TGACGCCGGG AACGTCAACT TCGAGCGTTA CCGGTTCCCG AACTACCCGG GCAACGAGGG TAATTGGGAA GCCATGATCG GTGACGATAT TGACACGATC ATGAGCGCGT TTACTCCGTC GAACATCCGG GCGGAGCTTG CCGACCACTG GCAGGAATAC AACTTCCCCG GCTACTGGGG TGTCGGCTAC GTGTTCAACC ACGACGAGGA AGCCGCGCCG CACATCTCGA AGCGATCGGT CCGACAGGCG ATCACGCACG CCATCAACCG GGAGGACGTC GTCACGGCCG CGGGCGCAGA GATCAAGGAG GCCTTCCCCA CTCCGGCCGC CGTCTCCGCG AACGTCCAGG ACGAGTGGCT CGACGTGGGT GGCCAGTTCC CCGCGATGCA GGGTGGCGCA GAGCAGGCCG CCGAGATCAT GCAGGACGCC GGCTACGAGA AGAACGGCAA CGGCAACTGG GCCATGGACG GCGAGACCGT CACGTTCCAG GTCGTCGTCC CGAGTAGCTG GAGTGACTGG GTCACGGCGA CCCAGGCTGT GGTCCCACAG CTCCAGGACG CCGGGTTCGA CGCGGAGATG AACCGTGTCG ACAACATCAA CACGGTCGTC GGAGAAGGGA ACTTCAAGAT GGCGGCCCGT CCGTGGTCGC CCGGCAACGC CCGCTCCTCG CACCCGTACT TCCCACTGAA CTGGGTCTTC GGGCGCGCGT ACAACAACGC CCACAGCTAC CCGGGTGCCG AGGCTGGCAC CGAGATCGAG GTTCCGGCGA TGGACGGAGA CGGTACCATG TCGGTCGACG TCCAGCAGCG CCTCGACGAC CTCTCGACCT CGAGCGGCGA GGAAGCCCAG TCCATCGTTC AGGAGCTCGC GTGGGTCTCC CACCAGGACC TCCCGTACCT GCCGATGATC AACAAGGTAG AGCAGTCGTG GATCAGTACG CGGCGGCTGT CGGCTCCCGA AACTGACGAC CCAGCAGGCA ACGTCAAGTG GCCGACGTTC TACGCCCCGC GTGTCGGAAA GATGCAGTGG 
CAGGGCGAGT AA
|
Protein sequence | MASEQSSGSF EDVVNRRNFM RLAGATGATM LAGCQGEETD TETESGGNGD TETDSGGDTD VYDVTVESAA ARGMDLLDWN AQFAGWPNIW GRWLAYERYA QYNMTENEWI PRLIQDWSVD GTTVTLNIRE DHTYANGDQV TADDIKANTV MNLATGAAFA DVFDSFNAPD DKTLEIETTK EVNPDILEFT LLSQLQQAKM AEPYDELYQR YWEDEEEGVS SDIQGREPNS PDYVSGIFGQ QSKDDEQYMM NRNPEHPDAG NVNFERYRFP NYPGNEGNWE AMIGDDIDTI MSAFTPSNIR AELADHWQEY NFPGYWGVGY VFNHDEEAAP HISKRSVRQA ITHAINREDV VTAAGAEIKE AFPTPAAVSA NVQDEWLDVG GQFPAMQGGA EQAAEIMQDA GYEKNGNGNW AMDGETVTFQ VVVPSSWSDW VTATQAVVPQ LQDAGFDAEM NRVDNINTVV GEGNFKMAAR PWSPGNARSS HPYFPLNWVF GRAYNNAHSY PGAEAGTEIE VPAMDGDGTM SVDVQQRLDD LSTSSGEEAQ SIVQELAWVS HQDLPYLPMI NKVEQSWIST RRLSAPETDD PAGNVKWPTF YAPRVGKMQW QGE
|
| |
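The sequences above are displayed in space-separated blocks of ten characters, so whitespace has to be removed before any length or composition check. The following is a minimal Python sketch of that clean-up, with a simple GC-fraction helper. The function names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record:

```python
def normalize(seq: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    bases = normalize(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence shown above.
first_blocks = "ATGGCTAGTG AACAGTCTTC"
print(len(normalize(first_blocks)))
print(gc_fraction(first_blocks))
```

Applying the same helpers to the complete gene sequence would allow a check against the Gene Length and GC content rows.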