Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1244 |
Symbol | |
ID | 8383519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1216792 |
End bp | 1218558 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644972303 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003130153 |
Protein GI | 257052320 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.863592 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTCCT CGGATACGCC CAGTGGTCTC TCCCGCCGCA GTCTACTTGG CGGTGCCAGC GCTGGCTTCG CCGCTTCGAG TGCCGGCTGC CTCCAGTACG CCCGCAGTCT CGTCGATCGA GAGTCCCCCA AGCAGGTGTC AGTGCGGATC AAGACTGTCC CGGCAGACGA AGACGAAGCA GCGGTCCAGA TTGCACGGGC CTTGCAGAAA AACATGGAGA CGGTCGGCAT CGACGCGTCG ATCGTCCTGA TGACCGGAGC AGAGCTGCTT CGGGACGTGC TCCTCAACGA ATCGTTCGAC ATCTACGTGA GTCAGTACCC CTCACATCAC GATCCTGATT TCCTGCGACC GGCGCTACAC TCGACGTTCG TCACCGAGCA GGGATGGCAA AATCCGTTCG GCATCTCCGA TCTCGATCTC GACGAGCAGT TGACCGAACA GCGAACGGCC GTCGGAGCGG AGCGACAGCG TGCCGTCGCC GGCGTCGTCG GGTCCGTCAC GGAGATCCAG CCGTTCGCGA TGGTGTGTTT CCCCACGCGA ATCACCGTCG TCCGAAACGA CCGCTTCACC AACTGGAACG GCCTTGAGGA CCCGATCAAC TACCTGGCGT TGCGACGGAC CGAAGACGCC CCCGACGGCG AACCCACGCT CCGGATCGCG TTAACGGACG ACCGAATCAC GAAGAACTAC AATCCCCTTG CCGTGGAGTA CAGGCGGCCG AACCCCCTGA CTGACCTCCT TTACGATCCA CTCGCTCGCC GTTCCGATGG AGAGATCCAA CCCTGGCTGG CTGGTGACGT CACGTGGCAG TGGACCGAGG ACACAGCCGT GACGGCGACC GTCAAACTCC GGGACGGGCT GACCTGGCAC GACGGCCAAC CCCTGACCGC AGCCGACGTC GCCTTTACGT ACCGATTCCT CGACGACACG AGCCTCGGGA CAGGAAACAT GAACGTCCCC TCGCCCCGGT TTCGGGGGCG GACCTCACTG GTCGAGTCGG TCGAACGCCT CGACGCCCAG ACAGTCCGGT TGGAGTTCGG TGACACGGCG GAAGCCGTCG CGGCCCGTGC GCTGACTGTC CCGATCCTGC CGTCGCACGT CTGGGAGGAA AAATCAGCGG CCACGAACAT CGCCGGCATC AACATTTCCG AAGGAGTCAC GCAAGCACTG ACCTGGGCCA ACCCCGATCC GGTGGGAAGC GGCCCGTTGC GATTCGAGTC TCGGACGCCG GGGGAGCGGG TCGTATTCTC GCGGTTCGAC GACCACGTTC TGGTCGGTGG TGGCCCGGAC CGCGTTTCCG TCCCCTTCGA ACGCTTCGTC GTCCAGATCG CGCCGTCGGA CACGGCAGCA GTCTCACTGG TTACCGACAG CACGGTCGAC GCAACCGGCG ATCCGATCCA CCCGAAGGTT CTCGACAGGG TGACCGAAAA CGACCCCATT GAGGTCCTCT TCGGGAACTC ACGGTCGTTC TATCACGTCG GATTCAACAC GCGGCGTGAG CCGTTCGGCA ACGTCCGGTT TCGGCAAGCA GTCGCCCGAC TCCTCGACGC CGAACACATC GCCTCGTCGG TGTTCGACGG CCATGCCACG CCAGCAGCGA CGCCCCTGGC CGGCACCGAC TGGGAGCCAC CGGAGTTCGA ATGGGACGGC ACAGATCCGG TCGTTCCCTT TGCCGGGACG GACGGTGAAC TCGACATTTC GGACGCAAGA GGCGCGTTCA GGGAAGCCGG GTATAGGTAC GACGGCGACG GGAGACTCCT GAAATGA
|
Protein sequence | MDSSDTPSGL SRRSLLGGAS AGFAASSAGC LQYARSLVDR ESPKQVSVRI KTVPADEDEA AVQIARALQK NMETVGIDAS IVLMTGAELL RDVLLNESFD IYVSQYPSHH DPDFLRPALH STFVTEQGWQ NPFGISDLDL DEQLTEQRTA VGAERQRAVA GVVGSVTEIQ PFAMVCFPTR ITVVRNDRFT NWNGLEDPIN YLALRRTEDA PDGEPTLRIA LTDDRITKNY NPLAVEYRRP NPLTDLLYDP LARRSDGEIQ PWLAGDVTWQ WTEDTAVTAT VKLRDGLTWH DGQPLTAADV AFTYRFLDDT SLGTGNMNVP SPRFRGRTSL VESVERLDAQ TVRLEFGDTA EAVAARALTV PILPSHVWEE KSAATNIAGI NISEGVTQAL TWANPDPVGS GPLRFESRTP GERVVFSRFD DHVLVGGGPD RVSVPFERFV VQIAPSDTAA VSLVTDSTVD ATGDPIHPKV LDRVTENDPI EVLFGNSRSF YHVGFNTRRE PFGNVRFRQA VARLLDAEHI ASSVFDGHAT PAATPLAGTD WEPPEFEWDG TDPVVPFAGT DGELDISDAR GAFREAGYRY DGDGRLLK
|
| |