Gene Information |
Locus tag | Huta_2233 |
Symbol | |
ID | 8384527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2279629 |
End bp | 2281836 |
Gene Length | 2208 bp |
Protein Length | 735 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644973302 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003131133 |
Protein GI | 257053300 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
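
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage, ignoring the whitespace that groups bases in blocks of 10."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values taken from the record above (Huta_2233 on NC_013158).
length = gene_length(2279629, 2281836)
print(length)                  # 2208, matching "Gene Length"
print(protein_length(length))  # 735, matching "Protein Length"
```

Passing the full gene sequence from the Sequence section to `gc_percent` gives a value that can be compared with the GC content field; the strand field does not affect any of these calculations.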
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.039331 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGACG AGTTTTCACT CAGTCCAACC CGTCGGCAGT TGCTCGCATC GCTTGGGACT GGCGGCGTGC TTGCTGGTGG GGTTGCGGGT TTCAGGACGA TGGCGGATGT CGTCACCCCA ATAGCAGCCC AATCGCCCCA GCGCGGGACA CTCGTGGGGA CGATGACCGG CCCTACCCAG GGACTGCACT TCAACCTCTT TGGAACGTCG AGCGCGGATA TCCCCGCTGC GATGGTGTTT GACCCGACGG TGAAATACCA TCGTGGTCGC GGCGAATTCA TCCCGGCCGC CGTGACGGAG TGGACAGTCG AGGGCGAAAC CATGACCCTC TCGCTGCGAC CGGATCTTGT CTGGGACGAC GGCGATCCGG TGACGGCCCG CGATCTGCGA ACCCAGCTAC TGCTCGGCAA GACCGTCGAC GACGATCTTT GGGAATACGC CACGGCAGTC GAGACGCTCG GGGAAAAGCG ACTCGCGGTT CACTTCGACG CGTCGTACAA TCCGGACTTG CTCGAACGCA TCGTTCTCGG AGACCGCCGA CTGTTCGTCA AACATGAGTA TTACGAGCCG TTTCTGGAGA CGTTACGTGC CGACGGCGAA ATCGCCGCCA GAGAGGAACT CATGGCATTC ACGCCTACGT CGATGGGCGA ATTCGCCGCA AGCGGCCCCT TCACTGTCGA CTCGATCGAC ACCGAGGGAA TCAAACTCGA ACTGAATCCG ACCCATCCCG ACAGCGATTC CATCGGGTTC CAGTCCTACG AGTTTCGCGC CTTGCTCGGA ACAGACACGG GGTTTCAGGC ACTCGAAACC GGCCAGGTCG ACGTGATCGC CTCGATGTCG ACGCTACCGA ACCAGCCCAT CGAATTGCCA TCATCCGTTG TACAGGTGCG GATCCCCGAT TATTGGGGGT TGGGCCTCTG GCCGAACCAC GATGTCGAGC CGCTGGACGA CCGGGCCGTC CGACAGGCAA TCGCCTTTGT CATCGACCGC TCGGCCGTGA CCCAGGCGAG TGGCCGGGAA ACGAACCAGC CGGTAGAGAC GCCAAGTGGC CTGCCGGCGT CCGCTGTTGC GGACTGGATA GCGCAGGATT CACTGGAGAC GTATGTCGAT CCGGATAACG AGGCTGCGAC AGACGTCCTG TCGAGGGCAG GATATACTCG CGAGGATGGA ACCTGGACGG CGGAGGACGG GACCCCCCTC GGCTTTTCGA TATCGGTCCC CGAGTTCATC ACCGACTTCG TCAACGCGAC GGATTCCGTC GCCGAACAAC TCCGTTCGTT CGGCATCGAA GCCGACGTTC GGGAAGTGAC GTTTCGGGAC ATCTTCGCCG GTGACTTCGA CGTCGTGTCG GGGTGGTGGC TACCGGGGAG CATCGACAGT ACACACCCGT ACGTCGCGTA CAGGTTCGGG TTCGGCATGG GGATGGTCCA GCCCGATCTA CTCGAGTATC CAGGGTTGGA CAGCACCGTG ACTGTGCCCG CGGACGGGAA CAACGGCTCG ATAGACGTCA CTCCCCAAGA CGAACTGGAA GCGCTGGCCA CGACTGTCGA CCGCGGCGAG GCCGAATCGA TCGTGCAGCG ACTCGCAAAG GTGTACAACG CGGACCTTCC GATGCTCCCA CTCTACCGGA GTCAATATAC GTCGTTCATC GATACGAGCG CCGTCGACGC ACCGGCGGAA GGTTCACCGA AGTTCCAGAT CACGTGGCCC CCGCACTGGC TTGTTCGAAC CGGCGATCTG CACAGTCCAG GCTGGGCCGA GAACGAAGCC TCGACACCCA CCGAGACAGT GACAAATACG 
CCGACTGCGA CGGACACGGA ACGGGAGACG CCATCGGCAA CCGAAACCGA GACGACACAA CCGACGGAGA CGGCGACAGA ATCCAGCCCG TCGACACCGA GCGAGACGGC GACGAAAACG GTGACGCCGA CAGCAACCCC GACAGACCCA AAAACAGCGA CAGTCAGCGA GACAGTCACG AACACTCCGG CAACGACCGA GACAGACCGA TCGGCCACTG AATCACGGAC GACGACGGTC ACGGAGATAC CGACCTCGAC GACCGATGCT GAATCAGAAG GTGGCCAGAA CACCCAGGAC AGCACCGACG GAGCCACCAC GACCAACGGT CCGGGGTTCG GTCTCCTCTC GTCGGTGGCT GGATTAGGTA GCTACGCGCT CTATCGGGCC AGGAAACGGG ACGGGTAG
|
Protein sequence | MSDEFSLSPT RRQLLASLGT GGVLAGGVAG FRTMADVVTP IAAQSPQRGT LVGTMTGPTQ GLHFNLFGTS SADIPAAMVF DPTVKYHRGR GEFIPAAVTE WTVEGETMTL SLRPDLVWDD GDPVTARDLR TQLLLGKTVD DDLWEYATAV ETLGEKRLAV HFDASYNPDL LERIVLGDRR LFVKHEYYEP FLETLRADGE IAAREELMAF TPTSMGEFAA SGPFTVDSID TEGIKLELNP THPDSDSIGF QSYEFRALLG TDTGFQALET GQVDVIASMS TLPNQPIELP SSVVQVRIPD YWGLGLWPNH DVEPLDDRAV RQAIAFVIDR SAVTQASGRE TNQPVETPSG LPASAVADWI AQDSLETYVD PDNEAATDVL SRAGYTREDG TWTAEDGTPL GFSISVPEFI TDFVNATDSV AEQLRSFGIE ADVREVTFRD IFAGDFDVVS GWWLPGSIDS THPYVAYRFG FGMGMVQPDL LEYPGLDSTV TVPADGNNGS IDVTPQDELE ALATTVDRGE AESIVQRLAK VYNADLPMLP LYRSQYTSFI DTSAVDAPAE GSPKFQITWP PHWLVRTGDL HSPGWAENEA STPTETVTNT PTATDTERET PSATETETTQ PTETATESSP STPSETATKT VTPTATPTDP KTATVSETVT NTPATTETDR SATESRTTTV TEIPTSTTDA ESEGGQNTQD STDGATTTNG PGFGLLSSVA GLGSYALYRA RKRDG