Gene Information |
Locus tag | Htur_3692 |
Symbol | |
ID | 8744318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 3804308 |
End bp | 3805888 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646514279 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003405227 |
Protein GI | 284166948 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGTGGTGA ACCAGTCAAC GACGCGGCGG CACCTCCTCG CTTCGGGGGC CGCGATTTCC GCGTCAGTCA TCGCAGGGTG CATCGGCGGC GGCGGTGGCG GCGACGGGAA AGCCTTCCGC TTCACGCAAG AGCAGTCGCG AGAGGAGCAG TTCGATCCCG TCGTCTCGAA CGACGCGTAC AGCTTTCAGG TGATTCAGCT CGTCTTCGAC GGGCTCTACG AGTACAGCGA AGGACTCGAG CTCCAGCCGA AACTCGCGAC GGGCGAGCCG ACCGTCGAGC GCGACGGCAC GCGGTACATC TTCGAGATCG TAGAGGGTGC CACGTTCCAC AACGGAAACG AAGTGACCGC CGGAGATGTC GCGCACTCCT TTACCGCGCC GGTCGAGGAG GAGACGGAGA ACGCCTCGGA GTACGACATG ATCGAGAGCA CCGAGGTCAT CGACGACTAT CAGCTCCAGG TCGACCTCGG GGAGGATCCG TACGGCCCGT TCGAACTCGC GACGATGGGC GTGACGGTGG TCCCAGAGAG CGCTCGAACC GAGGACCGCG AGGCGTTCAA CACGAATCCG ATCGGCTCGG GACCGTTCAC CTTCGCCGAA CTCCAAGAGA ACGAGTACGT CGAGATCGAA CGCAACGACG ACTACTGGGA CGACCTCGAG CCGAACCTCC AGCGGGTCCG CTTCGAAGCT CACGACGACA ACGCGGGTCG CGTCTCCGAC ATCCGGTCGG GGAACACCGA CGCCATCGCC GGCGTGCCCA ACGAGGACTG GAGCGTCCTC GAGAACGAGG AGAACGTCAC TCTTCATTCG GCGGAGAGTC CGACGTTCAT GTACATGGCG TTTAATTGTA ACGAGGGGCC GACAACGAGT CCCGAAGTGC GGCGAGCGAT CGCCCACTCG TTCTCGATGT CGGACTTCAT CGAGTCGAAC GCCGCGAACG TGGCGTCGCC GATGTACAGT CCGATCCCGC CCGTCGTCAA TGAGGTCTGG GGCTTCCCCG AGGACGAGTA TCAGGAACTG TTGCCGTCGT ACGACCCCGA GGAGGCGCAG TCGCTACTCG ACGAACACGC GCCCGACGGC TTCACGCCGA CGATCATCAC GCCGGAGGGA ATCCGCGCCC AGTTAGCCGA ACGGATCGCG ACTCGGTTGG ACGAGATCGG GTACGGTGCG GACGTACAGG TACTGGACTT CGCGACGCTG GTCGACACCT ACACCAGCGG AAGCGCCGAC GACTACCAGA TGTACCTGCT GGGCTGGACC GGCGGTCCCG ATCCGGACTA CTACCTCTAC CCGCTGTTCC ACGAGAGTCA GGCGGGAACG AATCAGGGCC ACTTCTACGG CGGGAGCGAC GGGTTCCACG AGGCGATCGC CGAGGGACGC AACTCCGCTG GACAGGAGGA GCGCTACGAC ATTTACGAGC CCGTCATCCG AGAGATCGTC GAACAACTGC CTGCCCTCCC GGCGTTCACG CAGGACAACA CGATGGCCTC GCGCAACTAC GTCCAGGACC TGCAGGCACA CCCGGAAGTG ACGCGGAACC CGACGCTCGT CGCAGAGTAT ACGAACGTAT CGATGGAGTG A
Protein sequence | MVVNQSTTRR HLLASGAAIS ASVIAGCIGG GGGGDGKAFR FTQEQSREEQ FDPVVSNDAY SFQVIQLVFD GLYEYSEGLE LQPKLATGEP TVERDGTRYI FEIVEGATFH NGNEVTAGDV AHSFTAPVEE ETENASEYDM IESTEVIDDY QLQVDLGEDP YGPFELATMG VTVVPESART EDREAFNTNP IGSGPFTFAE LQENEYVEIE RNDDYWDDLE PNLQRVRFEA HDDNAGRVSD IRSGNTDAIA GVPNEDWSVL ENEENVTLHS AESPTFMYMA FNCNEGPTTS PEVRRAIAHS FSMSDFIESN AANVASPMYS PIPPVVNEVW GFPEDEYQEL LPSYDPEEAQ SLLDEHAPDG FTPTIITPEG IRAQLAERIA TRLDEIGYGA DVQVLDFATL VDTYTSGSAD DYQMYLLGWT GGPDPDYYLY PLFHESQAGT NQGHFYGGSD GFHEAIAEGR NSAGQEERYD IYEPVIREIV EQLPALPAFT QDNTMASRNY VQDLQAHPEV TRNPTLVAEY TNVSME
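
The coordinate and length fields in this record can be cross-checked with a short calculation. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for locus tag Htur_3692.
length_bp = gene_length(3804308, 3805888)
print(length_bp)                  # compare with "Gene Length"
print(protein_length(length_bp))  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1581 bp) and Protein Length (526 aa) fields listed above.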