Gene Information |
Locus tag | Htur_4447 |
Symbol | |
ID | 8745076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 28828 |
End bp | 30642 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 646514984 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003405931 |
Protein GI | 284172549 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
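The coordinate and length fields above can be cross-checked against one another. The sketch below is a minimal consistency check, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions agree with the values listed here. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 28828
END_BP = 30642
GENE_LENGTH_BP = 1815
PROTEIN_LENGTH_AA = 604


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Both checks pass for the values in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

A record whose fields fail either check may use a different coordinate convention, or may describe a spliced or partial gene, so a mismatch is a prompt for inspection and not necessarily an error.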
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0329376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTGCAAG ATAACGATTC ACAGGACAGT GGAACGTACC GACGAGACGT CCTCAAATAC GGCGCCGCGG GAGGGACCGT TCTCGCCGCC GGCTGCCTCG GTGGGAGCAG TAGCACGGAT CGATTCCGCG TGTTCGATCC GGAGACGAGC GGGACGCTGC CGTCCCAGCG TCACTGTAAT CCGTTCAACC CGACCCAGCG CGGGACGTGG CATCCGGGAG CGCTCATCTG CGACCGGCCC GTGATGTACA GTCCGGCGGA GGTCGAAGTC TACCCCCTGA TCGTGACCGA CTGGGAGATG GCCGACGACA CCACGCTGGA GATGACGTTC AGCGACGAGT GGACCTGGCA CAACGGCGAT CAGCTCGTCG CCGACGACTG GGTCATGCAG CTGCAGATGA CGCTCTCGAT TCTCGAGTTC CAGGCCGAAG ACGGCGAGCG ACCCCACCAG TTCATCGAGT CCGTCGAGGC GCCCGATGAG CAGACGGCGC GGGTCAGCCT CTACGATCCG ATCCCGGAGA CGGTCGCGAT CCAGAACGCC ACCGCCGACA TCCTCGGCGA CGAGGGTCGC GGCATCTTCA CCAAACACGA CGACGACCAG TGGAGCGAGT GGCACGAGCA ACTGCAGAAC GCCGACGATT CCGAGATGGA GACCCTCGTC GGTGAGCTCA CGTCGGAGGG GTATCCGAAG GTCGAGGACG CGATCGGAAA CGGGCCGTTC CAGGTCGCCG ATATCGGGGA CAACGTCATG ATCTTCGAGA AGTACGAGGA CCATCCGAAC GCTGACAACC TCAATTTCAG CGAATTCTCG ATCCATCTCT TCGATACCGA TATCCCGACG CAGCCGTACG TCAACGGCGA GGTCGACGGC GCACACAACG AGTTCCCCGT GAAGGACGAC GTCAAGAGCC AACTCCCCGA CGGACACTCC CTCGTCAGGG AAAATCTGTC GACCAACAAG CTGCTCACGT TCAACTGCGG TCACAACGTC AACTACGATA CGCCCTTCTC GAACGCGAAC GTCCGGAAGG CGGTCTGCCA CGTCTTCGAC CGCCAGCAGG TCTCTGGGGT CCTCGAGGGC GTCAATCAGC TCTTCGATTG GCCGCCGTGT CGCGTCCCCG GGACCACTCT GGAGAGCGGT TCTCATGACG CTGCGGACTG GGTCGAGGAC TTCACCAAGT ACGGCCAGAA CGACACTGAA CGCGCCGCCG AACTCCTCGA AGGAGAGGGC TACACGCTCG AAGACGGGCA GTGGTACACG CCGGACGGTG ACCGCTTCGA GATCGATATC ATGAACGGCT CCGAGCAGAA GCATATCGGC GTCCTCAAGA ATAATCTGAA CGAGTTCGGC ATCAAGACGA ACCAGGAGCA GGTCGACGAT GCGACGTTCG ACGAGCGCCG CCACGCCGGC GAGTACGACA TGATGCCTGA CACGTCGTCC GCTAACGGCG TCCGCGCGAT GTGGGAACTG GACCTCGTGC CGACCTGGAT CCAGTCGATC ACGCACTTCG ACCCCAACGC GGAGATCCCG ATGCCCGTCG GCGATCCCGA GGGCTCGAGC GGGACGAAGA CGTTCAACGT CGAGGAACAC ATCCGGGACT GGCGCATCTC CGACGACGAC CAGTACCACC GTGAGTTGCT GTGGTGGTGG AATCAGAACG TCCCGCAGAT GGAGGCGATG TTCCAGCCCG ACGCCGGCGC TTACAACGGC GACAACTGGA CGATCGACGG GCCGGACGGC ATCGTCCACG GGATCGACGA CGCGCTGTAC CTCGTTACCA AGACGGACGA GGCGACGATG 
GAGTACACGG GCTGA
|
Protein sequence | MVQDNDSQDS GTYRRDVLKY GAAGGTVLAA GCLGGSSSTD RFRVFDPETS GTLPSQRHCN PFNPTQRGTW HPGALICDRP VMYSPAEVEV YPLIVTDWEM ADDTTLEMTF SDEWTWHNGD QLVADDWVMQ LQMTLSILEF QAEDGERPHQ FIESVEAPDE QTARVSLYDP IPETVAIQNA TADILGDEGR GIFTKHDDDQ WSEWHEQLQN ADDSEMETLV GELTSEGYPK VEDAIGNGPF QVADIGDNVM IFEKYEDHPN ADNLNFSEFS IHLFDTDIPT QPYVNGEVDG AHNEFPVKDD VKSQLPDGHS LVRENLSTNK LLTFNCGHNV NYDTPFSNAN VRKAVCHVFD RQQVSGVLEG VNQLFDWPPC RVPGTTLESG SHDAADWVED FTKYGQNDTE RAAELLEGEG YTLEDGQWYT PDGDRFEIDI MNGSEQKHIG VLKNNLNEFG IKTNQEQVDD ATFDERRHAG EYDMMPDTSS ANGVRAMWEL DLVPTWIQSI THFDPNAEIP MPVGDPEGSS GTKTFNVEEH IRDWRISDDD QYHRELLWWW NQNVPQMEAM FQPDAGAYNG DNWTIDGPDG IVHGIDDALY LVTKTDEATM EYTG
|
| |