Gene Information |
Locus tag | Htur_4095 |
Symbol | |
ID | 8744723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 356177 |
End bp | 357880 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646514655 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003405602 |
Protein GI | 284167324 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
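The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the reported Gene Length) and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Htur_4095.
length_bp = gene_length(356177, 357880)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths match the Gene Length (1704 bp) and Protein Length (567 aa) fields.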
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAATC AGCCGGATCA GATCAACTAC AACCCCTTCG GACAGCAGAC CCCCGGCGTC TTGAACAAGA TCATCTTCGA GCAAGGCGCG CGGTGGCATG CCGACGAGGA AGAATGGGTG TCACTACTCG TCGAGAACTG GGAATACCCG TCGACGGTCC AGTCTGGGGC GACCGTTCGG TTCGACCTGT CTGATACGTA TACGTGGTTC AACGGAGACG CGTACACGGC CGAGGACTTC GTGGGCCAAA TGCGCTGGCG GAACGTGAAC AACGATGCCG TCTGGGACTT TCTCGACGAT GTCGAGCAGA CGGGCGAATA CAGCGTCGAG ATGACGCTCG CAGAGAAGAT CGACACCCAA CTGTTCGAGA ACGCGCTGTT CGGGATGGGC GACGCCCCGA CCAACTGGAC GTTCAAGTTC GACGTCTTCA GGGACTACCT CGAGCGCATG GAAGATGCTG GGTCCGACGA AGACCGGAGT ACGATCCTCC AAGAACTCGC CGAGTGGAAC GTTTCTCTCA ACGAGGCTCG TGAGAAAGGG CTCGGGAACG GGCCGTTTAT GCCGTCGGAG GCGACGGCGA ACCAGCTCCT CCTAGAGAAG TACGAGGACT ACCAGAACCC CCACATCACG GCGGACGACA TCGCGTTCGA TACGATGGAG GCGCTGCCGC TCCAGGGGCC TCAGGAGAAG CTCCGATCGC TGCGCAATAA CGAGGTGGAT GCGCTTCACA ACGTGGCGTT CAACTCGGCG CAGGCAGACC AGATCCCGGA CAACTACGAG TCCGTCAGGT TCTACAGCCA TAGCGGCGAG TCGATCTCAT TCAACTGTCG CCGCGAGCCT CTCGACAACC AGCAGGTCCG GTGGGCGCTC TCGAACGTGC TGCAAGCGAG CCACGACACG CTGATGCAGA ACCTGCCACT CTCGGACGTG AACAAGGAGC GCGTCAATCT GTCCGCGGGG ATGTCACAGC CGCTTATCGA TGAGTGGCTC GGAGACGTCA AAGGTCAGTT CATGCAGTTC GACGGCGGGA CCGAACGAGC TACCGAACTC CTTCGGGACG AAGGGTTCAC CCAGGAGAAC GGTACGTGGT ACAAGCCCGA CGGCGAGCAG TTCACGCTCA CGTTCAGAGA CGCCGGTTTC CACAGCAACC GGACGGAAAC CGCGTCGCGG ATCCTCAGCG ACTTCGGTAT CGAGACCGAA GCGATCATCG TTGAGGACAC CACGTACTTC GGACAGACGA TCCCCGAGCG GGACTACGAT CTCACTAACT GGTGGGTCGG CCAGTCCGCA CCGCTTCCGT ACGAGGGGTT CCAGAATCAC CTCGTCAACG AGGCATGGGT GACCGCGTAT CCGCTCGGCG TACCCTCCGT CTCGGAGTGG AACGGCGAGG GTACCAGCGA GTTCATCGTC GAAGTTCCGC CGATCGGTGA GCCCGACGGA GAGCTTCGAG AGATGGACAT CCGGGAACGC CTCCAGGCGA TCGCGCGAGG CCAGAGCAAG GAAGAACAGC GGCCGCACAT CCAGCAGCTA GCGTGGTCCT GGAACTGGAT GGACGCTTCC TGGGGACCAT GGACACTGTA CATCGCCTCT GAGTACTACA ACACCGAGAA CTGGAACTGG CCCGCAAACG ACAGCGCGAT CATGAAGACA CCGAGTGTGC AAGACTGGCC CGTCCGCCAG GGCCAGCCGA CGCCCAACGA ATAA
|
Protein sequence | MANQPDQINY NPFGQQTPGV LNKIIFEQGA RWHADEEEWV SLLVENWEYP STVQSGATVR FDLSDTYTWF NGDAYTAEDF VGQMRWRNVN NDAVWDFLDD VEQTGEYSVE MTLAEKIDTQ LFENALFGMG DAPTNWTFKF DVFRDYLERM EDAGSDEDRS TILQELAEWN VSLNEAREKG LGNGPFMPSE ATANQLLLEK YEDYQNPHIT ADDIAFDTME ALPLQGPQEK LRSLRNNEVD ALHNVAFNSA QADQIPDNYE SVRFYSHSGE SISFNCRREP LDNQQVRWAL SNVLQASHDT LMQNLPLSDV NKERVNLSAG MSQPLIDEWL GDVKGQFMQF DGGTERATEL LRDEGFTQEN GTWYKPDGEQ FTLTFRDAGF HSNRTETASR ILSDFGIETE AIIVEDTTYF GQTIPERDYD LTNWWVGQSA PLPYEGFQNH LVNEAWVTAY PLGVPSVSEW NGEGTSEFIV EVPPIGEPDG ELREMDIRER LQAIARGQSK EEQRPHIQQL AWSWNWMDAS WGPWTLYIAS EYYNTENWNW PANDSAIMKT PSVQDWPVRQ GQPTPNE
|
| |
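The Gene sequence and Protein sequence fields can be related with a short script. This is a minimal sketch, assuming the gene sequence is stored in coding orientation (it begins with ATG even though the strand is "-") and that the blanks between 10-base groups are display formatting only. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard codon-to-residue string is used here. The names `clean`, `gc_percent`, and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove whitespace between sequence groups and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon.

    Raises ValueError if a codon contains a base other than A, C, G, or T.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last codons of the Gene sequence field above.
print(translate("ATGGCCAATC AGCCGGATCA G"))
print(translate("ACGCCCAACG AATAA"))
```

The two fragments translate to the first seven and last four residues of the Protein sequence field (MANQPDQ and TPNE). Passing the full Gene sequence to `gc_percent` gives a value that can be compared with the GC content field.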