Gene Information |
Locus tag | Htur_4466 |
Symbol | |
ID | 8745095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | - |
Start bp | 54153 |
End bp | 55847 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646515003 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003405950 |
Protein GI | 284172568 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
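The coordinate and length fields in this record agree with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length_bp(54153, 55847)
print(length)                     # 1695
print(protein_length_aa(length))  # 564
```

Both values match the Gene Length and Protein Length fields reported for Htur_4466.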
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0117733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGGCG CGATGGCAAT GGCCGGTTGT CTCGGGGGCA CATCGGGGGA GACAAACACG TTCGTCGAGC CGATAACGGT GGATCCGACG AACTATCAGT TCAACCATCT CAATCTGCAG AACCCGTTCT CGCACGCGAT GCAGAGCGAC CAGCTCCAGC GGTACTTCAT CAACGACAGC TCGTTCGAGT CGTACGCGGT CTCCATCGAG GAGTTCGAGG GCGAGACTGC CGTTCTGAAG GTTCGGGACG GTCTCACGTG GCACAACGGT GACCCGGGAG ATCCGGTCGA CGCCGACGAC CTCTACACGA AACTCGTCAC CGACGCGATC ATCGGTGACA CGATCTCGAC TCTCTGGACG GACATCGAGC GGGTCGGTGA CAAGTCCGTC GAACTCACCC TCGACGGAAC GATCAACGAG CAACTGTTCC GGGACGGACT GAACTACTAC TGGCTCGAGA CGCCGTTCCG ACTGTACAAG GACTACGTCG AGCGGTGGGA GGACGCGACG AGCGACGACG AGATCAGCGA CATCCGGAGT GACCTCCGTA ACGACTCGTT CGACGAGTCG AAGGTGAAGG GGAACGGTCC CTTCCAGTTC GAGGAGCGGG ACAACCGCCA CCTCAAGATG ACCCTGTACG AACACCACCC CGACGCGGAC AACATCAACT GGGAGAACTG GGAGCTCCAC AAGGTTTCGA CCGACACCGC GAGCGTGCTA CTCGGCGGCG AAGTCGACGG AATTCGGAAT TTCTCCGCAC CCGAAACGGT CTTCGAGCAG GCACCCGACG ACCTCCTGAA CGCGCAACTC CCCGCGCTGT GGGGGCACTC GCTTCCCTTC AACCACGAGG ACGACGACTT CAGTAACATC CGGGTTCGAC AGGCCGTCAG CGAGTTCATC GACAGGCAGG CCTGCGCGGA CAACTACGGC CGGCTCGGAC AGCCGGTCGA GGCGCCGAGC GGGCTCGTCG GAAACATCGA CGGCCAGAAC GAACAGACCA ACCGATGGGA GGACAAGGTC TCCGACGAGA TGGCCGAACA GCTCTACCGC TACGACGATC CGGAGCGCGG TCGACGACTG CTCCGCAACG AAGGGTACCA GAAGGACGAC GGTACGTGGT ACCGTCCCAA CGGTGACCCG TTCGAACTGA CGATCAAGGT CCCTGCGGGA TATACCGATT GGCATCCGGT CTACGAGACG ATCGTCGATA ATCTCACGAA CGAAGGTATC GGTGCCGAAC TGCAGCGGAT CGAGGCGTCT GCGTACTGGG CCGACCACTA CACCGCGGCG AACTTCAAGG TGGCGTCGAC GGGATGGACG CTCCAGCGAT CCAGTCCGTA CTACGTCTTC GACATGTACT ACAACATCGA CACGGAGTTC ATGAACCTCG ACCCCGAGAA CCTCGAGGCT CCGCCGATGG GCGAACCCGA TGGCGACCTC CAGTCGGTCG ATCCGCGCGG TCTGATGGAC GAGTTGCTCG TCGCACAGGA CGAGGAGAGA GCGACCGAGC TCACCGATCA GCTCGCGTGG ATCACTAACC AGACGGTCCC CATGCTCCAA CTCAACGAGA TCAACGACCC GGTCTGGTTC TACACGGACG ACTGGGAGGT CCCGGAGCCG GACTCCCCGA AGTACCAGTC GAAGTGGCCC CTCTGGTGGT TCCCGCGGAA CGGTGAGCTG CAGGCGAAGG ACTGA
|
Protein sequence | MGGAMAMAGC LGGTSGETNT FVEPITVDPT NYQFNHLNLQ NPFSHAMQSD QLQRYFINDS SFESYAVSIE EFEGETAVLK VRDGLTWHNG DPGDPVDADD LYTKLVTDAI IGDTISTLWT DIERVGDKSV ELTLDGTINE QLFRDGLNYY WLETPFRLYK DYVERWEDAT SDDEISDIRS DLRNDSFDES KVKGNGPFQF EERDNRHLKM TLYEHHPDAD NINWENWELH KVSTDTASVL LGGEVDGIRN FSAPETVFEQ APDDLLNAQL PALWGHSLPF NHEDDDFSNI RVRQAVSEFI DRQACADNYG RLGQPVEAPS GLVGNIDGQN EQTNRWEDKV SDEMAEQLYR YDDPERGRRL LRNEGYQKDD GTWYRPNGDP FELTIKVPAG YTDWHPVYET IVDNLTNEGI GAELQRIEAS AYWADHYTAA NFKVASTGWT LQRSSPYYVF DMYYNIDTEF MNLDPENLEA PPMGEPDGDL QSVDPRGLMD ELLVAQDEER ATELTDQLAW ITNQTVPMLQ LNEINDPVWF YTDDWEVPEP DSPKYQSKWP LWWFPRNGEL QAKD
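The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the Gene sequence is already given 5' to 3' on the coding strand (it begins with ATG and ends with TGA), so no reverse complement is applied even though the gene lies on the minus strand. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. The names `translate` and `gc_fraction` are hypothetical helpers.

```python
# Codon table in the conventional NCBI ordering (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGGAGGCG CGATGGCAAT GGCCGGTTGT"))  # MGGAMAMAGC
```

Passing the full Gene sequence field to `translate` and `gc_fraction` allows the result to be compared with the Protein sequence and GC content fields of the record.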