Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_1045 |
Symbol | |
ID | 8741632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 1081275 |
End bp | 1083098 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646511623 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003402610 |
Protein GI | 284164331 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTGTA ATCCAACCGA TCCTGTCGAC GGCGTCGATC GACGCTCCGT CCTGGCTGCG GGCGCAGCCG GGCTCTCGCT CTCCCTGAGC GGCTGTGTCG ATACGGTCCA ACGCGTCGTC GACCAGGACG GTACCGACCA ATTGTCGCTC TCCATCGTGA CGGTCCCCGC GGACGCCGAC CGGGAAAGCA TTCGGATCGC CTATCACCTC GAGTCGCATC TCGAGGCGAT CGGCGTGGAC GTGACCCTCA AGGTGCGCTC TCGTACCGAC TTCCTCAAAA CGGTCCTGAT CGACCACGAC TTCGATATCT ACGTCGGTCA GCATCCCGCC GATTACGATC CGGACTTCCT CTACGAAGCC CTCCACTCCA CGTACGCAAA CGAGGCCGGC TGGCAGAACC CCTTCGGGTT CGACAGCATG GCCTTCGACA CCCTCCTGGA GGATCAACGC CGGGCCGACG GTGAGCAGCG CAAGCAACGC TTAGCAAATG TACTGCGCGG AGTTGCAGAC GAGAAACCGT TCGATCCGAT CTGTCGTCCC GACGAGATCC GGGTCGCTAA CACCACCCGC TTCGACGGTT GGGATCAGGG TCACCTCGCG ACGCGACGCG GCTATCTCGG CCTCGAGCCG GACGCCGGCG TCGAACGATT GAACGCGCTC GTGACCGACG CCAGACCGTC GGTCAACGTC AACCCGATCT CGGCGACGGT CCGGGAACGG GGGACGGTCG TCGACCTGCT GTACGACTCG CTCGGGACCG TCGTCGACGG CGAGGTCCTG CCGTGGCTCG CGGAGTCGTG GGAGTGGGTG ACCGATGCCG AGACCGACGA GAAGAAAACC GAAGAGATCG CCGAGCCGAC CCGAAACACG ACGACGGCGC GGGTTTCGCT CCGAGAGGAC TGTCGGTTCC ACGACGGCGA ACCGGTTACC GCAGCGGACG TCGAGTTCAC CTATCAGTTC TTCCAGGATA CCGTGCTCGG TCACGCGACG CCGTCGCCGC CGCCCCGCTA TCGCGGTCAC GCGAGCGCGA TCGACGATAT CGAGATCGAG GACGAGTACA CGCTGCGGAT CACCGCCGCC GCCGGCACGG ACGTCTGTGA ACGCGCGTTT ACCGCCCCGA TCCTCCCGAA ACACGTCTGG CAATCGGAAC TCGAGGACCG GCTCGGTAAC TCTCAGGAGT TTTCGGCGCC GCAAGGATCC TGGAGTTTGG TCACCAGCGA CTCGATCGAT CCGACCGGGA GCGGTCCCTA CCAGTTCAAG AACAACTCCG AACGGGAACA CCTCACGCTC GAGCGGTTCG ACGATCACTT CACGCTGCGC GAGGACGCCG CAGGCGACCA CCTCCTCGCT CCACGCGTCG AGGAGCTCCG GTTTATCGTC GATCCCGGCA GCCCGTCCTC CATCTCGCGG GTCGCCAGCG GCAACGCCGA CCTCACCTCG TCGATGCTCG CGGCGTACTC GCTCAGCGAC ATTCCAGAGG ACAACCCCGA CGTCGAACGG CTCGAGTCGC CCTCGTGGAC GTTTTACCAC CTCGGGTTCA ACACGCGGGC GCCGCCGTGT AGCAACCTCC ACTTCCGGCG CGCGATCTGT CGACTGATCG ACAAGGAGTG GATCGCGAGC GAAGTCTTCG GCGGCCACGC GGACCCGCTC GTGGCTCCGG TGACCGAGGA GTGGACGCCC GACGACCTCG CGTGGGACGG TGCGGACCCA GAAACGCCAT TTGCCGGCAC CGACGGGACG CTCAACGTCA ACGCGGCGCG TAACGCGTTT CAGGCGGCCG GCTACTACAC CGACGACGAG AACCGACTAC AGGGGCGATA CTGA
|
Protein sequence | MNCNPTDPVD GVDRRSVLAA GAAGLSLSLS GCVDTVQRVV DQDGTDQLSL SIVTVPADAD RESIRIAYHL ESHLEAIGVD VTLKVRSRTD FLKTVLIDHD FDIYVGQHPA DYDPDFLYEA LHSTYANEAG WQNPFGFDSM AFDTLLEDQR RADGEQRKQR LANVLRGVAD EKPFDPICRP DEIRVANTTR FDGWDQGHLA TRRGYLGLEP DAGVERLNAL VTDARPSVNV NPISATVRER GTVVDLLYDS LGTVVDGEVL PWLAESWEWV TDAETDEKKT EEIAEPTRNT TTARVSLRED CRFHDGEPVT AADVEFTYQF FQDTVLGHAT PSPPPRYRGH ASAIDDIEIE DEYTLRITAA AGTDVCERAF TAPILPKHVW QSELEDRLGN SQEFSAPQGS WSLVTSDSID PTGSGPYQFK NNSEREHLTL ERFDDHFTLR EDAAGDHLLA PRVEELRFIV DPGSPSSISR VASGNADLTS SMLAAYSLSD IPEDNPDVER LESPSWTFYH LGFNTRAPPC SNLHFRRAIC RLIDKEWIAS EVFGGHADPL VAPVTEEWTP DDLAWDGADP ETPFAGTDGT LNVNAARNAF QAAGYYTDDE NRLQGRY
|
| |