Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4667 |
Symbol | |
ID | 8745268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | - |
Start bp | 249708 |
End bp | 251471 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 646515176 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003406123 |
Protein GI | 284172741 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGACA ATAACCGTGA TTACGTTAGT CATATCGATC GCCGTGCGTT CCTGCAGGGC GCAGGCGCCT CAGGCGCCGC TGCACTGGCG GGCTGTACGG GCGGAACCGG ATCCGGGGAC GGCACCCATG TCAGCTATAC GAATCAGGTT CCGACGAAGA TTCAGTATAA CCCGCTCAAT CCAACGAGTT ACTCTCAGTA CTCGCATTAC CTGCTTTTCG ACCGATTTGC GAATTTCAAC TTCGCAAAAG GCGAGTTCAC TCCCTACCTC ATCCAGGATT GGGAGTTCGG CGACAAGACG TTCGAAATGA CCGTTCGCGA CGGCGTCACC TGGGAAGACG GCGACGACGT CACGGCCGGT GATGTCGCGA CGCAGCTGCG CCTCGCGAGG TTGACCGGCG GAACGATCGA TCAGTTCACC GAAGATATCG AAGTAGCGGA CGACCAGACC GTCGTTCTCA ACCTCGCTGG AGACGTTAAC CCCCGAATTG TCGAATTCAA TTCGTTGGGA CAGCGGTTCA TGACCCTGAA AGAGAGCGAG TTCGGGGAGT ACGTCGACAT GTTCGAGGAT GATGCTGAGC AGGCCCAAAG CGAAATCCAG AGCCACGCCT ACAAGGATGT CATCGCCAAC GGCCCGTTCA CTGTTTCAGA GCGCGGGAAT CAGCAGATTC TTCTCGAGAA GCGGGACAAC CACCCCGACT CAGGCAACAT CAACTTTGAT CGGGTCGCGT TCCGCTACCT CGACGGGAAC ACAGCCGTTC ATCAGGCGAT CGGCGCGAAC GAACTTGACT CCGTGATGGT GTTCGCCCCG CCGAACGTCG TGAATAACTT CCCCGATCAC ATCCAGATGG AGACCATTCC GGGGAAGGTC GGATACGGAT TGATTCCGCA GCATGACCAC GAGCACACGG GCGACAGAGC CGTTCGTCAG GCGATCGCGC ACGTGCTAGA CCGAGAGGCC ATCGTCAAGA ACGTTGGCGA GACGCTGAAG CAGGCGCCAC CGCTGCTCAC TGGAATTCCC TCGGACGACC AGGAACGGTG GCTCGACGAC GCCTACGACT CGTTCGAGGA CTATGGCGTG AACGAACGAC AGACCGACGA TGCCAACCGG ATCCTCCGTG ACGCCGGCTA CTCGAAGGAC GGCGATACGT GGGTCGATAA TAGCGGAGAC GTAGTCGAAC TTCCCATCAC CGTCCCGTCT GGGTGGACCG ACTGGGTCAC CGCGACCCAG ACGATCGTCG ATCAACTGAA CGACTTCGGA TTCGAGTCTC AGGTCGATTC TCGGAATTTC AGCGCGCTGA ACGGAACGGT CTGGCCGAAT GGCGACTTCG TCCTCTCGGC CGGCGGGTGG CTTCCCGGTG GGGGTCGGGC GTCGTTCCCA TACTTCTCGC TTCACCACCA GCTCCTCGAG CACTACCGCA ACTTCTCGTA CAACTACGAC CCCGCCAACG AGAACCGCGG CGGGAGCAAC GGCGACGTCA CCGTCCCCTC TCGGACCGGG TCAGAAACGA TGACGGTGAA CCCTAGCGAT CGCCTCAAGG AACTGTCCGA AACCTCCGAC GAAGCGACAA CCCGCGAGAT CTCGATCGAA CAGGCGTGGG TGACCAATGT GGACCTTCCC ATGATCCCCG TCATGGAAAA GCAAGAACAG GCGTTCCTTG CCACCGACGA GTGGTCCGTT CCCGAACAGG GTGCCGAGGT TTCGCAGGTT CGGTGGCCGA ACCTCTGGCT CATTCGTCAG GGAGAACTGC AATACGACGG ATAG
|
Protein sequence | MTDNNRDYVS HIDRRAFLQG AGASGAAALA GCTGGTGSGD GTHVSYTNQV PTKIQYNPLN PTSYSQYSHY LLFDRFANFN FAKGEFTPYL IQDWEFGDKT FEMTVRDGVT WEDGDDVTAG DVATQLRLAR LTGGTIDQFT EDIEVADDQT VVLNLAGDVN PRIVEFNSLG QRFMTLKESE FGEYVDMFED DAEQAQSEIQ SHAYKDVIAN GPFTVSERGN QQILLEKRDN HPDSGNINFD RVAFRYLDGN TAVHQAIGAN ELDSVMVFAP PNVVNNFPDH IQMETIPGKV GYGLIPQHDH EHTGDRAVRQ AIAHVLDREA IVKNVGETLK QAPPLLTGIP SDDQERWLDD AYDSFEDYGV NERQTDDANR ILRDAGYSKD GDTWVDNSGD VVELPITVPS GWTDWVTATQ TIVDQLNDFG FESQVDSRNF SALNGTVWPN GDFVLSAGGW LPGGGRASFP YFSLHHQLLE HYRNFSYNYD PANENRGGSN GDVTVPSRTG SETMTVNPSD RLKELSETSD EATTREISIE QAWVTNVDLP MIPVMEKQEQ AFLATDEWSV PEQGAEVSQV RWPNLWLIRQ GELQYDG
|
| |