Gene Information |
Locus tag | Htur_3888 |
Symbol | |
ID | 8744516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 121275 |
End bp | 123044 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646514472 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003405419 |
Protein GI | 284167141 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
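The coordinate and length fields in this table can be checked against each other with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 121275
END_BP = 123044


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1770 589
```

Under these assumptions the result agrees with the listed Gene Length (1770 bp) and Protein Length (589 aa).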
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.170596 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACACA ATCACACCCG CGTAAATAGA CGGCAGTTTA TCGGCGCGAG CGCCGGCACG GTCGCCGCCA CGTTCGCGGG CTGTCTCGGC AGCGACAGCG ACTCGACGGA GTTCGTCACG GCGTTTGCGG GCGGCCGTCA GCCGACACAG GTTCACTTCA ACCCGTGGAA CGCGTCGGAC TACGCACAGA CATACAGCAT CTACTGGCTT CAGGGAACGG TCGTGACACA CGCCGACGGG ACCGTCTCGA CCGATTTCTT CGAGGACCTC AGCGTCGACG GTCGGGAGGT CACGCTCGAG TTCTCGGACG AGTGGAGCTA CTGGAACGGC AACGACATCA CCGCCGAGGA CTACCTCATC GAACAGGAGA TCTGGCGCTA CCAGGATCCG GAGGCCTCGC CGATCGAAGG CCACGAACTG GTCGACGAGT ACACCGTCAA GCGGATCTAC AAGGACGATA TCTCGCCGAC GATCGCGAAA TCGAACGCGG GCGCCGGCAC CTCCGCGCCG AAGTCGGTCT TCCGCGAGTA CTACGAGCGT TACGAGGACG CCACCACGGA GAGCGAGCGG GAGGCCGTGA CCGACGACCT CCTCCAGCTG ACGATCGACA CCGAGGAGTT CGTCGAGGAG GGGTACGGGA GCTCGCTGTT CAAGATCGAG GACTTCAACT CCTCCGAGAC GCTGGCGACG AAGTGGGACG ATCACCCGTG GGCGGACCGA ACGGACATCG AACAGATCCG CGTAAAGCCA ACCGAGGGGA CGCAGGTCGA GCAACTCGAG AAGAGCGACG AGCTCGACAT GACCCAGTAC ATCACCGAGG ACCAGCGGTC GGACTACCCC GACAACATCG AGAACATCTA CGAACTTGAC CACTACAGCT GCCGGAAGTA CATCCTGAAC TGGAACAACG AGCACCTCGC GCGGCGGCCG GTCCGGCGGG CGATCATCTC CGCGATCGAC ATCCCCTCGA TCGTCGACGC CGCGAACCAG ACCGGGTTGA TGGCGACGCC AACGCAGGTC CAGACGGGGA TTCGTGCATC CATCGAAGAG AAGTACCTCG GCGAGGACTT CGTCGACAGT CTCATCGACT ACCCCGTCGA GGCCGACGAA GAGACGGCCG TCGAGTACAT GGAGGAGGCC GGCTACTCAC GTGAGGGCGG CGAGTGGATC GGTCCCGACG GCAACGCGAC TGACTTCACC GCTATCACGC AGGCGGGGGT CAGGAAGTCC CAGCCGATGA AGGTCTTCAC CGACCACCTC AACGAGTTCG GCTTCAACGT GGAGATGGAG GCCGTCGGCC AGGACTACTA CTCGCGGGTT CAGGAGTGGG AGTTCGATAT CGCCTGGATG TGGCACGTCG CACTGCCGTA CTGGCATCCC GTGGCGTACT TCTCGAACGA CTTCTACGGT CTCCTCGCCG GCGACGTCAC CAGCGACAGC GACACGGGTC CGACCGGCGT GCCGTTCTCG CTCGAGATCC CAGAGGAAGT CGGCGCGACG GAAGTCGGGG GCAACGGCGT CGAGATCAAC CCGGCCCAGC TCATGGTCGA CCTCGAGGGC GCATCGTCCG AGGAGGAAAC GATCGAGCTC ACGCGAACGC TCGCTCAGTG GGTCAACTAC GATCTGCCCG TGATCGTCCA CTTACAGGAG AACCGCGGCT TCGCCGGCGA CGTCGAGAAC TTCGACTTCC CGAGCGAGGA CGACTTTCGC ATGGATCGCT CCAATCCGGG ACCGAACGCG CTGCTGAACG GCCACATTAC GACTAACTAA
|
Protein sequence | MRHNHTRVNR RQFIGASAGT VAATFAGCLG SDSDSTEFVT AFAGGRQPTQ VHFNPWNASD YAQTYSIYWL QGTVVTHADG TVSTDFFEDL SVDGREVTLE FSDEWSYWNG NDITAEDYLI EQEIWRYQDP EASPIEGHEL VDEYTVKRIY KDDISPTIAK SNAGAGTSAP KSVFREYYER YEDATTESER EAVTDDLLQL TIDTEEFVEE GYGSSLFKIE DFNSSETLAT KWDDHPWADR TDIEQIRVKP TEGTQVEQLE KSDELDMTQY ITEDQRSDYP DNIENIYELD HYSCRKYILN WNNEHLARRP VRRAIISAID IPSIVDAANQ TGLMATPTQV QTGIRASIEE KYLGEDFVDS LIDYPVEADE ETAVEYMEEA GYSREGGEWI GPDGNATDFT AITQAGVRKS QPMKVFTDHL NEFGFNVEME AVGQDYYSRV QEWEFDIAWM WHVALPYWHP VAYFSNDFYG LLAGDVTSDS DTGPTGVPFS LEIPEEVGAT EVGGNGVEIN PAQLMVDLEG ASSEEETIEL TRTLAQWVNY DLPVIVHLQE NRGFAGDVEN FDFPSEDDFR MDRSNPGPNA LLNGHITTN
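The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how a GC fraction such as the one in the GC content field might be computed; the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used only as a short example.
example = clean_sequence("ATGAGACACA ATCACACCCG")
print(len(example), gc_fraction(example))
```

Passing the full gene sequence to the same two functions gives a value that can be compared with the 63% reported in the record.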