Gene Htur_3888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3888 
Symbol 
ID8744516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp121275 
End bp123044 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content63% 
IMG OID646514472 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003405419 
Protein GI284167141 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.170596 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACACA ATCACACCCG CGTAAATAGA CGGCAGTTTA TCGGCGCGAG CGCCGGCACG 
GTCGCCGCCA CGTTCGCGGG CTGTCTCGGC AGCGACAGCG ACTCGACGGA GTTCGTCACG
GCGTTTGCGG GCGGCCGTCA GCCGACACAG GTTCACTTCA ACCCGTGGAA CGCGTCGGAC
TACGCACAGA CATACAGCAT CTACTGGCTT CAGGGAACGG TCGTGACACA CGCCGACGGG
ACCGTCTCGA CCGATTTCTT CGAGGACCTC AGCGTCGACG GTCGGGAGGT CACGCTCGAG
TTCTCGGACG AGTGGAGCTA CTGGAACGGC AACGACATCA CCGCCGAGGA CTACCTCATC
GAACAGGAGA TCTGGCGCTA CCAGGATCCG GAGGCCTCGC CGATCGAAGG CCACGAACTG
GTCGACGAGT ACACCGTCAA GCGGATCTAC AAGGACGATA TCTCGCCGAC GATCGCGAAA
TCGAACGCGG GCGCCGGCAC CTCCGCGCCG AAGTCGGTCT TCCGCGAGTA CTACGAGCGT
TACGAGGACG CCACCACGGA GAGCGAGCGG GAGGCCGTGA CCGACGACCT CCTCCAGCTG
ACGATCGACA CCGAGGAGTT CGTCGAGGAG GGGTACGGGA GCTCGCTGTT CAAGATCGAG
GACTTCAACT CCTCCGAGAC GCTGGCGACG AAGTGGGACG ATCACCCGTG GGCGGACCGA
ACGGACATCG AACAGATCCG CGTAAAGCCA ACCGAGGGGA CGCAGGTCGA GCAACTCGAG
AAGAGCGACG AGCTCGACAT GACCCAGTAC ATCACCGAGG ACCAGCGGTC GGACTACCCC
GACAACATCG AGAACATCTA CGAACTTGAC CACTACAGCT GCCGGAAGTA CATCCTGAAC
TGGAACAACG AGCACCTCGC GCGGCGGCCG GTCCGGCGGG CGATCATCTC CGCGATCGAC
ATCCCCTCGA TCGTCGACGC CGCGAACCAG ACCGGGTTGA TGGCGACGCC AACGCAGGTC
CAGACGGGGA TTCGTGCATC CATCGAAGAG AAGTACCTCG GCGAGGACTT CGTCGACAGT
CTCATCGACT ACCCCGTCGA GGCCGACGAA GAGACGGCCG TCGAGTACAT GGAGGAGGCC
GGCTACTCAC GTGAGGGCGG CGAGTGGATC GGTCCCGACG GCAACGCGAC TGACTTCACC
GCTATCACGC AGGCGGGGGT CAGGAAGTCC CAGCCGATGA AGGTCTTCAC CGACCACCTC
AACGAGTTCG GCTTCAACGT GGAGATGGAG GCCGTCGGCC AGGACTACTA CTCGCGGGTT
CAGGAGTGGG AGTTCGATAT CGCCTGGATG TGGCACGTCG CACTGCCGTA CTGGCATCCC
GTGGCGTACT TCTCGAACGA CTTCTACGGT CTCCTCGCCG GCGACGTCAC CAGCGACAGC
GACACGGGTC CGACCGGCGT GCCGTTCTCG CTCGAGATCC CAGAGGAAGT CGGCGCGACG
GAAGTCGGGG GCAACGGCGT CGAGATCAAC CCGGCCCAGC TCATGGTCGA CCTCGAGGGC
GCATCGTCCG AGGAGGAAAC GATCGAGCTC ACGCGAACGC TCGCTCAGTG GGTCAACTAC
GATCTGCCCG TGATCGTCCA CTTACAGGAG AACCGCGGCT TCGCCGGCGA CGTCGAGAAC
TTCGACTTCC CGAGCGAGGA CGACTTTCGC ATGGATCGCT CCAATCCGGG ACCGAACGCG
CTGCTGAACG GCCACATTAC GACTAACTAA
 
Protein sequence
MRHNHTRVNR RQFIGASAGT VAATFAGCLG SDSDSTEFVT AFAGGRQPTQ VHFNPWNASD 
YAQTYSIYWL QGTVVTHADG TVSTDFFEDL SVDGREVTLE FSDEWSYWNG NDITAEDYLI
EQEIWRYQDP EASPIEGHEL VDEYTVKRIY KDDISPTIAK SNAGAGTSAP KSVFREYYER
YEDATTESER EAVTDDLLQL TIDTEEFVEE GYGSSLFKIE DFNSSETLAT KWDDHPWADR
TDIEQIRVKP TEGTQVEQLE KSDELDMTQY ITEDQRSDYP DNIENIYELD HYSCRKYILN
WNNEHLARRP VRRAIISAID IPSIVDAANQ TGLMATPTQV QTGIRASIEE KYLGEDFVDS
LIDYPVEADE ETAVEYMEEA GYSREGGEWI GPDGNATDFT AITQAGVRKS QPMKVFTDHL
NEFGFNVEME AVGQDYYSRV QEWEFDIAWM WHVALPYWHP VAYFSNDFYG LLAGDVTSDS
DTGPTGVPFS LEIPEEVGAT EVGGNGVEIN PAQLMVDLEG ASSEEETIEL TRTLAQWVNY
DLPVIVHLQE NRGFAGDVEN FDFPSEDDFR MDRSNPGPNA LLNGHITTN