Gene Htur_4399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4399 
Symbol 
ID8745027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp667896 
End bp669263 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content63% 
IMG OID646514936 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003405883 
Protein GI284167605 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0609802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAACTA GTGGCAGGGC AACGAATCGG AGTGATAGGT CCGAGAATGG CGATTCCGCA 
GCGATCACGA AGCACAGTTC GCGACGTCGA TTTCTGGCCG GCGTCGGCGG TGCGGCGGCC
GTAACCGGGC TCGCAGGTTG CGTCGGTGGT GGCGGTAACG ATAGCGACGT CAGCATCATC
GCCGTCGAGG GCGAAGGCCG ACTCGTGGAG AATTTGATCG ACGACTACGT CAGGGACGAG
ACGGATCTCT CCATCGACGT CACGTTATTC CCGTACGCGA ACCTCTACGA ACGCGTCAGC
AGCGTCCTCA CGACCGGCGG AACGGGGTAC GACGCCATCC TCATGGACGA CACGTGGTTC
CCGCAGTTCG CGGCGAACCT CGATCCGCTC GAGCAGTGGC TTCCCGACGG GCTGCCCACG
GAGCAACTCA TCGACACGAC GGTCGACATC ACCACGTGGC CGACGCCCGG CGCCCCGAAA
GTTCCGTCCG CCGAGGATAT GGACGAGAAG ATCCGCGGGC AGGTCGTCGT GGGGAACACG
CAGATGTTCG TCTACAACAC CGCCTACTAC GAGGAGGTCG GTGAAGAGGA GCCGAAGACG
TGGGACGATG TGTTACGCGC CGGGCAGAGC ATCGACGAGG AGATCGCGGA CACGAACGGG
TACGTGATCC GCGGTCAGCG CGGCAACCCG GCGAACACGA ACTTCATGAG CATCGGGTGG
TCGAACCTCG GAGACATGTT CGACGAGGAC TGGCGGTACC AGTGGGACTC CAGCGAGGGC
GAGGACGTTG TCAGTTTCTT CGTCGACGAT CTGCGATCGA TCTCGCCGGA CGGCGTCGGA
TCGTTCAACA GCGATCAGGT GCTGAATCGG ATCGGCGAGG GCTCGGCCGC CCAGGGGATG
GCGTGGCCGG CCGCGGCGTC GACGCTGCTC GACGACGACA CCGCAGAAGC CGACAATCTG
GAGTTCATTC CGATCCCGGA AGGCGAGGTA CAGCAGGCGC CGATGCAGGG CAACTGGCTG
CTCGGTATCA ACTCGAACAT CTCCGACGAC CGGAAGGAAG ACGCCGGCAC GGTCATCCAG
TCCATCATCT CCAAGGAGGC ACAGGACCGC TACGTGGAAC TCGGCGGCGT CCCCTTCCGC
CACGACACCT TCGAGGACAA CATGGACGCC GAGCCGTGGT ACGAAGCGCT GTACGAGAGC
CTGCAAAACG CCAAACCGCG GCCGCGGACG CCCCTCTGGA ACGAGATCGA CGTGACCCAA
GGGGAGTACC TCAACAGCGC ACTGACCGGC GACATGAGCC CGGCCGAGGT CGTGAGCGAA
ACCAAGAACG AGGTCGAGTC GATCCTCGAA AACGCGGGAT ACTACTAG
 
Protein sequence
MPTSGRATNR SDRSENGDSA AITKHSSRRR FLAGVGGAAA VTGLAGCVGG GGNDSDVSII 
AVEGEGRLVE NLIDDYVRDE TDLSIDVTLF PYANLYERVS SVLTTGGTGY DAILMDDTWF
PQFAANLDPL EQWLPDGLPT EQLIDTTVDI TTWPTPGAPK VPSAEDMDEK IRGQVVVGNT
QMFVYNTAYY EEVGEEEPKT WDDVLRAGQS IDEEIADTNG YVIRGQRGNP ANTNFMSIGW
SNLGDMFDED WRYQWDSSEG EDVVSFFVDD LRSISPDGVG SFNSDQVLNR IGEGSAAQGM
AWPAAASTLL DDDTAEADNL EFIPIPEGEV QQAPMQGNWL LGINSNISDD RKEDAGTVIQ
SIISKEAQDR YVELGGVPFR HDTFEDNMDA EPWYEALYES LQNAKPRPRT PLWNEIDVTQ
GEYLNSALTG DMSPAEVVSE TKNEVESILE NAGYY