Gene Htur_2649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_2649 
Symbol 
ID8743262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp2718809 
End bp2720011 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content65% 
IMG OID646513237 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003404198 
Protein GI284165919 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAATGA AACGCAGACC AGTACTCAAG GGGATCGGCG GTACCGTCGC GGGACTCTCG 
CTCGCGGGTT GCATGGGCTA CTTCACCGGG GACGACACGT CGCCGCTGTG GCACGAGTTC
ACCGATTCTG AGGAGCGCAC CTTCGAGAGT CACCTCGAGA CGTTCACCGA GGAGACCGAC
CACGACCTCG AGGCCTCCGG CGTCTCGAAC ATGCAAGATC AGTTAGAGAC CGCCCTCCCG
GCGGGCGACG GGCCGATGAG TTTCACCTGG GCACACGACT GGATCGGCGC CCAGCATGAA
GACGAGACCC TCTATGACGC ATCTGACTCG ATCGACGTCG ACCTCGAGGG AACGTACTCG
GAGGCGGCGG CCAACGCGGT TCAGTGGAAG GACAACGTGT ACGGACTCCC CTACGCCGCG
GAAACGGTGA CGCTGATGTA CAACAAGGAT ATGGTCGAGG AACCGCCGGA GACGATCCCC
GAGATGATCG AGATCATGGA GTCGTACGAC GGCGACGACC AGTACGGCAT CGGCTATCCG
GGGGACGCGT ACCACTTCAG CGCCTACCTG CAAGGGTTCG GCGGCGTGCT CTACGACGAG
GACGCCGACG AACTGGGAAT CGACGACGAT GCGGTCGTCG AGGGGCTCGA ACTCGTCCGG
GACAGCATCT ACGAGTACAG TCCGAACGAC CTGAACAAGG ACCCGAACCT CTCGGTCTTC
CAGAACGGAA ACGCGCCGTT CGTCGTGACC GGCCCGTGGA ACCTCGGCGG GCTCCGCGAT
GCGGGCATCG ACGTCGGGGT CGCGCCGCTG CCTGCGCCCG AGGGCGGAGA ACCGACGCCG
TTTACGGGCG TTCAGATGTG GTACTTCACG TCTCGCCTCG AGGACGCGGA GGACGACGTC
CACGACGCGG TGCTCGACTG GGCGGAGTGG TACACCACGA CCGAGGACGT CGCCACGACC
AACGCACAGG ACCACGCGAT GATTCCCGTC CTCGACTCGG TCGTCGGGAG CGACGACCTC
GGTTCGGACG TCGACGCGTT CAGCCAGAGC GTGGGCATGG GGATGTCGAT GCCCGCGAGC
GAGAAGATGG ACGCCGTCTG GGACCCGCTC GAGTCCGCGA TCGACGTCGT GCTCGGCTCG
GGCGGCGACG CCCGAGAGGA ACTGGAGTCG GCCGCCGAGC AGATCCGAGG CTCCTGGGAG
TAA
 
Protein sequence
MPMKRRPVLK GIGGTVAGLS LAGCMGYFTG DDTSPLWHEF TDSEERTFES HLETFTEETD 
HDLEASGVSN MQDQLETALP AGDGPMSFTW AHDWIGAQHE DETLYDASDS IDVDLEGTYS
EAAANAVQWK DNVYGLPYAA ETVTLMYNKD MVEEPPETIP EMIEIMESYD GDDQYGIGYP
GDAYHFSAYL QGFGGVLYDE DADELGIDDD AVVEGLELVR DSIYEYSPND LNKDPNLSVF
QNGNAPFVVT GPWNLGGLRD AGIDVGVAPL PAPEGGEPTP FTGVQMWYFT SRLEDAEDDV
HDAVLDWAEW YTTTEDVATT NAQDHAMIPV LDSVVGSDDL GSDVDAFSQS VGMGMSMPAS
EKMDAVWDPL ESAIDVVLGS GGDAREELES AAEQIRGSWE