Gene Htur_1855 details


Gene Information

Locus tag: Htur_1855
Symbol:
ID: 8742449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 1929602
End bp: 1930927
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 67%
IMG OID: 646512433
Product: extracellular solute-binding protein family 1
Protein accession: YP_003403413
Protein GI: 284165134
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
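
The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1929602, 1930927)
print(length, protein_length_aa(length))  # 1326 441
```

Both values agree with the Gene Length and Protein Length fields of this record.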


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGCAG ACTCGGGTGT CGAGGGGATC ATACACAGCG ACGGGGACGG GCCCTCGGTC 
CAGCAGGCGC TCTGGGACGC CGGTCTCGAC GACGACATCA GCCTCGAGAT TCAGACCGTC
GTCAGCGACT CCGCGTCGCG GATGCAGACG GCCCAGTCGG CCCTCGAGGC GGGTCGCGCC
CCGCCGGACA TCCACATGAT GGACAGCGGC TGGACGGTCC CGTTCGTTCT GCGCAATCAG
ACGGTCAACC TGACCGAAGA GCTCCCCGAG GAGACGGTCT CGTTCGTCAA CGAGAACTAC
CTCGACGCGA TCCTCGAGAC GGCTCGCCAC CCCGAGTCCG GCGACCTCCA CGGGCTGCCG
CTGTTCCCGG ACCTCGGGTT CACTCTCTAC AGACAGGACC TGATCGAGGA CGCCGGCTAC
GACACCAGTA GTTGGGGGAC GGACCCGCCG CAGTGGGAAG AGTTCGCGAA CGCGGTCAGC
GACGCGAGAG ACCAGGCCGA CCTCAACTAC GGGTACACGA CGCAGGCGGC CGCCTACGAG
GGGCTGTCCT GCTGTACGTT CAACGAGGTG ATGACGAGCT GGGGCGGAGC GTACTACGGC
GGCGTGGACA ACCTCTTCAC CGCGGGCGAC CGCCCGGTCA CCGTCAACGA ACAGCCGGTC
ATCGACGCGA TTCGAATGAT GCGCTCGTTC ATCGAGGGCG AGAATCAGAA CACTCTCGAC
GGCTACGCCC AGATCAGTCC GTCGCCGATC GTCCAGTGGA CCGAACAGGA GTCGCTCAGC
CCGTTCGACG CCGGCAACGC CGTCTCGAAC CGGAACTGGT CGTTCGCGAT CGCCCAGACC
GGCGCGGAGG AGGCCTTCGG TGAGGACCTC GGCGTCACGA CGAGTCCGGT CGGGGTCCCC
GAGGAGGAAG CCGAGTTCGA AGGCACCGGC GGCACCGCCG CGGCGCTCGG CGGCTGGAAC
CTGGTCGTGA GCCCGTTCTC GGATCGCAAG GAGGAAGCGC TGCAGGTCCT CGAGGCGTTC
GCCAACGAAG AGGTGATGCT CACGATCTTC GAACTCGGGG GATACCTCCC GCCGAATCTC
GACCTGGTCG CGGAGGCCAG CCCGGACGAC GTCGGCCCGG TCGCCCGCTA CGGCGACGTC
GTGCAGGCGG CCAGCGACAA CGCGATTCCG CGGCCGGCGA CCGACCTCTG GCCCGAGCAG
TCGGCGCTGA TCTATCAGTC GGTCAATTCG GCCTACCGCG GCGCAAATGC GCCAGAGGCG
GCGATGAACG ATCTCGCGGA AGAACTCCAG CAGAGCGAAT CGGAGGTGCA AACGAATGGC
AACTGA
 
Protein sequence
MTADSGVEGI IHSDGDGPSV QQALWDAGLD DDISLEIQTV VSDSASRMQT AQSALEAGRA 
PPDIHMMDSG WTVPFVLRNQ TVNLTEELPE ETVSFVNENY LDAILETARH PESGDLHGLP
LFPDLGFTLY RQDLIEDAGY DTSSWGTDPP QWEEFANAVS DARDQADLNY GYTTQAAAYE
GLSCCTFNEV MTSWGGAYYG GVDNLFTAGD RPVTVNEQPV IDAIRMMRSF IEGENQNTLD
GYAQISPSPI VQWTEQESLS PFDAGNAVSN RNWSFAIAQT GAEEAFGEDL GVTTSPVGVP
EEEAEFEGTG GTAAALGGWN LVVSPFSDRK EEALQVLEAF ANEEVMLTIF ELGGYLPPNL
DLVAEASPDD VGPVARYGDV VQAASDNAIP RPATDLWPEQ SALIYQSVNS AYRGANAPEA
AMNDLAEELQ QSESEVQTNG N
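
The relationship between the two sequences above, and the GC content field, can be reproduced with a short script. This is a minimal sketch, not the database's own method: the helper names are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs mainly in which codons may act as start codons). An alternative start codon such as GTG or TTG would need special handling to yield M; that is not needed here because the gene starts with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = clean_sequence("ATGACCGCAG ACTCGGGTGT CGAGGGGATC")
print(translate(prefix))  # MTADSGVEGI
```

Passing the full gene sequence through `clean_sequence` and then `gc_content` or `translate` allows the GC content and protein sequence fields to be compared against the listed values.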