Gene Htur_1813 details


Gene Information

Locus tag: Htur_1813
Symbol:
ID: 8742407
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 1887698
End bp: 1888933
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 66%
IMG OID: 646512391
Product: extracellular solute-binding protein family 1
Protein accession: YP_003403371
Protein GI: 284165092
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
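
The start, end, gene length, and protein length fields of this record are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record itself does not state: coordinates are 1-based and inclusive, and the protein length excludes the terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Coordinates copied from the record.
START_BP = 1887698
END_BP = 1888933


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming one trailing stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1236
print(protein_length(length_bp))  # 411
```

Both values agree with the Gene Length and Protein Length fields listed for Htur_1813.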


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTAAGG GTGCAGGGAT AGCAGGGACG GCGAGTCTCG CGGCTCTCGC GGGCTGTACC 
GGCGGTGGCG GCAGTACGCT CGAGGTCCTC CACGGATGGA CCGGCGGCGA CGGCGCCGAG
GCCGCCGACG CGCTCTTCTC GGCGTTCGAA GAGGAGCACT CGGACGTCGA CTACAACGAG
AAGCCGATCG GTGGCGGTGG GAACACGACG CTCGACCAGA CGGTCGCCAA CCGCCTCCAG
GGCGGCGACC CGCCGAGTTC GTTCGCCGGC TGGCCGGGTG CGAACTTAGA GCAGTACGAG
GACGCCGTCG GCGACATCGA GTCGGAGGTC TGGGACGAGG CTGGCCTGAA GGACGCCCAC
GTCCAGGAAG CGGTCGAACT CTGCCGGCAC AACGACGGCT TCTCGGCGGT CCCGCTCGGC
TCCCACCGCC TGAACGACCT CTTTTACAAC GTCGAGGTCG TCGAGAGCGC CGGCGTCGAT
CCGAGCTCGA TCGACAGCGC CGACGCGCTG ATCGACGCGC TGGACGCCGT GGAGTCGGAG
ACCGACGCGA CGCCGTTCGC GTTCTCGCTC GCGCCGTGGT GTATCCTCCA GACGTGGGCG
CAGACGATGC TCGGCGAACA CGGCTACGAG GCCTACATGA ACTTCATCGA GGGCAACGGC
GACGAGAGCG CCGTCCGCGA CACCTTCGAG AAGCTCGAGC AACTCCTCGG CTACATTAAC
AACGACGCAG CCTCCGTCGA CTTCACCGAG GTCAATCAGG ACATCATGAG CGGCGACGCC
GCGTTCATCC ACCAGGGCAA CTGGGCCGCC GGCGCGTACA TCTCGGGCGA CCAGGATATC
GAGTACGGCA CCGACTGGGA CGCGATCCGA TACCCCGGGA CGGAGGACTA CTACACCCTC
CACATCGACT CGTTCATCTA CCCGAGCGAC AATCCGACGC CCGACGACAC CGCGACTTGG
CTGCAGTTCG TCGGCTCCGA GACGGCACAG GTCGCGTTCA ACCAGTACAA GGGGTCGATC
CCGACGCGGA CGGAGGTGTC CACCGACGAG TTCAACGCCT ATCTCACGGA CACGATCGAG
GACTTCGATA ACGCCTCGGA GAAGCCGCCA ACGCTCGCAC ACGGACTCGC CGTGGATCCG
AGCACACAGG CCGACCTCGA GGACGTCCTC AACAACAGCT TCGCGGACCC CTACGACGTC
GACGGTGCGA CGAGCGGGTT CATGGACGCC GTCTAA
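
The GC content field can in principle be recomputed from a sequence laid out in 10-base blocks like the one above. The following is a minimal sketch, assuming GC content is simply the fraction of G and C bases among all bases; how the source database computes or rounds its reported 66% is not stated, so this is not claimed to reproduce it exactly. The function name `gc_content` is hypothetical.

```python
def gc_content(sequence):
    """Return the fraction of G/C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Small illustrative input (not taken from the record): 6 of 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence, with its spaces and line breaks left in place, would give the value to compare against the GC content field.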
 
Protein sequence
MLKGAGIAGT ASLAALAGCT GGGGSTLEVL HGWTGGDGAE AADALFSAFE EEHSDVDYNE 
KPIGGGGNTT LDQTVANRLQ GGDPPSSFAG WPGANLEQYE DAVGDIESEV WDEAGLKDAH
VQEAVELCRH NDGFSAVPLG SHRLNDLFYN VEVVESAGVD PSSIDSADAL IDALDAVESE
TDATPFAFSL APWCILQTWA QTMLGEHGYE AYMNFIEGNG DESAVRDTFE KLEQLLGYIN
NDAASVDFTE VNQDIMSGDA AFIHQGNWAA GAYISGDQDI EYGTDWDAIR YPGTEDYYTL
HIDSFIYPSD NPTPDDTATW LQFVGSETAQ VAFNQYKGSI PTRTEVSTDE FNAYLTDTIE
DFDNASEKPP TLAHGLAVDP STQADLEDVL NNSFADPYDV DGATSGFMDA V
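
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. A minimal sketch of that translation follows; it builds the codon table from the amino-acid string that NCBI publishes for table 11 (identical to the standard table for sense codons) and ignores alternative start codons, which is a simplification. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for table 11, in NCBI order: codons enumerated over T, C, A, G
# with the first base varying slowest and the third base fastest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the gene sequence in this record.
print(translate("ATGCTTAAGG GTGCAGGG"))  # MLKGAG
print(translate("GACGCCGTCTAA"))         # DAV
```

The two outputs match the first six and last three residues of the protein sequence shown above.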