Gene Dtur_0437 details


Gene Information

Locus tag: Dtur_0437
Symbol:
ID: 7082458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dictyoglomus turgidum DSM 6724
Kingdom: Bacteria
Replicon accession: NC_011661
Strand:
Start bp: 444103
End bp: 445377
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 36%
IMG OID: 643457526
Product: extracellular solute-binding protein family 1
Protein accession: YP_002352353
Protein GI: 217966847
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
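
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the listed gene sequence ends in TAG). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 444103
END_BP = 445377
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)

print(computed_length, computed_length == GENE_LENGTH_BP)
print(computed_protein, computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions the computed values agree with the recorded gene length (1275 bp) and protein length (424 aa).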


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000010114
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGT ACTTTAAGTT AACAATTTTA GTATTAGTTT TAGTTAGTTT TATCCTTGCT 
ATTTCTCCAG CTCAACAGCA GATTACATTA AGAATTATCT GGTGGGGTTC TCAGGATAGA
CATAACAGAA CTTTGAAAGT AATAGAGCTT TTCCAAAAGA AATACCCCAA CATTAAGATT
GTATCAGAAT ATACAGGATG GTCTGAGTAT TATACAAAAC TTACCACTAT GGCTGCAGGT
GGAAACCTTC CAGATATTAT GCAACAGGAC CACGCATATA TTAGGGGATG GGTAGAGAAA
GGCTTACTTT TACCTCTTGA TGATTTAGTA GCACAAGGTA TTATAAATCT TAAAGATGTA
GCAAAAAGCA TAGTCGATTC TGGAAGATTA AGTGGAAAAC TTTATGCCAT AAACTTAGGA
AATAACTCTC AAGCCTTTGC TATTGATCCG GAGGTATTTA GAAAGGCAGG AGTTCCTCTT
CCACCTACTT TATGGACATG GGATGATTTC AAGAGAATTG CAAGGATAAT TCATAGAAAA
CTTGGCATAT ATGGAGCAGC GGAGAACCTT GGCGATCATA ACATATTCAG AGTATGGACT
ATTGAAAACG GTGGATATCT TTTCAGCGAA GATGGTAAAT CCTTGGGATA CGAAGATGAT
AACGTATACG CAAGCTTCTA CAAGATGCTT CTTGAACTTC AAGACGAAGG TGTAATTCCT
TCAAGAGATG TAGAAGTTGC AAGGGGTAGT GTAAGTCCAG AGCAAAGATT TATATGTCTT
GGAAAGTCTG CAATGCAATT TACTTGGAGT AACCAGCTTA CAGCTATGAG CAAAGCTCTT
AAAGATAAAC CTTTGAAACT TTATATGATT CCAACACTAA ATGGAAAGGT TGGAAACTTC
TTAAAGCCAT CAATGTTTTT TGCAATCAAT GCTAAGACTA AATATCCCAA GGAAGCAGCA
ATGTTTATAA ACTTCTTTAT TAATGATATT GAGGCTGGAA AGATACTAAT GGCAGAGAGA
GGAGTGCCGG TATCCAAGAA AGTACAGCTT GCTCTAAAGC CAATTTTAAC TCCTGTGGAG
AAAGAAATAT TTAACTTCAT AGCTACAGTA GAAAAATATG GAGCTCCAAC TCCTGCTCCA
GACCCAGAAA GATGGCAAGA AATTTATAAC AATGTCTATA CTCCCCTTTA TGACCAAATA
ATGTATAAGA AGATTACTCC AGAAGAAGCT GCAAAGAGAT TTAGAGAACA AGTGACTCAA
ATACTTAGAA AGTAG
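
The GC content field of this record (36%) can in principle be recomputed from the gene sequence. The sketch below shows one way to do that, assuming the sequence is pasted in the 10-base blocks used here, so whitespace must be removed first. The function name `gc_content` is hypothetical, and the example call uses only the first block of the sequence rather than the full gene.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 2 of 10 bases are G or C.
first_block_gc = gc_content("ATGAAAAAGT")
print(first_block_gc)  # 20.0
```

Passing the full gene sequence to the same function gives a value that can be compared against the rounded percentage reported in the record.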
 
Protein sequence
MKKYFKLTIL VLVLVSFILA ISPAQQQITL RIIWWGSQDR HNRTLKVIEL FQKKYPNIKI 
VSEYTGWSEY YTKLTTMAAG GNLPDIMQQD HAYIRGWVEK GLLLPLDDLV AQGIINLKDV
AKSIVDSGRL SGKLYAINLG NNSQAFAIDP EVFRKAGVPL PPTLWTWDDF KRIARIIHRK
LGIYGAAENL GDHNIFRVWT IENGGYLFSE DGKSLGYEDD NVYASFYKML LELQDEGVIP
SRDVEVARGS VSPEQRFICL GKSAMQFTWS NQLTAMSKAL KDKPLKLYMI PTLNGKVGNF
LKPSMFFAIN AKTKYPKEAA MFINFFINDI EAGKILMAER GVPVSKKVQL ALKPILTPVE
KEIFNFIATV EKYGAPTPAP DPERWQEIYN NVYTPLYDQI MYKKITPEEA AKRFREQVTQ
ILRK
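
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the codon-to-amino-acid assignments of the standard genetic code, which NCBI translation table 11 shares; table 11 differs in its permitted alternative start codons, which this sketch does not handle (the gene here begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAAAAGT ACTTTAAGTT AACAATTTTA"))  # MKKYFKLTIL
# Last 15 bases, ending in the TAG stop codon.
print(translate("ATACTTAGAA AGTAG"))  # ILRK
```

Both fragments reproduce the corresponding ends of the protein sequence listed above; translating the whole gene sequence with the same function allows a full comparison.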