Gene Information |
Locus tag | Dtur_0437 |
Symbol | |
ID | 7082458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dictyoglomus turgidum DSM 6724 |
Kingdom | Bacteria |
Replicon accession | NC_011661 |
Strand | + |
Start bp | 444103 |
End bp | 445377 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643457526 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002352353 |
Protein GI | 217966847 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
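The coordinate and length fields in the Gene Information block are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a complete CDS whose terminal stop codon is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(444103, 445377)
print(length_bp)                  # 1275
print(protein_length(length_bp))  # 424
```

Both values match the Gene Length (1275 bp) and Protein Length (424 aa) fields of this record.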
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000010114 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGT ACTTTAAGTT AACAATTTTA GTATTAGTTT TAGTTAGTTT TATCCTTGCT ATTTCTCCAG CTCAACAGCA GATTACATTA AGAATTATCT GGTGGGGTTC TCAGGATAGA CATAACAGAA CTTTGAAAGT AATAGAGCTT TTCCAAAAGA AATACCCCAA CATTAAGATT GTATCAGAAT ATACAGGATG GTCTGAGTAT TATACAAAAC TTACCACTAT GGCTGCAGGT GGAAACCTTC CAGATATTAT GCAACAGGAC CACGCATATA TTAGGGGATG GGTAGAGAAA GGCTTACTTT TACCTCTTGA TGATTTAGTA GCACAAGGTA TTATAAATCT TAAAGATGTA GCAAAAAGCA TAGTCGATTC TGGAAGATTA AGTGGAAAAC TTTATGCCAT AAACTTAGGA AATAACTCTC AAGCCTTTGC TATTGATCCG GAGGTATTTA GAAAGGCAGG AGTTCCTCTT CCACCTACTT TATGGACATG GGATGATTTC AAGAGAATTG CAAGGATAAT TCATAGAAAA CTTGGCATAT ATGGAGCAGC GGAGAACCTT GGCGATCATA ACATATTCAG AGTATGGACT ATTGAAAACG GTGGATATCT TTTCAGCGAA GATGGTAAAT CCTTGGGATA CGAAGATGAT AACGTATACG CAAGCTTCTA CAAGATGCTT CTTGAACTTC AAGACGAAGG TGTAATTCCT TCAAGAGATG TAGAAGTTGC AAGGGGTAGT GTAAGTCCAG AGCAAAGATT TATATGTCTT GGAAAGTCTG CAATGCAATT TACTTGGAGT AACCAGCTTA CAGCTATGAG CAAAGCTCTT AAAGATAAAC CTTTGAAACT TTATATGATT CCAACACTAA ATGGAAAGGT TGGAAACTTC TTAAAGCCAT CAATGTTTTT TGCAATCAAT GCTAAGACTA AATATCCCAA GGAAGCAGCA ATGTTTATAA ACTTCTTTAT TAATGATATT GAGGCTGGAA AGATACTAAT GGCAGAGAGA GGAGTGCCGG TATCCAAGAA AGTACAGCTT GCTCTAAAGC CAATTTTAAC TCCTGTGGAG AAAGAAATAT TTAACTTCAT AGCTACAGTA GAAAAATATG GAGCTCCAAC TCCTGCTCCA GACCCAGAAA GATGGCAAGA AATTTATAAC AATGTCTATA CTCCCCTTTA TGACCAAATA ATGTATAAGA AGATTACTCC AGAAGAAGCT GCAAAGAGAT TTAGAGAACA AGTGACTCAA ATACTTAGAA AGTAG
|
Protein sequence | MKKYFKLTIL VLVLVSFILA ISPAQQQITL RIIWWGSQDR HNRTLKVIEL FQKKYPNIKI VSEYTGWSEY YTKLTTMAAG GNLPDIMQQD HAYIRGWVEK GLLLPLDDLV AQGIINLKDV AKSIVDSGRL SGKLYAINLG NNSQAFAIDP EVFRKAGVPL PPTLWTWDDF KRIARIIHRK LGIYGAAENL GDHNIFRVWT IENGGYLFSE DGKSLGYEDD NVYASFYKML LELQDEGVIP SRDVEVARGS VSPEQRFICL GKSAMQFTWS NQLTAMSKAL KDKPLKLYMI PTLNGKVGNF LKPSMFFAIN AKTKYPKEAA MFINFFINDI EAGKILMAER GVPVSKKVQL ALKPILTPVE KEIFNFIATV EKYGAPTPAP DPERWQEIYN NVYTPLYDQI MYKKITPEEA AKRFREQVTQ ILRK
|
| |
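The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch for joining such a string back into a contiguous sequence and computing its GC fraction follows. The helper names are hypothetical, and only the first two blocks of the gene sequence are used as input here.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace used to group the sequence into blocks."""
    return "".join(blocked.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, copied from the record above.
first_blocks = "ATGAAAAAGT ACTTTAAGTT"
seq = join_blocks(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.2
```

Applied to the full gene sequence, the same function gives a value that can be compared with the GC content field of the record.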