Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtur_1594 |
Symbol | |
ID | 7082029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dictyoglomus turgidum DSM 6724 |
Kingdom | Bacteria |
Replicon accession | NC_011661 |
Strand | - |
Start bp | 1608798 |
End bp | 1611683 |
Gene Length | 2886 bp |
Protein Length | 961 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643458702 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002353481 |
Protein GI | 217967975 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTGAGGT TTAAAGAAGC TATGAAACCG TTTTTAAGAT TTGTGCTTAT ATTGCTCTTT TTATTGGGAT TAATTACAAA GATAATTTTT GCTCAAGGCT CCTCAGCTAA AAATCTAAAT GTTATCTTTG AACTTATAAG TAAAGAAAGT TCGTATGAAA CTTATTTATC AAAATTTTCT AATAAAAATA GACCTGAAGT AGTCTTGGTC ATACCTGCAG TAAATTATTC TGCTTACTCT AAAGATATGG AGTTAAAGAA GCTCACAAAT TTAACCCAAG ATAAACATCC TGTTCTTTAT ATGGGTGAGA ATGGTTTTGT AGAGTGGACT TTTAATATTG AGGAAGAGGG TTTGTATAAC ATTGCTGTGA AATATTATCC TGTACCTGGA AAAAATTCTG CTATAGAGAG AGAAATACTA ATTGATGGTA AGAGACCCTT TAACGAAGCA AGAATTGTTA GATTTGAAAG GATATGGAAA GATGCTGGAG ATCCCTTGAG AGATAATAGG GGTAATGAAT TGCGGCCATT GCAAGTGGAG TTTCCTATGT GGGTGGAGAA GGTTATTGAT GATTCTGATG GTTTATATTC TGAGCCATTT CTTTTCTATT TTTCAAAGGG AAGACATACT CTGAGATTTG TTTCAGTAAA AGAACCTGTA GCAATAGATT ATATCAAAAT TTTTAATCTT AAAGATATTC CCTATTATAG AGAGGTAATG AAAGAAGAGC ATATCTCTAA AAGCAACTTT AAGAATATTA TTGTAAAAAT TCAAGGAGAA AAAGCTCACT TAAAATCAGA GCCAACTCTT TATCCTGTGT ACGATATGAG CAATCCTCTT AATGAGCCTT ATAGTTCTAA AAACAAACTC TTAAATATAA TAGGAGGGTA TAACTGGAGA TCTGCCGGAC AATGGATTGA GTGGAAGTTT ACAGTACCAG AGAGTGGCTT TTATAAGATT GGGTTTAAAT TTAGACAAAA TGCAAATCCC GGTATACCAT CAGAGAGAAC TCTTTATATA GATGGGGTTG TTCCTTTTAG AGAGGTAAGA AATATAAAAT TCAAATATGA TACTAAATGG CAATTCAAGT ATTTGGGAGA TGGAAAAGAA GATTATCTTT TTTATTTAAA AAAGGGAGAA CACACCCTTA GGCTAAAAGT GAGTTATGAA AGTATAGCAG AAATTATGAG AAATGTATTA CAATGCTCCA TAGATTTATC TCAACTTTAT ACCAAAATTG TAATGATTAC CTCTCCAAAT CCTGACCCAT ACAGAGATTA TCTCTTAGAA CAGAGTATTC CAGACCTTAT TCCTACTTTG GAAAGGAATG CAAAGATATT AAAAGAAAAT GCTGAAAAAT TAAAAATTCT TGGAGGGGAA AAAGTTAGTG AAGCAGCAAC CCTTGAAAGA GTGGCTATTC AGCTTGAAGG AATGGCTAAG GAACCGGAGA CTATTGCTCA AAGATTACAA AGATATAGAG ATAATCTATC GGCACTTTCT GCGTGGGTAC TTGCTATTAG AGAGCAACCT TTAGATATTG ATTACATAAT TATTGCTTCT CCAGATATGA AATCTCCAAG AGTAAATCCA AACATCCTTG AGGCATTTTT AGATGGAATA AAGAAGTTTT TCTATTCCTT TTTGGAAGAT TATAACATGA TTGGAACTGT GTATGACAAA GAAAAAGCAA TAAATGTATG GGTTCAAATG GGAAGAGATC AAGCAGAGAC CCTAAAAATG CTTATAGATA CAGATTTTAC TCCTAAGACA GGAATAGGAG TTAACCTAAA CATTATAACC ACAGAAGCAG CTTTACTTTT CTCGGTTGCT TCTAAGGAGA ATCCTCCTGA TGTGGCTTTA AATGTCCCAA GAGGGCTTGC TGTAGATTAT GGGATAAGAG GAGCTCTTGT GGACATTTCA AAATTATCTG GTTTTGAGGA GGTAAAGAAG CAGTTTGCTC CTTACGCTCT TGTTCCTTAT AGCTTTGGTG GAAAAGTTTA CGGACTTCCT ATGACTCAAG ATTTTCCCAT AATGTTTTAT AGAGCTGATA TATTGGGACG TTTAAATATT GAAATTCCCA ATACTTGGGA AGAACTTTAT GAAACCATTG CTAAACTGCA GAGTTATAAC TTACAGTTTG CAGCGGGAAC AGGTGGTACA AGTTTTGATA TTTTTAACAT GCTTCTTCTT CAGAGAGGTG GAAGATATTA CACTGAAGAT GGCAAAAGAT GTGTTCTTAA TAATGAAGAG GGTGTGACAG CATTTAAAGA ATGGACCAAT TTGTATGTAC TTTATGGTAT TCCTCTTTAT TATGATTTCT TCAACCGTTT TAGAACAGGA GAAATGCCAT TGGGAATTGG TCCATATACT ATGTATAACC AATTTAAAGT TGCTGCTCCT GAAATAAGTG GTCTTTGGGG AATTGCTCCT GTTCCCGGAA GAAGAAAAAA TGATGGTTCA ATTGACAGAA GTGTAGCTGG AGGTGGAAAT GCCATATTGA TTTTTGCTCA GACTAAGAAG CTTAAAGAGG CATGGGAATT TGTTAAGTGG TGGGTCTCTA CTGATGTTCA GGCAAGATTT GGTAGAGAAC TTGAAGCAGT TCTTGGAGCG GGAGCCAGGT ACAATACTGC CAATATAGAG GCCATGAGTT ATTTGCCATG GCCTTCTTCA GATTACAAAA TCCTCTCTAC TCAATGGAAA TATTTAAAAG AAATTCCTAA TGTTCCTGGA AGTTATTATG TTTCAAGACA TTTAGACAAT GCCTTTAGAG AGGTGGTTAT GCTTGGTGAA ATTCCAAGAG AGGCAATAGA AAAATACACA AGAGAGATTA ACAAAGAGAT TGATAGGAAG AGAGAAGAAT TTGGTTTAGA GCTTGCAAAA GATTGA
|
Protein sequence | MLRFKEAMKP FLRFVLILLF LLGLITKIIF AQGSSAKNLN VIFELISKES SYETYLSKFS NKNRPEVVLV IPAVNYSAYS KDMELKKLTN LTQDKHPVLY MGENGFVEWT FNIEEEGLYN IAVKYYPVPG KNSAIEREIL IDGKRPFNEA RIVRFERIWK DAGDPLRDNR GNELRPLQVE FPMWVEKVID DSDGLYSEPF LFYFSKGRHT LRFVSVKEPV AIDYIKIFNL KDIPYYREVM KEEHISKSNF KNIIVKIQGE KAHLKSEPTL YPVYDMSNPL NEPYSSKNKL LNIIGGYNWR SAGQWIEWKF TVPESGFYKI GFKFRQNANP GIPSERTLYI DGVVPFREVR NIKFKYDTKW QFKYLGDGKE DYLFYLKKGE HTLRLKVSYE SIAEIMRNVL QCSIDLSQLY TKIVMITSPN PDPYRDYLLE QSIPDLIPTL ERNAKILKEN AEKLKILGGE KVSEAATLER VAIQLEGMAK EPETIAQRLQ RYRDNLSALS AWVLAIREQP LDIDYIIIAS PDMKSPRVNP NILEAFLDGI KKFFYSFLED YNMIGTVYDK EKAINVWVQM GRDQAETLKM LIDTDFTPKT GIGVNLNIIT TEAALLFSVA SKENPPDVAL NVPRGLAVDY GIRGALVDIS KLSGFEEVKK QFAPYALVPY SFGGKVYGLP MTQDFPIMFY RADILGRLNI EIPNTWEELY ETIAKLQSYN LQFAAGTGGT SFDIFNMLLL QRGGRYYTED GKRCVLNNEE GVTAFKEWTN LYVLYGIPLY YDFFNRFRTG EMPLGIGPYT MYNQFKVAAP EISGLWGIAP VPGRRKNDGS IDRSVAGGGN AILIFAQTKK LKEAWEFVKW WVSTDVQARF GRELEAVLGA GARYNTANIE AMSYLPWPSS DYKILSTQWK YLKEIPNVPG SYYVSRHLDN AFREVVMLGE IPREAIEKYT REINKEIDRK REEFGLELAK D
|
| |