Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtur_0082 |
Symbol | |
ID | 7082349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dictyoglomus turgidum DSM 6724 |
Kingdom | Bacteria |
Replicon accession | NC_011661 |
Strand | + |
Start bp | 83294 |
End bp | 85204 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643457207 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002352034 |
Protein GI | 217966528 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.5866 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGC TCCTTTTTAT TTTCCTTCTT CTCTTTCTCT TAATTGTTTC CTCTGGAAGC TTTGCCCAAG GTCTTCCTGC TGGTATTCCT AGAGATAAAA CCTTAATTCT ACCATTCTTA TTTGCCCCCC TTCCTGTTCC TGGAAACTGG AACTTATGGG CTGGCTGGAG GGCACAAAAC TGTGGATTAC ATCAATTTGT TACAGAACCT CTTTGGACAA TAAACCCCGA CGTTGTTAAA GGGGGAATAA TCAACGCATT AGCAGCGGCG GACCCTATTT ATAACAGTGA TTTTACAAAA TTAGTAATAA AGCTTAGAAA AGGAATATAT TGGAGTGATG GTGTTGAATT TACTGCCGAT GATTTAGTCT ATACTATTCA AACAGTAAAA GATACTGAGG GGCTTGACTA TCATGGAGCA ATGCAAGATG TAAAAAGGGT TTATGCTCCA GACAAATATA CAGTGGTAGT AGAACTTAAC AGACCCAACA GTAGATTCCA TACTTTCTTC CTTGAAAGAT GGGGCGCCCT AAGACCTATG CCAAAACATA TTTTTAGCAA AGTCAAAGAT GTACTTACTT ACGATTTTAA CCCTCCAGTA AGTTTAGGAC CCTATGTGTA TGTAGCCCAT GACCCAGCAG GATATTGGGT GCTTTGGAAA AAGAGAGAGG ATTGGAAGAG AACAGTTACT GGACAACTGT TTGGTGAACC AAAACCTGAG TATGTATTAT TTATAAACCA TGGTACACCA GAGAAACAAA ACATGGCTAT GCTGAGACAC GAACTTGACG CAATACAAGG TACAGCAGAA CAAATGCTAA CAATGTTAAA AATGAGTAAT ACCACAAGAA GCTACCGTGA TACATGGCCC TATATTGATC CAAGAGACAT ATCTACCAGA GGGCCAGCCT TTAACCATTT AGTACCCCCA TACAATAATA AAGACGTAAG ATGGGCACTT GCATTATCCA TAGATGTGGT TGAAACAATT ATATCTACAT ATAATGGAAT GGCTGCAATG ACACCAGGAC TTCCTCTCGT AGTTGGTAAA AACTTCTATG ATTGGTATTT CAAACCCTTA AAACCATGGC TTGAAAACCT AACATTAGAT ATTGGCGGTG GACAAAAATT TAAACCTTGG GATTCAAAGG CACCTTGGAG ACTATTAGAA TGGGCAAAGA AAAACTACAA AGTAGATATT GATCCTAATA ATGAAGAAGA GGTAAGGCTC ACATTAGGTT ATGGATGGTG GAAATATGCA CCTGATGTGG CAGAGAAATT ACTAATAAAG AATGGATTTA AGAAGATAGG AGGAAAATGG TATTTGCCCG ACGGCAAACC TTGGAAAATT ACTATATTAA GAGGTCCAGA TCCAACAGAC ATGGCAAATA TCTTCATTCT TGGTGTAGCT GAGCAGTGGA AAAAGTTTGG TATAGATGTG GTCTTCAATG TGACAACTGC AGCAGCTACA TTGGCCTTAA ATGGACAATT TGAAGTACTC AATACAGGGC ATGGCGGATT TGCAGGAGAG CCTTGGGGAT TGCATCCTGA TTTATACTTG TGCTTTAATG CTATGAATAG TAAATATGTC AGACCTGTAG GAGAACCAAC CCTCGGTTGG GCTGGAAGAT GGAGTGATCC TCGCATGGAT AAAATAATTG CTGAATTAGA AAGAACTCGT TGGACTGACT TTCCAAAAAT CAAAAAACTT GGCTTAGAAG GTTTGAAAGT AGAAATTGAA GAAATGATAT CAATTCCAAT AATTAACTGT CCAATAGCTA TAGTGTTTGA TGAATATTAT TGGACCAACT GGCCAAGCCC TAAGAATGAT TATGCACGTT GCGACAATTT CACCACATGG CCACAGTTAA AATATATCTT GCATCAAATA AAACCCACTG GTAAAAAGTA A
|
Protein sequence | MKKLLFIFLL LFLLIVSSGS FAQGLPAGIP RDKTLILPFL FAPLPVPGNW NLWAGWRAQN CGLHQFVTEP LWTINPDVVK GGIINALAAA DPIYNSDFTK LVIKLRKGIY WSDGVEFTAD DLVYTIQTVK DTEGLDYHGA MQDVKRVYAP DKYTVVVELN RPNSRFHTFF LERWGALRPM PKHIFSKVKD VLTYDFNPPV SLGPYVYVAH DPAGYWVLWK KREDWKRTVT GQLFGEPKPE YVLFINHGTP EKQNMAMLRH ELDAIQGTAE QMLTMLKMSN TTRSYRDTWP YIDPRDISTR GPAFNHLVPP YNNKDVRWAL ALSIDVVETI ISTYNGMAAM TPGLPLVVGK NFYDWYFKPL KPWLENLTLD IGGGQKFKPW DSKAPWRLLE WAKKNYKVDI DPNNEEEVRL TLGYGWWKYA PDVAEKLLIK NGFKKIGGKW YLPDGKPWKI TILRGPDPTD MANIFILGVA EQWKKFGIDV VFNVTTAAAT LALNGQFEVL NTGHGGFAGE PWGLHPDLYL CFNAMNSKYV RPVGEPTLGW AGRWSDPRMD KIIAELERTR WTDFPKIKKL GLEGLKVEIE EMISIPIINC PIAIVFDEYY WTNWPSPKND YARCDNFTTW PQLKYILHQI KPTGKK
|
| |