Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_0343 |
Symbol | |
ID | 8427278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 355018 |
End bp | 356631 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 645032741 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003189919 |
Protein GI | 258513697 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATGCAA AACAAAAATG GCTCAGGCAG TTAATTACAC TATTACTTAT GCTGGGCCTA ACAGTGGTTG TGCTGTCAGG TTGTGGCACA AACCGGGCTG TCGAGAAAAA TGCCGGCGGG TCAGGTGAAG TTGCTGTTTA TACTATCGCC GATTCAACGG GTGACTGGGG TTTTCCTTCT CCTTATACTC ACTATAACCG GGGACCTGGC TACGTAAGAA TGAGTTTTCT CTTTGACACA CTGGTGTGGA AAAACGATCG GGAGTATCTG CCTGGTCTGG CCGAAAAATG GCAATACCTG ACGCAGGAAA ATGCTTATTT GTTTAACCTG CAAAAAAATG TTACCTGGCA TGACGGGGAA AAATTTACTT CCGGAGATGT GCTGTTTACG TATAATTATG TTAAAGCCCA CCCTTATCAA TGGGCCGATG TCGGCATGAT TAAGAAAATC GAGGCTTTGG ACGATTATAC TGTAAAGATG TATTTAAACA AACCCTATGC TCCGTTCTTG GATACCGTGG TTGGCAGCAT GCCCATTTTG CCCGGGCATA TTTGGAAAAA TGTGCAAAAT CCAATGCAGT TCCAGAAGGA GGAGGCATTA ATCGGGACCG GTCCTTATAA GCTGTTGGAT TACAATAAAG AGCAAGGTAC ATACCTATAT GAGGCCTATG ATAATTATTA CCTGGGAAAA CCCCGGGTGA AGCAGTTAAA GTTCATTAAA ATTAGCAATG AAATGGTGGG GAATGCTTTA AAACAGAAAC AGGCTGACGC GGCGCAAGTT CCGCCGGAAC TGGCCAGTCA AATGGAAAAA GAAGGATTTA ATATTTTAAA AGGTTCTCAC GATTCGGTAG TCAAAATACA AATAAACCAC CGGAAAGAGC CCCTGTCCAA TAAAGAATTC CGGCAGGCGC TGGCTTATGC CGTAAACCGC CAGGAACTGT TGGATACTAC CCTGCGGGGT TATGGTCTGG TAGGCAATCC GGGCTTGGTG CCGCCGGATA ACAGCTGGTA TAATCCTCAA GTGGAACAAT ACTCTTATAA CCCGGTCAAA ACGGGGGAAA TACTTGCCAA ACTGGGATAT GTTAAAAAAG GAATGTATTT TGCGAAGGAC GGAAAACCGC TGGAACTGGA GCTTTTAATC AGTGGGGCAG GTTCAGCTAA TACTCCGGGA GTGCGCCAGG GCGAAATGAT TAAGGAGCAG TTGGAAAAGG CGGGCATAAA GGTAAATTTG CGCAGCCTGG ATCCCAAGAC ACTCGACAGC ATGGTGGGAG AATGGAAATT TGATCTGGCT TTAATCAGTC ACGGCGGAAT GGGCGGGGAA CCTAAAGTAT TAAACACAAT GATTACAGAT AAAAGCTTTA ATAGTGCCAG GTATCTGAAA AGTGAAGAAC TCAACAGTCT TTTGCAGCAG CAATTGGAGA AAATAAACCA GCAGGAGCGT AGAAAACTAA TCAATAGAAT TCAGGAAATT TATGCTCAGG AAATGCCTTC TTTACCTCTT TATTATCCTA GCAGCTATTG GGTTTATGAT AATCAGGTGA AACTTTTTTA TACTAAACAA GGTATTGGTA TCGGCGTTCC AATTCCTGCT AACAAAATGT CTTTTGTGAA ATAA
|
Protein sequence | MDAKQKWLRQ LITLLLMLGL TVVVLSGCGT NRAVEKNAGG SGEVAVYTIA DSTGDWGFPS PYTHYNRGPG YVRMSFLFDT LVWKNDREYL PGLAEKWQYL TQENAYLFNL QKNVTWHDGE KFTSGDVLFT YNYVKAHPYQ WADVGMIKKI EALDDYTVKM YLNKPYAPFL DTVVGSMPIL PGHIWKNVQN PMQFQKEEAL IGTGPYKLLD YNKEQGTYLY EAYDNYYLGK PRVKQLKFIK ISNEMVGNAL KQKQADAAQV PPELASQMEK EGFNILKGSH DSVVKIQINH RKEPLSNKEF RQALAYAVNR QELLDTTLRG YGLVGNPGLV PPDNSWYNPQ VEQYSYNPVK TGEILAKLGY VKKGMYFAKD GKPLELELLI SGAGSANTPG VRQGEMIKEQ LEKAGIKVNL RSLDPKTLDS MVGEWKFDLA LISHGGMGGE PKVLNTMITD KSFNSARYLK SEELNSLLQQ QLEKINQQER RKLINRIQEI YAQEMPSLPL YYPSSYWVYD NQVKLFYTKQ GIGIGVPIPA NKMSFVK
|
| |