Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3801 |
Symbol | |
ID | 7874043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4193904 |
End bp | 4194950 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643700743 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_002890767 |
Protein GI | 237654453 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGCTT CAAGGTCACT ACTAATCACC GGCGGCACCG GTTCCTTCGG CAACGCCGTC CTCAAGCGCT TCCTCGACAC CGACATCGGC GAGATCCGCA TCTTCAGCCG TGACGAGAAG AAGCAGGACG ACATGCGCAA GCGCTACAAC AGCGCCAAGC TCAAGTTCTA CATCGGCGAC GTGCGAGACC AGCGCAGCGT GGAGCAGGCG ATGCGCGGGG TGGACTTCGT CTTCCACGCC GCGGCGCTCA AGCAAGTGCC GTCGTGCGAG TTCCACCCAA TGCAGGCGGT GCGCACCAAC GTGCTGGGCA CCGAGAACGT GCTCGAGGCG GCGATCGCCG CCGGGGTCAA GCGCGTGGTG GTGCTGAGCA CCGACAAGGC GGTGTACCCG ATCAACGCCA TGGGCATTTC CAAGGCGATG ATGGAGAAGG TGATGGTGGC CACCAGCCGC AACCTGGAAG GCACCGGCAC GGTGATCTGC GGCACGCGCT ACGGCAACGT GATGGCCTCG CGCGGGTCGG TGATTCCGCT GTTCGTCGAG CAGGTGCTGG CGGGCAAGCC GATCACCATC ACCGACCCGA GCATGACGCG CTTCATGATG ACGCTGGCCG ATGCAGTGGA TCTGGTGCTG TATGCCTTCA CCAACGGCAA CAACGGCGAC ATCTTCGTGC AGAAGGCGCC GGCGGCGACC ATCGAGACGC TGGCGCGCGC GGTGACCGGG CTGATGGGCC AGCCCGCACA CCCGGTCAAC ATCATCGGCA CCCGCCATGG CGAGAAGCTC TACGAGGCGC TGCTGAGCCG CGAGGAGCGT GCCTGCGCCG AGGACATGGG CGACTACTTC CGCGTGCCGG CCGATGGGCG CGACCTCAAC TACGGCAAGT TCGTGGACCA GGGCGAGGCG AAGCTGACGC AGACCACGCA CGGTGAGGAC TACAACTCGC ACAACACCAC GCGGCTGGAC GTGGACGGCA TGACGCAGCT GCTGCTGAAG CTCGAAGGCA TGCAGCGCAT AGCCCGCGGC GAGACGACCA CCGTCGAGGA GTGCTGA
|
Protein sequence | MFASRSLLIT GGTGSFGNAV LKRFLDTDIG EIRIFSRDEK KQDDMRKRYN SAKLKFYIGD VRDQRSVEQA MRGVDFVFHA AALKQVPSCE FHPMQAVRTN VLGTENVLEA AIAAGVKRVV VLSTDKAVYP INAMGISKAM MEKVMVATSR NLEGTGTVIC GTRYGNVMAS RGSVIPLFVE QVLAGKPITI TDPSMTRFMM TLADAVDLVL YAFTNGNNGD IFVQKAPAAT IETLARAVTG LMGQPAHPVN IIGTRHGEKL YEALLSREER ACAEDMGDYF RVPADGRDLN YGKFVDQGEA KLTQTTHGED YNSHNTTRLD VDGMTQLLLK LEGMQRIARG ETTTVEEC
|
| |