Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2239 |
Symbol | |
ID | 7083671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2522728 |
End bp | 2523804 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643699258 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002355874 |
Protein GI | 217970640 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00478334 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCACCC AGACCGAAAA CCTCAACGTC CTCGCCTTCG ACCACATGCC CTCGCCGGAC GAGGTGAAGG CGCGCGTGCC GCTGACCGAG CGCGCCGCGG CTGCGGTGGT GGCCGGGCGC AAGGCGGTGA TGGACATCCT CGATCGCAAG GATCCGCGCG TGTTCGTGGT GGTCGGGCCG TGCTCGATCC ATGACCCAGT GGCCGGGCTG GATTATGCGC GACGGCTGAA GGCGCTCGCC GACGAGGTCT CCGACGTGCT GCTGCTGGTG ATGCGGGTGT ATTTCGAGAA GCCGCGCACC TCGACCGGGT GGAAGGGCTA CATCAACGAT CCGTTCATGG ACGACTCCTT CCGCATCGAC GTGGGCATGG AGCGTGCGCG CAAATTCCTC CTCGACGTGT GCGAGCTCGG CCTGCCCACG GCCACCGAGG CGCTCGACCC GATCGCACCG CAGTATTACG GCGACCTCAT CGCCTGGACC GCGATCGGCG CGCGCACCTC CGAGTCGCAG ACCCATCGCG AGATGGCCTC GGGCCTGTCG ACGCCGGTCG GCTTCAAGAA CGCCACCGAT GGCGACCTCG AGGTGGCGAT CAACGCGATC ATTTCGGCCG GCAGTCCGCA CAGCTTCCTC GGCATCAACA GCCAGGGCCA GTCGGCGGTT ACCCGCACGC GCGGCAACCG TTACGGCCAC GTGGTGCTGC GCGGCGGCGG CGGCCGGCCC AACTACGACA CGGTGTCGGT GTCGCTGGCC GAGCAGGCGC TCGCGAAGGC CAAGCTGGCG AAGAACATCG TGGTCGATTG CTCGCACGCC AACTCGTGGA AGAAGCCCGA ATACCAGCCC CTGGTGATGA AGGACGTGAT GCATCAGATC CGCGAGGGCA ACCAGTCGAT CGTCGGCCTG ATGATCGAGA GCAATATCGA AGCCGGCAAC CAGCCGATTC CGGCCGACCT GTCGCAGCTC AAGTACGGCT GTTCGGTCAC CGATGCCTGT GTCGATTGGG CGACGACCGA GGACATGATC CGCAAGTCCG CCGCCGTGCT GCGCGACGTG CTGCCGAAGC GGGAGCGGCG CGCATGA
|
Protein sequence | MPTQTENLNV LAFDHMPSPD EVKARVPLTE RAAAAVVAGR KAVMDILDRK DPRVFVVVGP CSIHDPVAGL DYARRLKALA DEVSDVLLLV MRVYFEKPRT STGWKGYIND PFMDDSFRID VGMERARKFL LDVCELGLPT ATEALDPIAP QYYGDLIAWT AIGARTSESQ THREMASGLS TPVGFKNATD GDLEVAINAI ISAGSPHSFL GINSQGQSAV TRTRGNRYGH VVLRGGGGRP NYDTVSVSLA EQALAKAKLA KNIVVDCSHA NSWKKPEYQP LVMKDVMHQI REGNQSIVGL MIESNIEAGN QPIPADLSQL KYGCSVTDAC VDWATTEDMI RKSAAVLRDV LPKRERRA
|
| |