Gene Information |
Locus tag | Tmz1t_3508 |
Symbol | |
ID | 7873014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3844382 |
End bp | 3846232 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643700449 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002890479 |
Protein GI | 237654165 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
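
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3844382
end_bp = 3846232

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1851, matching "Gene Length"
print(protein_length)  # 616, matching "Protein Length"
```

The strand field does not affect this check: a gene on the minus strand spans the same number of bases between its two coordinates.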
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCAGT ACCGTTCCCG CACGTCCACC GCCGGCCGCA ACATGGCGGG CGCCCGCGCC CTGTGGCGCG CCACCGGCAT GAAGGACGGC GACTTCGAAA AGCCGATCAT CGCGATCGCC AACAGCTTCA CCCAGTTCGT GCCCGGCCAC GTGCACCTGA AGGATCTCGG TCAGCTCGTC GCGCGCGAGA TCGAGTCCGC CGGCGGCGTC GCCAAGGAAT TCAACACCAT CGCGGTCGAT GACGGCATCG CCATGGGCCA CGGCGGCATG CTGTATTCGC TGCCCTCGCG CGAGCTCATC GCCGACAGCG TCGAGTACAT GTGCAACGCG CACACGGCGG ACGCGCTGGT GTGCATCTCG AACTGCGACA AGATCACCCC GGGCATGCTG ATGGCCGCGC TGCGCCTGAA CATCCCGGCG ATCTTCGTCT CCGGCGGTCC GATGGAGGCC GGCAAGGTCA AGTGGGAAGC CAAGGTGATC TCGCTCGACC TCGTGGATGC GATGGTCAAG GCGGCCGACA AGTCGTGTTC GGACGAGGAA GTCGACGCCA TCGAGCGCTC GGCCTGCCCG ACCTGCGGGT CGTGCTCGGG CATGTTCACA GCCAACTCGA TGAACTGCCT CACCGAGGCG CTCGGCCTGT CGCTGCCCGG CAACGGCACC ACGCTCGCCA CCCACGCCGA CCGCGAGCGC CTGTTCAAGG AAGCCGGCCG CCGCATCGTC GATCTCGCGC GTCGTTACTA CGAAAAGGAC GACGCCTCGG TGCTGCCGCG CTCGATCGCC AGCTTCCAGG CCTTCGAGAA CGCGATGAGC CTGGACGTGG CCATGGGCGG CTCGACCAAC ACCGTGCTGC ACCTGCTCGC CGCCGCGCGC GAGGCCGGCG TGGACTTCAC GATGAAGGAC ATCGACCGCG TCAGCCGCCG CGTGCCTTGC CTGTGCAAGG TCGCGCCCGC GATCGCCGAC GTGCACATCG AGGACGTGCA TCGTGCCGGC GGCATCATGT CCATCCTCGG TGAACTCGAC CGCGCCGGCC TGCTGCACAC CGATGTGCCG ACCGTGCACA GCGCGAGCCT GGGCGAAGCC CTGGACAAGT GGGACATCAA GCGCACCGAA GACGAAGCGG TGCACACCTT CTTCCGTGCA GCGCCGGGCG GGGTGCCGAC CCAGGTCGCC TTCAGTCAGG ACCGGCGCTG GAAGTCGCTC GACGTCGACC GCGAGCACGG CATCATCCGC AACAAGGAAC ATGCCTTCAC GGCCGACGGA GGGCTGGCCG TGCTCTACGG CAACATCGCC GAGAAGGGCT GCATCGTGAA GACCGCCGGG GTGGATGAAT CCATCTGGAA GTTCACCGGC AAGGCCAGGG TGTACGAGAG CCAGGAAGAC GCGGTGGAGG GCATCCTCGG CGAGCAGGTG CAGGCGGGCG ACGTGGTGGT GATCCGCTAC GAAGGCCCGA AGGGTGGCCC CGGCATGCAG GAGATGCTCT ATCCCACCTC TTACCTGAAG AGCCGCGGCC TGGGCGCGCA GTGCGCGCTG CTCACCGACG GACGCTTCTC GGGCGGCACC TCGGGCCTGT CGATCGGCCA TGCGTCGCCC GAGGCGGCCT GCGGCGGCGC GATCGCGCTG GTCGAGGACG GCGACACGAT CGAGATCGAC ATCCCCGCGC GCCGCATCCA TCTTGCCATC GCCGACGCCG AGCTCGCCCG CCGGCGTGCC GCGATGGAGG CGAAGGGCAA TGCGGCCTGG AAGCCGGTGA AGCGCGAACG CGTGGTCTCC GCCGCGCTGC AGGCCTACGC CGCGCTCACC 
ACCTCGGCCG ACACCGGCGC GGTGCGGGAC GTCACCCAGG TCCAGCGCTG A
|
Protein sequence | MPQYRSRTST AGRNMAGARA LWRATGMKDG DFEKPIIAIA NSFTQFVPGH VHLKDLGQLV AREIESAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMCNA HTADALVCIS NCDKITPGML MAALRLNIPA IFVSGGPMEA GKVKWEAKVI SLDLVDAMVK AADKSCSDEE VDAIERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGT TLATHADRER LFKEAGRRIV DLARRYYEKD DASVLPRSIA SFQAFENAMS LDVAMGGSTN TVLHLLAAAR EAGVDFTMKD IDRVSRRVPC LCKVAPAIAD VHIEDVHRAG GIMSILGELD RAGLLHTDVP TVHSASLGEA LDKWDIKRTE DEAVHTFFRA APGGVPTQVA FSQDRRWKSL DVDREHGIIR NKEHAFTADG GLAVLYGNIA EKGCIVKTAG VDESIWKFTG KARVYESQED AVEGILGEQV QAGDVVVIRY EGPKGGPGMQ EMLYPTSYLK SRGLGAQCAL LTDGRFSGGT SGLSIGHASP EAACGGAIAL VEDGDTIEID IPARRIHLAI ADAELARRRA AMEAKGNAAW KPVKRERVVS AALQAYAALT TSADTGAVRD VTQVQR
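
As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first and last few codons of the gene. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and the `translate` helper is a hypothetical name introduced here, not something provided by the database.

```python
from itertools import product

# Codon table in TCAG order; the amino-acid assignments are those of the
# standard code, which translation table 11 shares (assumption stated above).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored, and
    trailing bases that do not fill a complete codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 20 bases and last 21 bases of the gene sequence in this record.
head = translate("ATGCCCCAGT ACCGTTCCCG")
tail = translate("GTCACCCAGG TCCAGCGCTG A")

print(head)  # MPQYRS
print(tail)  # VTQVQR*
```

The translated head and tail agree with the start (MPQYRS) and end (VTQVQR) of the protein sequence listed above, with the final TGA read as the stop codon.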