Gene Information |
Locus tag | Tmz1t_0853 |
Symbol | |
ID | 7084710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 944103 |
End bp | 945113 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643697875 |
Product | fumarylacetoacetate (FAA) hydrolase |
Protein accession | YP_002354516 |
Protein GI | 217969282 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
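The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon; the function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a complete CDS: one amino acid per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for locus tag Tmz1t_0853.
length = gene_length_bp(944103, 945113)
print(length)                     # 1011
print(protein_length_aa(length))  # 336
```

Both results agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields in the record.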
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.127023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACTCG CCACCCTCAA GCAAGGCGGC CGCGACGGCA CCCTGGTCGT CGTCAGCCGC GATCTCACCC GCTGCCGTGC CGTGCCGGCG ATCGCCCGCA CCCTGCAGGC CGCGCTCGAC GACTGGCAGG CCTGCGAGCC GCCGCTGCGC CAGGTGTATG AGGCGCTCAA CAGCGGCGCG GTCGACGCCC AGCCCTTCGA CCAGACCGCC TGCGCCTCGC CGCTGCCGCG CGCCTACCAG TGGGCCGACG GCTCGGCCTA CATCAACCAC GTCGAGCTGG TGCGCAAGGC GCGCAATGCC GAGATGCCGC CCTCGTTCTA CACCGACCCG CTGATGTACC AGGGCGGCTC GGACAGCTTC ATCGGCCCGC GCGACGCGGT GGTGGCCGAC GAGGCCTGGG GCATCGACTT CGAGGCCGAG GTCACGGTGG TGACCGGCGA CGTGCCGATG GGCGCCACCC CCGCCGAGGC GGCGCAGGCG ATCCGGCTGG TGATGCTGGT GAACGATGTG TCGCTGCGCA ATCTGATTCC CAACGAGCTC GCCAAGGGTT TCGGCTTCTT CCAGAGCAAG CCGGCCTCGG CCTTCAGTCC GGTCGCGGTC ACGCCCGACG AGCTCGGCGA AGCCTGGAGG GATGCCAAGG TGCACCTGCC GCTGGTGGTG CACCTCAACG GCAAGCTCTT CGGCAAGCCC GAGGCCGGCG TCGACATGAC CTTCGACTTC GGCCAGCTCG TCGCCCACGT CGCCAGGACG CGCGAGCTCG AGGCGGGCTC GATCATCGGC TCGGGCACGG TCTCGAACAA GCAGGGCGAC CTGTGGGGCT CGTCGATCGA TCACGGCGGC GTCGGCTACT GCTGCCTCGC CGAAGTACGC ACCTACGAGA CCATCGAGCA GGGCAAGCCG GCCACGTCCT TCATGCGCGA CGGCGACGTC GTCCGCATCG AGATGTTCGA CCGCCAGGGC CGCAACGTCT TCGGCACGAT CGAGAACCGG GTGACGGCGC GCAAGGGCTG A
|
Protein sequence | MKLATLKQGG RDGTLVVVSR DLTRCRAVPA IARTLQAALD DWQACEPPLR QVYEALNSGA VDAQPFDQTA CASPLPRAYQ WADGSAYINH VELVRKARNA EMPPSFYTDP LMYQGGSDSF IGPRDAVVAD EAWGIDFEAE VTVVTGDVPM GATPAEAAQA IRLVMLVNDV SLRNLIPNEL AKGFGFFQSK PASAFSPVAV TPDELGEAWR DAKVHLPLVV HLNGKLFGKP EAGVDMTFDF GQLVAHVART RELEAGSIIG SGTVSNKQGD LWGSSIDHGG VGYCCLAEVR TYETIEQGKP ATSFMRDGDV VRIEMFDRQG RNVFGTIENR VTARKG
|
| |
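The gene sequence begins with GTG while the protein sequence begins with M, which follows from translation table 11 (the bacterial, archaeal and plant plastid code): GTG is accepted as an alternative initiation codon and is read as methionine at the start of a CDS. The sketch below illustrates this on the first 30 nucleotides of the record. The `translate` helper and the `START_CODONS` set are hypothetical names written for this example; the start-codon set is a simplified subset, not the full list for table 11.

```python
from itertools import product

# Standard codon-to-amino-acid assignments (shared by tables 1 and 11),
# listed in the conventional TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an accepted start codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # the initiator codon always encodes methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
print(translate("GTGAAACTCG CCACCCTCAA GCAAGGCGGC"))  # MKLATLKQGG
```

The output matches the first ten residues of the protein sequence in the record (MKLATLKQGG). The same function can be applied to the full gene sequence to compare against the full protein sequence.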