Gene Information |
Locus tag | Tmz1t_3039 |
Symbol | |
ID | 7874509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3290061 |
End bp | 3291128 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699962 |
Product | chorismate mutase |
Protein accession | YP_002890014 |
Protein GI | 237653700 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0077] Prephenate dehydratase |
TIGRFAM ID | [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACG AACTGCTGAA ACTGCGCAAC GAGATCGACC GCATCGACGA GGAAATCCTC GCCCGCCTCG CCGAGCGCGC GCGCTGCGCG CAGCGCGTGG GCGAGATCAA GCGCGGGGTG ATGTATTACC GCCCCGAGCG CGAGGCGCAG GTGCTGCGCC GGCTGGCCGA GCTCAACCCC GGCCCGCTGT CTTCCGATGC GGTGAAGACC ATCTTCCGCG AGGTGATGTC GGCCTGCCTC GGGCTGGAGC AGCCGCTGCG CGTGGCCTAC CTCGGCCCCG CCGGCACCTT CTCCGAGAGC GCCAGCCGCA AGCACTTCGG TTCCGCGCCC AACTTCCTGG CGATGGCGGC GATCGACGAC GTCTTCCGCG CGGTGGAAGC CGGCAATGCC GACTACGGCG TGGTGCCGGT GGAGAACTCC ACCGAGGGCG CGGTCGGCGG CACGCTCGAT CTGCTGCTGG CCAACCCGCT CAAGGTCTGC GGCGAGGTGC GCCTGCGCAT CCACCAGCAG CTGATGTCGC GCGCCGAGGG CATCGGCGCC GCCCGCCGCA TCTACTCGCA CGCGCAGTCG CTGGCGCAGT GCCACGAGTG GCTCAACCGC AACCTCCCGC ACCTGCCGCG CATCCCGGTG GCGAGCAACG CCGAGGCCGC GCGCATGGCC TCCGAGGATC CAGAGTCCTG CGCCATCGCC GGCGACGCCG CGGCGCAGCT CTACGGGCTC AACATCCTCG CGCCCAACAT CGAGGACGAT CCCAACAACA CGACGCGCTT CCTGGTCATC GCCGACCACG ACGCCGGGCC CTCGGGCAAG GACCGCACCT CGCTGGTGTT CTCCGCACCC AACCGGCCGG GGGCGATCCA CAGCCTGCTC GAGCCGATGG CCCGCCACGG CGTGGACATG ACCAAGCTGC AGTCGCGCCC GGCGCGCTCC GGGCTGTGGG AGTACGTGTT CTACGCCGAC ATCAACGGCC ACCGCGAAGA CCCCGAGGTG GCGGCCGCGC TGCGCGAGCT CGACGAGCGC GCCGCCTTCG TGAAGATCAT CGGTTCCTAT CCGGTCGCGG CGATCTGA
|
Protein sequence | MSDELLKLRN EIDRIDEEIL ARLAERARCA QRVGEIKRGV MYYRPEREAQ VLRRLAELNP GPLSSDAVKT IFREVMSACL GLEQPLRVAY LGPAGTFSES ASRKHFGSAP NFLAMAAIDD VFRAVEAGNA DYGVVPVENS TEGAVGGTLD LLLANPLKVC GEVRLRIHQQ LMSRAEGIGA ARRIYSHAQS LAQCHEWLNR NLPHLPRIPV ASNAEAARMA SEDPESCAIA GDAAAQLYGL NILAPNIEDD PNNTTRFLVI ADHDAGPSGK DRTSLVFSAP NRPGAIHSLL EPMARHGVDM TKLQSRPARS GLWEYVFYAD INGHREDPEV AAALRELDER AAFVKIIGSY PVAAI
|
| |
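The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the database entry. It assumes the start and end positions are 1-based and inclusive, and that the stop codon (the listed gene sequence ends in TGA) is included in the gene length but not in the protein length. The function names `span_length`, `expected_protein_length`, and `gc_percent` are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 3290061
END_BP = 3291128
GENE_LENGTH_BP = 1068
PROTEIN_LENGTH_AA = 355


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS includes a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces and any other characters (such as the grouping used in the
    record's sequence field) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Both checks compare derived values against the fields in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full "Gene sequence" field to `gc_percent` gives a value to compare with the reported GC content. Because the listed sequence already begins with ATG, it appears to be given in coding orientation despite the minus strand, so no reverse complement should be needed for that comparison.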