Gene Information |
Locus tag | Tmz1t_3074 |
Symbol | |
ID | 7874544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3328327 |
End bp | 3329457 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699997 |
Product | chorismate synthase |
Protein accession | YP_002890049 |
Protein GI | 237653735 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
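The coordinate and length fields in the Gene Information table can be cross-checked with a small sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3328327, 3329457)
print(length_bp)                  # 1131
print(protein_length(length_bp))  # 376
```

Both values match the Gene Length (1131 bp) and Protein Length (376 aa) fields listed in the record.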
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGGCA ACACCCTCGG CACGCTCTTC ACCGTCACCT CCTTCGGGGA ATCCCACGGC CCGGCGATCG GCTGCATCGT CGATGGCTGC CCGCCGGGAC TGGCGATCTG CGAGGCAGAC ATCCAGGCCG AGCTCGACCG CCGCAAGCCG GGCACCTCGC GCCACGTCAC CCAGCGCCGC GAGCCCGACA CCGTCGAGAT CCTCTCCGGC GTGTTCGAGG GCGTCACCAC CGGCACCCCG ATCTCGCTGC TGATCCGCAA CCAGGACCAG CGCAGCAAGG ACTACGGCAA CATCGCCGAC ACCTTCCGCC CCGGCCACGC CGACTACGCC TACCTGCAGA AGTACGGCCT GCGCGACCAT CGTGGCGGCG GGCGCTCGTC GGCGCGCGAG ACCGCGGTGC GGGTGGCGGC CGGCGCGATC GCGAAGAAGT GGCTGAAGGA GCGCCACGGC ATCGTCATCC GCGCCTGCAT GGGTGCGCTC GGCCCGATCG AGATTCCCTT CGTGTCCTGG GACGAGGTCG ACGGCAACCC CTTCTTCGCG CCCAACGCCG CGATCGTGCC CGAGCTCGAG GCCTTCATGG ACGCGCTGCG CAAGTCGGGC GACTCGATCG GCGCGCGCAT CGACGTGGTC GCCAGCGGCG TCCCGGTCGG CTGGGGCGAG CCGGTGTATG GCCGCCTGGA CGCCGACATC GCCTATGCGA TGATGGGCAT CAACGCGGTC AAGGGCGTGG AGATCGGCGC CGGGTTCAAG TCGGTCGCGC AGCGCGGCAC CGAGCACGGC GACGAGATGA CGCCCGCGGG CTTCCTGTCC AATCATGCGG GCGGCGTGCT CGGTGGCATC TCCACCGGGC AGGACATTCT CGCCAGCATC GCGATCAAGC CGACCTCGAG CATCCGCCTC GAGCGTCGCT CGATCGACCG CGCAGGCAAT CCCGTCATGG TCGCCACCGA GGGTCGCCAC GACCCCTGCG TGGGCATCCG CGCGACGCCG ATCGCCGAAT CCATGCTCGC ACTGGTGCTG ATCGACCACG CGCTGCGCCA TCGCGCGCAG TGCGGCGACG TGCGCACCGA CACCCCGCGC ATCCCGGCGC TCGCGCCCGC AGGGAGCCAG CGCCTGCCTT CGCCGCGCTG A
|
Protein sequence | MSGNTLGTLF TVTSFGESHG PAIGCIVDGC PPGLAICEAD IQAELDRRKP GTSRHVTQRR EPDTVEILSG VFEGVTTGTP ISLLIRNQDQ RSKDYGNIAD TFRPGHADYA YLQKYGLRDH RGGGRSSARE TAVRVAAGAI AKKWLKERHG IVIRACMGAL GPIEIPFVSW DEVDGNPFFA PNAAIVPELE AFMDALRKSG DSIGARIDVV ASGVPVGWGE PVYGRLDADI AYAMMGINAV KGVEIGAGFK SVAQRGTEHG DEMTPAGFLS NHAGGVLGGI STGQDILASI AIKPTSSIRL ERRSIDRAGN PVMVATEGRH DPCVGIRATP IAESMLALVL IDHALRHRAQ CGDVRTDTPR IPALAPAGSQ RLPSPR
|
| |
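The sequence fields can be processed with a short sketch like the one below, which strips the 10-character block spacing, computes a GC fraction, and translates codons. Although the record gives the strand as "-", the Gene sequence as listed begins with ATG and ends with TGA, so it appears to be in coding orientation already and no reverse complement is applied here. The helper names are hypothetical. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares for elongation codons; alternative start codons are not handled.

```python
BASES = "TCAG"
# Standard codon-to-amino-acid mapping, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 21 bases of the Gene sequence field, spacing kept as in the record.
excerpt = "ATGTCCGGCA ACACCCTCGG C"
print(translate(excerpt))  # MSGNTLG
```

The translated excerpt matches the first seven residues of the Protein sequence field. Passing the full Gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.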