Gene Information |
Locus tag | Tmz1t_3220 |
Symbol | |
ID | 7874441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3521225 |
End bp | 3523171 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643700154 |
Product | transaldolase |
Protein accession | YP_002890192 |
Protein GI | 237653878 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0564] Pseudouridylate synthases, 23S RNA-specific |
TIGRFAM ID | [TIGR00876] transaldolase, mycobacterial type |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGAGA TCCTCTACCG CGACGACTGG CTGGTTGCGA TCCACAAGCC CTCCGGCCTG CTCGTGCACC GCAGCCCGAT TGCCGCGCAC GAGGAACGCT TCGCGGTGCA GCTGCTGCGC GACCAGCTCG GCCGCCGCGT GTATCCCGCC CACCGGCTGG ATCGCGGCAC CTCGGGCGTG CTGCTGTTCG CGCTCGATCG CGAGGTGGCG CGCACGCTCG CGCAGCGTTT CGAGGCGCAG GCGGTCGACA AGCGCTACCT GGCGGTCGTG CGCGGGCATC CGCCCGAGCA CGGCGTGATC GAGCACGCGC TGGTACGCCG ACTCGATGCG GTCGAAGTGC GCAGCGGCAA GGGCGCCGGG GCCCGCGACG CGCTGCCCGA GGATGTCGAC GACAGTGATT CGCCAGAGGC TGCCGAGCCG GTCGCCCAGC CCGCGCGCAC CCGCTTCCGC CGCCTCGCCA CGGTCGAGCT GCCGCACGCG GTGGACCGCT ACCCCAGCAG CCGCTACGCG CTCGTCGAGC TCCTCCCCGA GACCGGCCGC CGCCATCAGC TGCGCCGCCA CCTCAAGCAC ATCGCCCACC CGATCATCGG CGACGCCACC TACGGCAAGG GCCGCCATAA CCGCCTGTTC CAGGCGCTCT TCGGCAGCCA CCGCCTGCTG CTCGCGTGCA CGCGGCTGGC GCTGGCGCAT CCGGTGACGG AGCGCCCGCT GGAGATCGTG GCGCCGGTGG CGGAGGATTT TGCCGCGGTG CTCGCGGCGC TCGGCTGGCA GACGGCACAG CGCAGCACTT TCGAGGCAGA ATCCAGGCTT TCCGCCGCCC GCGGGCCCGC GCCCGAGCGG CCCCGTTCCC ACGACACTCC AGGTAGAACG ATGAACCCGC TGCTCCAGGT ACGCCAGCAC GGCCAGCAGA TCTGGCTTGA CAACCTCTCC CGCACCCTGC TCGAGGAAGG CCACCTCGCC CGCTTCGTCG CCGACGACGG CGTCGCCGGC GTGACCACCA ACCCGGCGAT CTTCCACAAG GCGATTTCCG GCGGGCGTTA TTACGAGGAC GATCTCGCCG CGCTCAAGCA GCAGCCGCTG AGCGCCGAGG CGCGCTACGA GGCCCTGGTG ATCCCCGACG TGCAGCGCGC CTGCGACCTG CTCGCACCCC TGCATCGGGA CAGCGGCGGC AGCGCCGGCT ACGTCAGCCT GGAAGTCTCG CCCGCGCTCG CCCACGACGC CGACGGTACC GTGGCCGCCG GCCTGCGCCT GAAGGCCGCG GTCGACCGCC CCAACCTGCT GATCAAGGTG CCCGCCACGC CGGCGGGCCT GGTCGCCATC GAGCGCCTGA TCGGCGAGGG CGTCAGCGTC AACGTCACGC TGATGTTCTC GCTCGCGCAC TGCGAGGGGG TGGCCGAGGC CTACCTGCGC GGGCTCGCGC GTCTGCGCGC GGCGGGCGGA GACGTCGCGG GCGTGATGTC GGTGGCGAGC CTGTTCCTGT CGCGGGTCGA CACCCTGGTG GACAAGCTGC TGGAAGAGCG CGGCGGCGAC CTGCCGGCGC TGCGCGGCCG CACCGCGGTG GCGATGGCAC GGCTGGCCTA CGAGGCCTAC CAGGAGCGCT TCCACGGCGC CGGCTTCGAG GACCTGGCCG CGGCGGGCGC ACGTCCGCAG TACATGTTGT GGGCAAGCAC CGGCACCAAG AACCCGGCCT ACAGCGACCT GCTCTACGTC GAGCCGCTGA TCGGTGCCGA GACCATCAAC ACCCTGCCCG ACGCCACGCT CGACGCCCTG CGCGACCACG GCCGCGTGGC CTCGACGCTG 
GAACAGGATG TCGAGCAGGC CGCCGCCCAC TTCACCGCGC TCGCGGCCGC CGGCATCGAC CTCGTGGCCG TGGGCGAGCG CCTGCAGCAG GAAGGCCTGG CGCAGTTCGA GCAGGCCTTC GCAGGGCTGC TCGAACTCAC CGCCTGA
|
Protein sequence | MLEILYRDDW LVAIHKPSGL LVHRSPIAAH EERFAVQLLR DQLGRRVYPA HRLDRGTSGV LLFALDREVA RTLAQRFEAQ AVDKRYLAVV RGHPPEHGVI EHALVRRLDA VEVRSGKGAG ARDALPEDVD DSDSPEAAEP VAQPARTRFR RLATVELPHA VDRYPSSRYA LVELLPETGR RHQLRRHLKH IAHPIIGDAT YGKGRHNRLF QALFGSHRLL LACTRLALAH PVTERPLEIV APVAEDFAAV LAALGWQTAQ RSTFEAESRL SAARGPAPER PRSHDTPGRT MNPLLQVRQH GQQIWLDNLS RTLLEEGHLA RFVADDGVAG VTTNPAIFHK AISGGRYYED DLAALKQQPL SAEARYEALV IPDVQRACDL LAPLHRDSGG SAGYVSLEVS PALAHDADGT VAAGLRLKAA VDRPNLLIKV PATPAGLVAI ERLIGEGVSV NVTLMFSLAH CEGVAEAYLR GLARLRAAGG DVAGVMSVAS LFLSRVDTLV DKLLEERGGD LPALRGRTAV AMARLAYEAY QERFHGAGFE DLAAAGARPQ YMLWASTGTK NPAYSDLLYV EPLIGAETIN TLPDATLDAL RDHGRVASTL EQDVEQAAAH FTALAAAGID LVAVGERLQQ EGLAQFEQAF AGLLELTA
|
| |
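The length fields in this record can be cross-checked against its coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon (the gene sequence above ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for locus tag Tmz1t_3220.
length = gene_length(3521225, 3523171)
print(length)                  # 1947
print(protein_length(length))  # 648
```

Both results match the Gene Length (1947 bp) and Protein Length (648 aa) fields listed in the record.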