Gene Information |
Locus tag | Tmz1t_3947 |
Symbol | |
ID | 7873593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4343235 |
End bp | 4344374 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643700884 |
Product | Saccharopine dehydrogenase |
Protein accession | YP_002890907 |
Protein GI | 237654593 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1748] Saccharopine dehydrogenase and related proteins |
TIGRFAM ID | |
|
|
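The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (an assumption about this record's convention, although it is the one that reproduces the stated 1140 bp) and that the protein length excludes the terminal stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 4343235
end_bp = 4344374


def gene_length(start, end):
    """Length of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1140
print(protein_length(length_bp))  # 379
```

Both results agree with the Gene Length and Protein Length rows of the record.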
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.35469 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGACA TCCTGATCGT CGGCGCGGGC AAGATCGGCA CGGTGATCGC CGATCTGCTC GCCGGGAGCG GAGACTATGC GGTGACGGTC GCGGATCGCG ACCCCGCCGC GGTCGAGCGC GTGGGGGCAG AGCTCGCGCA CGTGCAGGCC TGCGCGCTGG ACGTCGCCGA CGCCGATGCG CTCGCCGCGG AGCTGGAAGG GCGCTGGGCG GTGATCGACG CCGGGCCCTT CGACATCGGC ATGCGCATCG CCGCGGCCGC GGTGGCGCAG CGGGTGCATT ACCTCAACCT CACCGAGGAC GTCGCCAGCA CGCGCCGGGT ACGCGAGCTC GCCCGCGGCG CGCACAGCGC GCTGATCCCG CAATGCGGGC TGGCGCCCGG CTTCATCTCC ATCGTCGCCC ACGACCTCGC CGCGCGCTTC GACGAACTGC GCGACGTGCG CATGCGCGTC GGCGCACTGC CCAAGTACCC CTCCAACGGC CTCAAGTACA ACCTGACCTG GAGCACCGAC GGCCTCATCA ACGAGTACCT CAACCCCTGC GAGGCGATCG TCGACGGGGT GCGCCGCGAG ATGCCGGCGC TCGAGGAGCT CGAGCACTTC TCGCTCGACG GCGACGACTA CGAGGCCTTC AACACCTCGG GTGGGCTGGG TACGCTGTGC GACACGCTCG AGGGCCGGGT GCGCAACCTC AACTACCGCA CCGTGCGCTA CCGCGGCCAC CGCGACGTGA TGAAGCTGCT GCTGCACGAC CTGCGCCTGG GCGAGCGGCG CGCGCTGCTC AAGGACATCC TGGAGTCGGC GATCCCGGTG ACCATGCAGG ACGTGGTGCT GGTCTTCGTC ACCGTCAGCG GCCGGCGCGA GGGCCTGCTG ATGCAGGAGA CCTTCGCGCG CAAGCTCTAT GCGGCCGAGG TCAACGGCCG CCTGCGCAGC GCGATCCAGC TCACCACCGC GAGCGCGCTG TGCGCGGTGC TCGACCTGCT CGCGGCGGGC CGCCTGCCGC AGGCGGGCTT CGTGCGCCAG GAGGACGTCG ATTTCTGCGA CTTCGTCACC AACCGCTTCG GCCGCCACTT CCTGACCGAC AGCGAGGAGG TGCGCTGCGC CGCCAGCGGC GTCCTCGCCG GGCCGGCCTC CGGCATGTGA
|
Protein sequence | MRDILIVGAG KIGTVIADLL AGSGDYAVTV ADRDPAAVER VGAELAHVQA CALDVADADA LAAELEGRWA VIDAGPFDIG MRIAAAAVAQ RVHYLNLTED VASTRRVREL ARGAHSALIP QCGLAPGFIS IVAHDLAARF DELRDVRMRV GALPKYPSNG LKYNLTWSTD GLINEYLNPC EAIVDGVRRE MPALEELEHF SLDGDDYEAF NTSGGLGTLC DTLEGRVRNL NYRTVRYRGH RDVMKLLLHD LRLGERRALL KDILESAIPV TMQDVVLVFV TVSGRREGLL MQETFARKLY AAEVNGRLRS AIQLTTASAL CAVLDLLAAG RLPQAGFVRQ EDVDFCDFVT NRFGRHFLTD SEEVRCAASG VLAGPASGM
|
| |
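The GC content and protein sequence reported above can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own pipeline: it assumes the sequence is pasted as displayed (blocks of ten bases separated by spaces) and that it is already in the coding orientation, which the leading ATG and trailing TGA suggest even though the gene lies on the minus strand. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), so a standard codon table is used. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(displayed):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(displayed.split()).upper()


def gc_content(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
fragment = clean("ATGAGGGACA TCCTGATCGT CGGCGCGGGC")
print(translate(fragment))  # MRDILIVGAG
```

Running `translate` and `gc_content` on the full gene sequence should allow comparison with the Protein sequence and GC content rows of the record.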