Gene Tmz1t_3947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3947 
Symbol 
ID7873593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4343235 
End bp4344374 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content71% 
IMG OID643700884 
ProductSaccharopine dehydrogenase 
Protein accessionYP_002890907 
Protein GI237654593 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1748] Saccharopine dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.35469 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGGACA TCCTGATCGT CGGCGCGGGC AAGATCGGCA CGGTGATCGC CGATCTGCTC 
GCCGGGAGCG GAGACTATGC GGTGACGGTC GCGGATCGCG ACCCCGCCGC GGTCGAGCGC
GTGGGGGCAG AGCTCGCGCA CGTGCAGGCC TGCGCGCTGG ACGTCGCCGA CGCCGATGCG
CTCGCCGCGG AGCTGGAAGG GCGCTGGGCG GTGATCGACG CCGGGCCCTT CGACATCGGC
ATGCGCATCG CCGCGGCCGC GGTGGCGCAG CGGGTGCATT ACCTCAACCT CACCGAGGAC
GTCGCCAGCA CGCGCCGGGT ACGCGAGCTC GCCCGCGGCG CGCACAGCGC GCTGATCCCG
CAATGCGGGC TGGCGCCCGG CTTCATCTCC ATCGTCGCCC ACGACCTCGC CGCGCGCTTC
GACGAACTGC GCGACGTGCG CATGCGCGTC GGCGCACTGC CCAAGTACCC CTCCAACGGC
CTCAAGTACA ACCTGACCTG GAGCACCGAC GGCCTCATCA ACGAGTACCT CAACCCCTGC
GAGGCGATCG TCGACGGGGT GCGCCGCGAG ATGCCGGCGC TCGAGGAGCT CGAGCACTTC
TCGCTCGACG GCGACGACTA CGAGGCCTTC AACACCTCGG GTGGGCTGGG TACGCTGTGC
GACACGCTCG AGGGCCGGGT GCGCAACCTC AACTACCGCA CCGTGCGCTA CCGCGGCCAC
CGCGACGTGA TGAAGCTGCT GCTGCACGAC CTGCGCCTGG GCGAGCGGCG CGCGCTGCTC
AAGGACATCC TGGAGTCGGC GATCCCGGTG ACCATGCAGG ACGTGGTGCT GGTCTTCGTC
ACCGTCAGCG GCCGGCGCGA GGGCCTGCTG ATGCAGGAGA CCTTCGCGCG CAAGCTCTAT
GCGGCCGAGG TCAACGGCCG CCTGCGCAGC GCGATCCAGC TCACCACCGC GAGCGCGCTG
TGCGCGGTGC TCGACCTGCT CGCGGCGGGC CGCCTGCCGC AGGCGGGCTT CGTGCGCCAG
GAGGACGTCG ATTTCTGCGA CTTCGTCACC AACCGCTTCG GCCGCCACTT CCTGACCGAC
AGCGAGGAGG TGCGCTGCGC CGCCAGCGGC GTCCTCGCCG GGCCGGCCTC CGGCATGTGA
 
Protein sequence
MRDILIVGAG KIGTVIADLL AGSGDYAVTV ADRDPAAVER VGAELAHVQA CALDVADADA 
LAAELEGRWA VIDAGPFDIG MRIAAAAVAQ RVHYLNLTED VASTRRVREL ARGAHSALIP
QCGLAPGFIS IVAHDLAARF DELRDVRMRV GALPKYPSNG LKYNLTWSTD GLINEYLNPC
EAIVDGVRRE MPALEELEHF SLDGDDYEAF NTSGGLGTLC DTLEGRVRNL NYRTVRYRGH
RDVMKLLLHD LRLGERRALL KDILESAIPV TMQDVVLVFV TVSGRREGLL MQETFARKLY
AAEVNGRLRS AIQLTTASAL CAVLDLLAAG RLPQAGFVRQ EDVDFCDFVT NRFGRHFLTD
SEEVRCAASG VLAGPASGM