Gene Tmz1t_0428 details

Gene Information

Locus tag: Tmz1t_0428
Symbol:
ID: 7084938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 488936
End bp: 490396
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 70%
IMG OID: 643697460
Product: succinic semialdehyde dehydrogenase
Protein accession: YP_002354103
Protein GI: 217968869
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01780] succinate-semialdehyde dehydrogenase
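
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon (neither convention is stated in the record); the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from start/end, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(488936, 490396)
print(length, protein_length_aa(length))  # prints: 1461 486
```

Under those assumptions, 1461 bp corresponds to 487 codons, one of which is the stop codon, giving the listed 486 aa.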


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCTGA ACCTGAAAGA TCCCGACCTC TTCCGCACCC GCTGCCATGT CGATGGGCAG 
TGGATCGACG CCGACGACGG CGCCACCACG AGCATCCGCA ACCCGGCCAC GGGCGAGGTG
CTCGGCACCA TCCCGCGCAT GGGCGCGGCC GAGACCCGGC GCGCCATCGC GGCCGCCAAC
GCCGCATGGC CGGCGTGGCG TGCGCTGACC GCCGGCGCGC GCGCGAAGAT CCTGCGGCGC
TGGTTCGAGC TCATCCTCGC CAACCAGGAA GACCTCGCCG TGCTGATGAC CAGCGAGCAG
GGCAAGCCGC TCGCCGAGGC GCGCGGCGAG GTGCTCTACG CCGCCTCCTT CATCGAGTGG
TTCGCCGAGG AAGGCAAGCG CATCTACGGC GACGTGATCC CCGGCCACCA GCCCGACAAG
CGCATCGTCG TCACCAAGGA GCCGATCGGG GTGTGCGCGG CGATCACGCC GTGGAACTTC
CCCGCGGCGA TGATCACGCG CAAGGCCGGT CCGGCGCTCG CGGCCGGATG CACGATGGTG
CTCAAGCCCG CCACCCAGAC CCCTTACTCG GCGCTCGCGC TGGCGGTGCT CGCCGAGCGC
GCGGGGGTGC CGAAGGGCGT GTTCAGTGTG GTCACCGGCG GCGCGGCCGA GATCGGCGGC
GAGCTGACCG CCAACCCGAT CGTCCGCAAG CTCACCTTCA CCGGCTCCAC CGAGATCGGC
GTCAAGCTGA TGGCGCAGTG CGCGCCGAGC GTCAAGAAGC TCTCGCTCGA GCTCGGCGGC
AATGCGCCCT TCATCGTCTT CGACGATGCC GACCTCGACG CCGCGGTCGA GGGCGCCATC
GCTTCCAAAT ACCGCAACAC CGGCCAGACC TGCGTGTGCG CCAACCGCCT GCTGGTGCAG
GACGGTGTGT ACGACGCCTT CGCCGCCAGG CTCGCCGCCG CGGTGGCGCG CCTGAAGGTG
GGCAACGGCC TCGCCGAGGG CAGCACCCAG GGCCCGCTGA TCGACATGAA CGCGGTGGCC
AAGGTCGAGG AGCACATCGC CGACGCGGTG GAGAAGGGCG CGCGCGTGCT CGCCGGCGGC
AAGCGCCACG CGCTCGGCGG CAGCTTCTTC GAGCCCACCA TCCTGGTCGA CGTGACCCCG
GCGATGAAGG TGGCGCGCGA GGAGACCTTC GGCCCGGTGG CGCCGCTGTT CCGCTTCAAG
GACGAGGCCG AGGCGATCCG CATGGCCAAC GACACCGAGT TCGGCCTCGC CGCCTATTTC
TACGCCAGCT CGATGAACCG CGTGTGGCGG GTCGGGGAGG CGCTCGAGTA CGGCATCGTC
GGCATCAACA CCGGAATCAT CTCGACCGAG GTCGCGCCCT TCGGCGGCAT GAAGTCCTCC
GGCCTCGGCC GCGAAGGTTC CAAGTACGGC ATCGAGGACT ACCTCGAGGT CAAGTATCTG
TGCATGGGCG GCGTGCAGTG A
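
The gene sequence is printed in space-separated groups of 10 bases, so it needs cleaning before any analysis. The sketch below shows one way to strip the grouping and compute a GC fraction; the helper names are hypothetical, and how the record's own GC content figure was calculated is not stated, so this may not reproduce it exactly.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, used here as sample input.
first_line = "ATGAGCCTGA ACCTGAAAGA TCCCGACCTC TTCCGCACCC GCTGCCATGT CGATGGGCAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full listing (all lines joined) to `clean_sequence` gives the complete coding sequence for downstream use.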
 
Protein sequence
MSLNLKDPDL FRTRCHVDGQ WIDADDGATT SIRNPATGEV LGTIPRMGAA ETRRAIAAAN 
AAWPAWRALT AGARAKILRR WFELILANQE DLAVLMTSEQ GKPLAEARGE VLYAASFIEW
FAEEGKRIYG DVIPGHQPDK RIVVTKEPIG VCAAITPWNF PAAMITRKAG PALAAGCTMV
LKPATQTPYS ALALAVLAER AGVPKGVFSV VTGGAAEIGG ELTANPIVRK LTFTGSTEIG
VKLMAQCAPS VKKLSLELGG NAPFIVFDDA DLDAAVEGAI ASKYRNTGQT CVCANRLLVQ
DGVYDAFAAR LAAAVARLKV GNGLAEGSTQ GPLIDMNAVA KVEEHIADAV EKGARVLAGG
KRHALGGSFF EPTILVDVTP AMKVAREETF GPVAPLFRFK DEAEAIRMAN DTEFGLAAYF
YASSMNRVWR VGEALEYGIV GINTGIISTE VAPFGGMKSS GLGREGSKYG IEDYLEVKYL
CMGGVQ
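
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11. The following is a minimal standard-library sketch, not the method used to produce this record; `translate` and the table-building code are illustrative. It assumes the CDS starts with ATG (alternative start codons, which table 11 permits, are not handled) and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the 10-base grouping
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 and last 12 bases of the gene sequence listing.
print(translate("ATGAGCCTGA ACCTGAAAGA T"))  # prints: MSLNLKD
print(translate("GGCGTGCAGTGA"))             # prints: GVQ
```

Both fragments agree with the start (MSLNLKD) and end (GVQ) of the listed protein sequence.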