Gene Tmz1t_0641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0641 
Symbol 
ID7084579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp725450 
End bp726886 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content71% 
IMG OID643697667 
ProductSuccinate-semialdehyde dehydrogenase 
Protein accessionYP_002354309 
Protein GI217969075 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTACC CCCATACCCG CCTGCTCATC GACGGCCAAT GGTGCGACGC CCTCGACGGC 
CGCACGCTGG CCGTGCACAA CCCGGCCACC GGCGAGGAGA TCGGCCGCGT CGCGCATGCC
GCGATCGCCG ACCTCGACCT CGCCGTCGCC GCCGCCGTGA AGGGCTTCCA GACCTGGCGC
CGCACCCCGG CGATCGAGCG CGCCAAGACC CTGCGCCGCG CCGCAGCGCT GATGCGCGAG
CGCGCCGGCG ACATCGCCCG CGTGCTCACC CAGGAACAAG GCAAGCCGCT GCCCGAGGCG
AAAATGGAAA CACTCGCCGC CGCCGACATC ATCGAGTGGT TTGCCGACGA AGGCCTGCGC
GTGTATGGCC GCATCGTGCC GGGACGCAAC CTCGCCGCCA CGCAGATGGT GATCAAGGAC
CCGGTCGGCC CGGTCGCCGC CTTCACGCCG TGGAACTTCC CCATCAACCA GGTCGTGCGC
AAGGCCGCGG CGGCGCTCGC CACCGGCTGC TCCATCCTGG TCAAGGCCGC CGAGGAGACG
CCCGCGGCGC CGGCCGAGCT GGTGCGCGCC TTCGTCGATG CCGGCGTGCC GGCGGGCGTG
ATCGGGCTGG TGTATGGCAA CCCGGCGGAG ATCTCCAGCT ACCTGATCGC CCACCCGGCG
ATCCGCAAGA TCACCTTCAC CGGCTCGACT CCGGTGGGCA AGCAGCTCGC GGCGCTCGCC
GGCCAGCACA TGAAGCGGGT GACGATGGAG CTCGGCGGCC ATGCGCCGGT GATCGTGTGC
GAGGACGCCG ACCTCGAGCT GGCGATCAAG GTGTCGGCGG CGTCGAAGTT CCGCAACGCC
GGCCAGGTAT GCATCTCGCC CACGCGTTTC CTCGTGCACG AGGCGGTGCG CGAGACCTTC
GCCGCCGCGC TGGCCCGGCA TGCGAGCACG CTGAAGGTCG GCGACGGCCT GGCCGAGGGC
ACGCAGATGG GTCCGCTGGC CAATCCCCGC CGCGTGGCGG CGATGGCGGA CTTCGTGCAG
GACGCGCTGG CGCGCGGCGC GACCGTGGCG GCGGGCGGAG AGCGCATCGG CGCCGCGGGC
AACTTCTTCG CGCCCACGGT GCTGACCGGC GTGCCGCTGG ACGCGAAGGT GTTCAATGAG
GAACCCTTCG GGCCGGTCGC GGCGATCCGC GGCTTCACCA CGCTGGACGA GGCGATCGCG
GAGGCGAACC GCCTGTCCTT CGGGCTGGCG GGCTACGCGT TCACGCGCTC GCTGAAGAAC
GCGCACCGCC TGGCGCACGA ACTCGAGGTC GGCATGCTGT ACGTGAACCA ACCCGCGACG
CCGAGCGCGG AGATGCCCTT CGGCGGCATC AAGGATTCGG GCTACGGCAC CGAGGGCGGG
CCGGAGGCGC TCGACGCCTA CCTGAACACG CGCGCGGTGA CGGTGATGAA CGTGTAA
 
Protein sequence
MTYPHTRLLI DGQWCDALDG RTLAVHNPAT GEEIGRVAHA AIADLDLAVA AAVKGFQTWR 
RTPAIERAKT LRRAAALMRE RAGDIARVLT QEQGKPLPEA KMETLAAADI IEWFADEGLR
VYGRIVPGRN LAATQMVIKD PVGPVAAFTP WNFPINQVVR KAAAALATGC SILVKAAEET
PAAPAELVRA FVDAGVPAGV IGLVYGNPAE ISSYLIAHPA IRKITFTGST PVGKQLAALA
GQHMKRVTME LGGHAPVIVC EDADLELAIK VSAASKFRNA GQVCISPTRF LVHEAVRETF
AAALARHAST LKVGDGLAEG TQMGPLANPR RVAAMADFVQ DALARGATVA AGGERIGAAG
NFFAPTVLTG VPLDAKVFNE EPFGPVAAIR GFTTLDEAIA EANRLSFGLA GYAFTRSLKN
AHRLAHELEV GMLYVNQPAT PSAEMPFGGI KDSGYGTEGG PEALDAYLNT RAVTVMNV