Gene Mmcs_4333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_4333 
Symbol 
ID4113163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4609095 
End bp4610525 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content69% 
IMG OID638033479 
Productsuccinate-semialdehyde dehydrogenase (NAD(P)+) 
Protein accessionYP_641494 
Protein GI108801297 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.393848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAGTCAG TTCCCACCGG ACTGTGGATC GGTGGTGAAG AACGCGAGGC GAAGTCGACG 
TTCAACGTGT TGGACCCCAG TAGCGACGAG GTGCTGGTGG CCGTCGGCGA CGCGACGGCC
GAGGACGCGG TCGCCGCGCT CGACGCCGCC TGCGCGGTGC AGGCCGAATG GGCGGCCACG
CCCGCGCGGA AACGCGGCGA GATCCTGCGG TCGGTCTTCG AGACGCTCAC CGACCGCGCC
GACGACATCG CCGCGCTGAT GACCCTCGAG ATGGGCAAGG TGCTCGCCGA GAGTAAGGGT
GAGGTCACCT ACGGCGCCGA GTTCTTCCGC TGGTTCGCCG AAGAGGCCGT CCGCATCGGC
GGCAGGTACA CGCCGAGCCC GGCCGGCACC GGCCGCATCA TCGTCACCAA ACAGCCCGTC
GGGCCGTGTT ACGCGATCAC CCCGTGGAAC TTCCCGCTGG CCATGGGCAC CCGCAAGATG
GGCCCGGCGT TCGCCGCGGG TTGCACGATG ATCGTCAAAC CCGCGCAGGA AACCCCGCTG
ACCATGCTGT ACCTGGCCAA GCTGATGGCC GAGGCGGGGC TGCCCAAGGG CGTGCTGTCG
GTGCTGCCGA CCAGCAGCCC CGGACCCGTC ACCGAGGCGC TCGTCGACGA CGGCCGACTG
CGCAAGCTGA CCTTCACCGG TTCCACGGGG GTGGGCAAGT CGCTGGTGAA ACAGTCCGCC
GACAAGCTGC TGCGCACGTC GATGGAGCTC GGCGGCAACG CGCCGTTCAT CGTGTTCGAC
GACGCCGACG TCGATGCCGC GGTCGACGGC GCGATGCTGG CCAAGATGCG CAACGGCGGT
GAGGCCTGCA CCGCGGCCAA CCGCTTCCAC GTGGCCAACT CCGTGCGCGA GGAGTTCACC
GAGAAGCTGG TCAAGCGGAT GAGCGAGGTG ACGCTCGGCA ACGGCCTCGA CGAATCCGCC
ACGCTCGGCC CGCTGATCAA CAACAAGCAG CTGCAGAAGG TGACCGAACT GGTCTCCGAC
GCGGTCTCGC GCGGCGCGAC CGTCGCCGTC GGCGGCGTCG CCCCCGGCGG GCCCGGCAAC
TTCTACCCGG CCACGGTGCT CGCCGACGTG CCGGCCGACG CCCGCATCCT CAAGGAGGAG
GTGTTCGGGC CCGTCGCCCC GATCACCGGA TTCGACTCCG AGGAGGAGGG CGTCGCCGCG
GCCAACGACA CCGAATACGG GCTGGCCGCC TACGTGTACA CCCGGTCGCT GGACCGCGCG
CTGCGCGTCG CCGAGGCCAT CGAATCCGGG ATGGTCGGCA TCAACCGCGG CGTGATCAGC
GACGCCGCCG CGCCGTTCGG CGGTATCAAG GAGTCCGGCT TCGGCCGCGA GGGCGGCACC
GAGGGCATCG AGGAGTACCT CGACACGAAG TACATCGCGT TGACGAAGTA G
 
Protein sequence
MKSVPTGLWI GGEEREAKST FNVLDPSSDE VLVAVGDATA EDAVAALDAA CAVQAEWAAT 
PARKRGEILR SVFETLTDRA DDIAALMTLE MGKVLAESKG EVTYGAEFFR WFAEEAVRIG
GRYTPSPAGT GRIIVTKQPV GPCYAITPWN FPLAMGTRKM GPAFAAGCTM IVKPAQETPL
TMLYLAKLMA EAGLPKGVLS VLPTSSPGPV TEALVDDGRL RKLTFTGSTG VGKSLVKQSA
DKLLRTSMEL GGNAPFIVFD DADVDAAVDG AMLAKMRNGG EACTAANRFH VANSVREEFT
EKLVKRMSEV TLGNGLDESA TLGPLINNKQ LQKVTELVSD AVSRGATVAV GGVAPGGPGN
FYPATVLADV PADARILKEE VFGPVAPITG FDSEEEGVAA ANDTEYGLAA YVYTRSLDRA
LRVAEAIESG MVGINRGVIS DAAAPFGGIK ESGFGREGGT EGIEEYLDTK YIALTK