Gene Mlut_20000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlut_20000 
Symbol 
ID7985210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMicrococcus luteus NCTC 2665 
KingdomBacteria 
Replicon accessionNC_012803 
Strand
Start bp2157249 
End bp2158793 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content69% 
IMG OID644806940 
Product5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 
Protein accessionYP_002958028 
Protein GI239918470 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGAGA AGACCAGCAT CGAGGCCCGC ACGCCCGAGG GCCTGCCCGA CGTCCTGCGC 
CACTACATCG ACGGCGAGTT CGTCGACTCG ATCGACGGCG ACACCTTCGA GGTGCTCGAC
CCGGTGACCA ACGAGCCGTA CCTGACGGCC GCCTCCGGCA AGGCCGCGGA CATCGACCGC
GCCGTCGCCG CCGCGAAGCG GGCCTTCAAG TCCGGCGAGT GGTCCCAGGC CCTGCCGCGC
CAGCGCTCCC GCGTGCTGCA CCGGATCGCC GACATCATGG AGACCCGCGG CGACCAGCTG
GCCGAGATGG AGTGCTTCGA CACCGGCCTG CCGATCAAGC AGGCGAGGGG CCAGGCCGCC
CGCGCCGCCG AGAACTTCCG CTTCTTCGCG GACCTGATCG TGGCCCAGCA CGACGACACC
TTCAAGGTGC CGGGCCGCCA GATCAACTAC GTGAACCGCA AGCCGATCGG CGTCGCCGGC
CTGATCACCC CGTGGAACAC CCCGTTCATG CTGGAGTCCT GGAAGCTGGC CCCGGCCATC
GCCACCGGCA ACTCGGTGGT CCTGAAGCCG GCGGAGTTCA CCCCGCTCTC GGCCTCCCTG
TGGGGCGGGA TCTTCGAGGA GGCCGGCCTG CCCCAGGGCG TGTTCAACAT GGTGCACGGC
TTCGGCGAGG AGGGCTACGC GGGCGACCCG CTCGTGAAGC ACCCGGACGT GCCGCTGATC
TCCTTCACCG GCGAGTCCCG CACCGGCCAG ATCATCTTCG CCAACGCCGC CCCGCACCTG
AAGGGCCTGT CCATGGAGCT CGGCGGCAAG TCCCCGGCCG TGGTGTTCGA GGACGCGGAC
CTGGACGCGG CGATCGACGC GACCATCTTC GGCGTGTTCT CCCTGAACGG CGAGCGCTGC
ACCGCCGGCT CCCGCATCCT GGTCCAGCGT TCCGTCTACG ACGAGTTCGT GGAGCGCTAC
GCCGCCCAGG CCTCCCGCGT GAAGGTCGGC CTGCCGAACG ACGAGACCAC CGAAGTCGGC
GCCATCGTGC ACCCGGAGCA CTTCGAGAAG GTCATGTCCT ACGTGGAGAT CGGCAAGACC
GAGGCCCGCC TGGTGGCCGG CGGCGGCCGC CCGGAGGGCT TCCCCGAGGG CAACTTCGTG
CAGCCCACCG TGTTCGCGGA CGTGGCCCCG GACGCCCGGA TCTTCCAGGA GGAGATCTTC
GGCCCGGTCG TGGCCATCAC CCCCTTCGAC ACGGAGGAGG AGGCCCTGCA GCTGGCCAAC
AACACCAAGT ACGGTCTGGC CGCCTACATC TGGACCAACG ACCTCAAGCG CGCCCACAAC
GTCGCGCAGA ACGTGGAGGC CGGCATGGTG TGGCTCAACT CCAACAACGT GCGGGACCTG
CGCACCCCGT TCGGCGGGGT GAAGGCCTCC GGCCTGGGCC ACGAGGGCGG CTACCGCTCG
ATCGACTTCT ACACCGATCA GCAGGCCGTG CACATCAACC TCGGCGAGGT CCACAACCCG
GTGTTCGGCA AGCAGGAGCA GGCCGCGGCG AAGATCGACG GCTGA
 
Protein sequence
MTEKTSIEAR TPEGLPDVLR HYIDGEFVDS IDGDTFEVLD PVTNEPYLTA ASGKAADIDR 
AVAAAKRAFK SGEWSQALPR QRSRVLHRIA DIMETRGDQL AEMECFDTGL PIKQARGQAA
RAAENFRFFA DLIVAQHDDT FKVPGRQINY VNRKPIGVAG LITPWNTPFM LESWKLAPAI
ATGNSVVLKP AEFTPLSASL WGGIFEEAGL PQGVFNMVHG FGEEGYAGDP LVKHPDVPLI
SFTGESRTGQ IIFANAAPHL KGLSMELGGK SPAVVFEDAD LDAAIDATIF GVFSLNGERC
TAGSRILVQR SVYDEFVERY AAQASRVKVG LPNDETTEVG AIVHPEHFEK VMSYVEIGKT
EARLVAGGGR PEGFPEGNFV QPTVFADVAP DARIFQEEIF GPVVAITPFD TEEEALQLAN
NTKYGLAAYI WTNDLKRAHN VAQNVEAGMV WLNSNNVRDL RTPFGGVKAS GLGHEGGYRS
IDFYTDQQAV HINLGEVHNP VFGKQEQAAA KIDG