Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_1784 |
Symbol | |
ID | 4110618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | + |
Start bp | 1924244 |
End bp | 1925485 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638030904 |
Product | aldehyde dehydrogenase |
Protein accession | YP_638949 |
Protein GI | 108798752 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATCGAGC ACACCAGGTC CCATGTGGCG AAATGGATGC GGCGCACCAC GCTGTTGCGT CCGGCGCGGC TGGCAGGGCT GCGGGCCGAG GTCGAACCCG TCCCGGTCGG TGTGGTCGGG ATCGTCGGAC CGTGGAACTT CCCCGTCAAC CTCGTCGTCC TGCCCGCGGC CGCCGCCTTC GCGGCGGGCA ACCGGGTGAT GATCAAGATG TCGGAGATCA CCGCGCACAC CGCCGAACTG CTCGAGGCCC GCGCCCCCGA GTACTTCGAC GCGGCCGAAC TCACCGTGGT CACTGGTGGG CCGGACACCG CCGCGGCGTT CACCGCGCTG CCCTTCGACC ACCTGTTCTT CACCGGTTCA CCGGCCGTCG GCGTGCACGT GCAGCGCGCC GCCGCGGCGA ACCTCGTCCC GGTCACGCTC GAGCTCGGAG GTAAGAATCC GGCCGTGGTC GGGCCGCGTG CCGACCTCCG GCGCGCAGCC GTGCGGATCG CGCAGGGCCG CATGGTCAAT GGTGGGCAGG TCTGCGTCTG CCCCGATTAC GTCCTGGTGC CCGAGTACCT CGTCGACGAG TTCAGCGCGA CCGTGCTCGC CACGTGGCGG CGGATGTTCC CGTCGATCAC CGGTAGCGAG GACTACTGCT CGTCGGTCAA CGACGCCAAC TTCGACCGGG TCGTCGGCCT GATCGACGAC GCCCGTGCCG GCGGCGCCCG CGTGAACAGC GTTGTCCCAC CGGGTGAAAC GCTTCCGGAC CGGAGATCGC GCAAGATCGC GCCCACGCTG ATCCGCGACG TGACACCGAC GATGCGCATC GCCTCCGAGG AGGTGTTCGG GCCGGTGTTG TCCGTGCTCG GGTATTCCAC GACCGACGAG GTGATCGACC ACATCAACAG CCGTCCCGCT CCGCTGGTGG CCTATTGGTT CGGCCCCGAC GACCAGGATT TCCGCACCTT TGTGCGCCGG ACACGCAGCG GCGGGGTGGC CCGCAACGAC TTTGCCGCAC AGATGATCCC GTCCGACGCG CCGTTCGGTG GGGTGGGGCG CAGTGGGATG GGCGCCTACC ATGGCAAGGC CGGGTTCGAC ACCTTCAGCC ACCACCGATC CGTGGTGGGC AGCGATCTGC CGTTCTCGAT CACCGGCAGC GCCGCACCTC CGTTCGGGGC CGCCATGCGG CGCAGCACCG AGTTCCGGCT GCGGATGGCC CGCAGGCGCA ACCATCGCCG GCTCCGGCGC AGCCACGGTT GA
|
Protein sequence | MIEHTRSHVA KWMRRTTLLR PARLAGLRAE VEPVPVGVVG IVGPWNFPVN LVVLPAAAAF AAGNRVMIKM SEITAHTAEL LEARAPEYFD AAELTVVTGG PDTAAAFTAL PFDHLFFTGS PAVGVHVQRA AAANLVPVTL ELGGKNPAVV GPRADLRRAA VRIAQGRMVN GGQVCVCPDY VLVPEYLVDE FSATVLATWR RMFPSITGSE DYCSSVNDAN FDRVVGLIDD ARAGGARVNS VVPPGETLPD RRSRKIAPTL IRDVTPTMRI ASEEVFGPVL SVLGYSTTDE VIDHINSRPA PLVAYWFGPD DQDFRTFVRR TRSGGVARND FAAQMIPSDA PFGGVGRSGM GAYHGKAGFD TFSHHRSVVG SDLPFSITGS AAPPFGAAMR RSTEFRLRMA RRRNHRRLRR SHG
|
| |