Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_4659 |
Symbol | |
ID | 4113488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | - |
Start bp | 4936262 |
End bp | 4937323 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638033810 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_641819 |
Protein GI | 108801622 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.890029 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCA AAGACATCTT CTTCAACCCG ATCTGGGACG TCCGGATGAC GGACACGTCG CTGCGCGACG GCTCGCACCA CAAACGTCAC CAGTTCACCA AGGACGAGGT GGGCGCCATC GTCGCCGCGC TCGACACCGC AGGTGTGCCG GTCATCGAGG TGACCCACGG TGACGGGCTC GGCGGGTCGA GCTTCAACTA CGGGTTCTCG AAAACCCCTG AGCAGGAACT GATCAAGCTC GCGGCCGAGA CCGCCAAGGA AGCCAAGATC GCCTTCCTCA TGCTGCCCGG GGTGGGCACC AAGGAGGACA TCAAAGAGGC GCAGAACAAC GGCGGTTCGA TCTGCCGCAT CGCGACCCAC TGCACCGAGG CCGACGTGTC GATCCAGCAC TTCGGTCTGG CGCGTGAGCT CGGGCTCGAG ACCGTGGGCT TCTTGATGAT GAGCCACACC ATCCCGCCGG AGAAGCTCGC CCAACAGGCC CGCATCATGG CCGACGCGGG GTGCCAGTGC GTCTACGTCG TCGACTCCGC CGGTGCACTG GTGCTCGAAG GGGTGCGCGA TCGGGTCGCC GCGCTCGTCG CCGAACTCGG TGACGACGCC CAGGTCGGTT TTCACGGCCA CGAGAATCTC GGTCTGGGCG TGGCGAATTC GGTCGAGGCG GTGCGCGCCG GGGCCAAGCA GATCGACGGG TCGTGCCGCC GGTTCGGCGC CGGAGCGGGT AACGCACCGG TCGAGGCGCT CATCGGGGTC TTCGACAAGA TCGGCGTGAA GACCGGCATC GACTTCTTCG ACATCGCCGA CGCCGCCGAG GAAGTCGTCG CACCGGCGAT GCCGGCCGAA TGCCTGCTGG ACCGCAACGC GCTCATCATG GGCTACTCGG GTGTCTACTC GAGCTTCCTC AAACACGCGA TCCGGCAGTC GGAGCGCTAC GGCGTGCCCG CGCACCAGCT GCTGCACCGC GCCGGCCAGC GCAAGCTGAT CGGCGGTCAG GAAGATCAGC TCATCGACAT CGCGCTGGAG ATCAAACGAG AACAGGACAG CGGGGCGACG GCCGCGCACT GA
|
Protein sequence | MSTKDIFFNP IWDVRMTDTS LRDGSHHKRH QFTKDEVGAI VAALDTAGVP VIEVTHGDGL GGSSFNYGFS KTPEQELIKL AAETAKEAKI AFLMLPGVGT KEDIKEAQNN GGSICRIATH CTEADVSIQH FGLARELGLE TVGFLMMSHT IPPEKLAQQA RIMADAGCQC VYVVDSAGAL VLEGVRDRVA ALVAELGDDA QVGFHGHENL GLGVANSVEA VRAGAKQIDG SCRRFGAGAG NAPVEALIGV FDKIGVKTGI DFFDIADAAE EVVAPAMPAE CLLDRNALIM GYSGVYSSFL KHAIRQSERY GVPAHQLLHR AGQRKLIGGQ EDQLIDIALE IKREQDSGAT AAH
|
| |