Gene Information |
Locus tag | Mkms_1831 |
Symbol | |
ID | 4613758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 1943213 |
End bp | 1944469 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639791497 |
Product | aldehyde dehydrogenase |
Protein accession | YP_937822 |
Protein GI | 119867870 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
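The coordinates and lengths in the record above agree with each other, and the arithmetic can be sketched in a few lines of Python. The function name `cds_lengths` is hypothetical. The sketch assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon; both assumptions match the listed values but are not stated by the record itself.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon is not translated
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1943213, 1944469)
print(gene_len, protein_len)  # 1257 418
```

The result matches the listed Gene Length (1257 bp) and Protein Length (418 aa).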
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGCGC TCCCGGTGAT CGAGCACACC AGGTCCCATG TGGCGAAATG GATGCGGCGC ACCACGCTGT TGCGTCCGGC GCGGCTGGCA GGGCTGCGGG CCGAGGTCGA ACCCGTCCCG GTCGGTGTGG TCGGGATCGT CGGACCGTGG AACTTCCCCG TCAACCTCGT CGTCCTGCCC GCGGCCGCCG CCTTCGCGGC GGGCAACCGG GTGATGATCA AGATGTCGGA GATCACCGCG CACACCGCCG AACTGCTCGA GGCCCGCGCC CCCGAGTACT TCGACGCGGC CGAACTCACC GTGGTCACTG GTGGGCCGGA CACCGCCGCG GCGTTCACCG CGCTGCCCTT CGACCACCTG TTCTTCACCG GTTCACCGGC CGTCGGCGTG CACGTGCAGC GCGCCGCCGC GGCGAACCTC GTCCCGGTCA CGCTCGAGCT CGGAGGTAAG AATCCGGCCG TGGTCGGGCC GCGTGCCGAC CTCCGGCGCG CAGCCGTGCG GATCGCGCAG GGCCGCATGG TCAATGGTGG GCAGGTCTGC GTCTGCCCCG ATTACGTCCT GGTGCCCGAG TACCTCGTCG ACGAGTTCAG CGCGACCGTG CTCGCCACGT GGCGGCGGAT GTTCCCGTCG ATCACCGGTA GCGAGGACTA CTGCTCGTCG GTCAACGACG CCAACTTCGA CCGGGTCGTC GGCCTGATCG ACGACGCCCG TGCCGGCGGC GCCCGCGTGA ACAGCGTTGT CCCACCGGGT GAAACGCTTC CGGACCGGAG ATCGCGCAAG ATCGCGCCCA CGCTGATCCG CGACGTGACA CCGACGATGC GCATCGCCTC CGAGGAGGTG TTCGGGCCGG TGTTGTCCGT GCTCGGGTAT TCCACGACCG ACGAGGTGAT CGACCACATC AACAGCCGTC CCGCTCCGCT GGTGGCCTAT TGGTTCGGCC CCGACGACCA GGATTTCCGC ACCTTTGTGC GCCGGACACG CAGCGGCGGG GTGGCCCGCA ACGACTTTGC CGCACAGATG ATCCCGTCCG ACGCGCCGTT CGGTGGGGTG GGGCGCAGTG GGATGGGCGC CTACCATGGC AAGGCCGGGT TCGACACCTT CAGCCACCAC CGATCCGTGG TGGGCAGCGA TCTGCCGTTC TCGATCACCG GCAGCGCCGC ACCTCCGTTC GGGGCCGCCA TGCGGCGCAG CACCGAGTTC CGGCTGCGGA TGGCCCGCAG GCGCAACCAT CGCCGGCTCC GGCGCAGCCA CGGTTGA
|
Protein sequence | MGALPVIEHT RSHVAKWMRR TTLLRPARLA GLRAEVEPVP VGVVGIVGPW NFPVNLVVLP AAAAFAAGNR VMIKMSEITA HTAELLEARA PEYFDAAELT VVTGGPDTAA AFTALPFDHL FFTGSPAVGV HVQRAAAANL VPVTLELGGK NPAVVGPRAD LRRAAVRIAQ GRMVNGGQVC VCPDYVLVPE YLVDEFSATV LATWRRMFPS ITGSEDYCSS VNDANFDRVV GLIDDARAGG ARVNSVVPPG ETLPDRRSRK IAPTLIRDVT PTMRIASEEV FGPVLSVLGY STTDEVIDHI NSRPAPLVAY WFGPDDQDFR TFVRRTRSGG VARNDFAAQM IPSDAPFGGV GRSGMGAYHG KAGFDTFSHH RSVVGSDLPF SITGSAAPPF GAAMRRSTEF RLRMARRRNH RRLRRSHG
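The sequences above are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before any computation. As a minimal sketch (the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of the source database), GC content could be computed like this:

```python
def clean_sequence(raw):
    """Remove the display whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence only, not the whole gene.
print(gc_percent("ATGGGGGCGC"))  # 80.0
```

Passing the full Gene sequence field to `gc_percent` gives the value to compare against the listed GC content, which the record reports rounded to a whole percent.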