Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_4075 |
Symbol | |
ID | 4112905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | + |
Start bp | 4343587 |
End bp | 4344615 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638033218 |
Product | aldo/keto reductase |
Protein accession | YP_641236 |
Protein GI | 108801039 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.48002 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGTATC GGCGGGTTGG CGAATCGGGT CTGACGGTGT CGGAGATCAG TTTCGGCGCA GCGACATTCG GCGGGGTCGG CGACTTCTTC GGCGCCTGGG GCGATACCGG CGTCGAGGGC GCGCGTCGCA TCGTGGACAT CTGCCTGGAG GCTGGTGTCA CGCTGTTCGA CACCGCGGAC GTGTACTCCG ACGGCGCCTC GGAGGAGGTG CTCGGCGAAG CCCTGCGCGG CCGGCGCGAC CGGGTGCTCA TCTCCACCAA GGCCGCGCTG CCCACCTCGA CCGGCTGGGG CACCTCACGC GCTCGGTTGC TGCGTGCGGT CGAGGATGCG CTGCGGCGGT TGCGGACCGA CCGCATCGAC CTGTTCCAAC TGCACGGCTA CGACTCGGGG ACGCCGATCG AGGAAGTCGT GGCGACCCTC GACACGCTGC TCACGCAGGG CAAGGTGCGC TACACCGGCG TGTCGAACTT CTCCGGATGG CAGTTGATGA AATCGCTGGC GGTCGCCGAC GGCGCACACC GCACCCGCCA CATCGCCCAT CAGGTCTACT ACTCGCTCGT CGGGCGGGAT TACGAATGGG AACTCATGCC GCTGGGCCTT GCCGAGGGCG TCGGCGCGCT GGTGTGGAGT CCGCTGGGCT GGGGACGGCT CACCGGCCGG ATCCGGCGCG GACGACCGCT GCCCGAGCGC AGCCGCCTGC ACGCGACCGC TGACGCGGGT CCGCCCGTCG ACGAGGATCG GCTCTACGCC GTCGTCGACA CCCTCGACGA CATCGCCGCG GAGACCGGAC GCACCGTGGC GCAGATCGCG CTCAACTGGC TCCTGCGGCG GCCGACCGTC GCCTCGGTGA TCATCGGAGC CCGCAACGAG GAACAGCTGC GCGAGAACCT GGGCGCCGTC GGCTGGCGAC TCGACGACGA GCAGATCGCC CGGCTGGACG CGGTCAGCGC CCGGGAGGCG CCGTATCCCT ACTTCCCGTA CCGCAGGCAG GAAGGTTTCG CACTGCTCGA TCCGCCGGTG GCGGGTTAG
|
Protein sequence | MEYRRVGESG LTVSEISFGA ATFGGVGDFF GAWGDTGVEG ARRIVDICLE AGVTLFDTAD VYSDGASEEV LGEALRGRRD RVLISTKAAL PTSTGWGTSR ARLLRAVEDA LRRLRTDRID LFQLHGYDSG TPIEEVVATL DTLLTQGKVR YTGVSNFSGW QLMKSLAVAD GAHRTRHIAH QVYYSLVGRD YEWELMPLGL AEGVGALVWS PLGWGRLTGR IRRGRPLPER SRLHATADAG PPVDEDRLYA VVDTLDDIAA ETGRTVAQIA LNWLLRRPTV ASVIIGARNE EQLRENLGAV GWRLDDEQIA RLDAVSAREA PYPYFPYRRQ EGFALLDPPV AG
|
| |