Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4372 |
Symbol | |
ID | 4596890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4621931 |
End bp | 4622770 |
Gene Length | 840 bp |
Protein Length | 279 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639778982 |
Product | short chain dehydrogenase |
Protein accession | YP_925556 |
Protein GI | 119718591 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0247047 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACC TTGGCTACGC ACAGGCCCAC GACACCTCGA TCGAGGACCG CTACGACGCC GTACCCCTGC TGAAGGGCAA GGTCGTGGTG ATCTCCGGGA TCGGGCCCGG CCTGGGCCGC TCGCTGGCCG AGGAGGCCGC GAAGATGGGC GCGGACCTGG TGATCGCCAG CCGCACCGAG GCCCGCCTCG TCGAGCTCCA GGGCGAGCTG GAGGGCCACG GCGTCAAGGT CGTGAGCGTG GTCACCGACG TGACCGACGA GGACTCGCGG GTCAACCTGC GCGACCGGGC CCTCGAGGCG TTCGGGCGCG TCGACTGCGT GATCAACAAC GCGTTCGCGA TCCCGCCGAT GGACCCGATC ACCCGGATCG ACCCGCGCAA GCTCGCGAAG GTCAACGAGA CCAACGTGTT CGCGCCGCTG CGGCTCTCGG CGCTGTTCGC CGACGCGCTC GCGGAGAGCA GGGGCTCGAT CATCATGCTG AACTCCTGCG TCTCCTTCTC CTCCCAGCCC GAGTACGCCG GCTACAAGCT GTCCAAGGGC GCGCTCGAGC ACCTCGCGTC GTCGCTGGCC ACCGAGCTCG GCCCGCGCGG GATCCGGGTC AACAGCGTGG CGCCGTCCTA CATCTACGAG GACGTCAACC GCGGCTACTT CGACTTCCTG GCGGCCGTCG AGGGCAAGAC GCACGAGGAG GTGTACGCCG AGAAGGCGGC GCCGACCGAC CTCAAGCGGC TGGCGTCGGC CCAGGAGGTC GCCCGGGCCG CGCTGTTCCT CGCCTCCGAC CTCGCCTCCG CGGTGACCGG GCAGATGCTC ACCGTCGACT GCGGGGAGTT CCACGACTGA
|
Protein sequence | MTDLGYAQAH DTSIEDRYDA VPLLKGKVVV ISGIGPGLGR SLAEEAAKMG ADLVIASRTE ARLVELQGEL EGHGVKVVSV VTDVTDEDSR VNLRDRALEA FGRVDCVINN AFAIPPMDPI TRIDPRKLAK VNETNVFAPL RLSALFADAL AESRGSIIML NSCVSFSSQP EYAGYKLSKG ALEHLASSLA TELGPRGIRV NSVAPSYIYE DVNRGYFDFL AAVEGKTHEE VYAEKAAPTD LKRLASAQEV ARAALFLASD LASAVTGQML TVDCGEFHD
|
| |