Gene Information |
Locus tag | Noca_1053 |
Symbol | |
ID | 4599659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1110758 |
End bp | 1112275 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639775651 |
Product | aldehyde dehydrogenase |
Protein accession | YP_922258 |
Protein GI | 119715293 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAGG CCCCCGCCCG CCCGCAGTCC CCTCAGCCCG CCGAGCCGGC CCCGTCCGGC TCGACGTTCG AGTCGCTGGA CCCGGCGACC GGGGCGGTCG TCGGGACGTT CCCGGTGCAC GGCGAGGCGG AGGTGCGCGC GGCCGTCGAG CGCGCCCGCA CGGCCGCCGA GTGGTGGTCG GCGCTCTCGT TCAAGGACCG CAAGGTCTAC CTGACCACCT GGAAGGCCGC GATCACCCGC CGGATGCCCG AACTCGCGGA GCTCATGCAC CGCGAGACCG GCAAGCCCCG CTCCGACGCG ATGCTCGAGG CGACGCTCGG GGTCGACCAC CTCGGCTGGG CCGCCGGCCA CGCGGGCAAG GTGCTGGGGC GGCACCGGGT CTCGCCCGGG ATGTTGATGG TCAACCAGGC CGCGACCGTG GAGTTCCGCC CGCTCGGCGT CGTCGGCGTG ATCGGCCCGT GGAACTACCC GGTGTTCACC CCGCTCGGCT CGATCGCCTA CGCGCTCGCG GCCGGCAACG CCGTGGTGTT CAAGCCCAGC GAGCACACGC CCGCGGTCGG CGAGTGGCTG GCCCGCACGT TCGGCGAGTG CGTCGGGCGA CCGGTCCTCC AGGTCGTCAC CGGCCGCGGC GAGACCGGTG CCGCGCTGTG CCGCTCCGGG GTCGACAAGG TGGCGTTCAC CGGCTCGACC GGCACGGGGA AGAAGGTGAT GGCCGCCTGC GCCGAGACCC TGACCCCGGT CGTCATCGAG GCCGGTGGCA AGGACCCGCT GATCGTCGAC GCGGACGCCG ACGTCCCGGC CGCCGCCGAC GCCGCGCTGT GGGGCGCCTG CAGCAACGCC GGCCAGACCT GCGCGGGCGT CGAGCGGGTC TACGTGCACG AGCGGGTGTA CGACGAGTTC CTCGCCGAGA TCACCCGCAA GGCGCAGGGC CTGAGCGCCC ATGGCGGCGA CGACGCGAAG ATCGGCCCGA TCACGATGCC GGGCCAGCTC GACGTGATCC GCCGCCACAT CGACGACGCG CTCGAACGCG GCGGCCGCGC GGTCGTCGGC GGGGCGGACG CGGTGGGCGA GCGGTTCGTG CAGCCCACGA TCCTCGTCGA CGTGCCCGAG GACTCCGCGG CGGTCCAGGA GGAGACGTTC GGCCCGACCG TGACGATCGC GAAGGTGCGC GACATGGACG AGGCGATCGA GCTCGCCAAC GGGACGCCGT ACGGCCTCGG GGCGACGGTC TTCAGCAGGA GCAACGGGAT GGCCATCGCC GAGCGGATCC GCTCCGGCAT GACCGCGATC AACGCGGTGA TCTCGTTCGC GGCGATCCCG AGCCTGCCGT TCGGCGGCGT CGGCGACTCC GGATTCGGAC GGATCCACGG GCCCGAGGGC CTCAAGGAGT TCACCTACGC GAAGGCGATC GCCCGGCAGC GGTTCAAGCC GGCCCTCGCG CTGACCACGT TCGAGCGCAC CGAGCAGGCC GACCGGCGGC TCGCCGCGAT CGTCCGGGCG CTGCACGGCC GCGGCTGA
|
Protein sequence | MTQAPARPQS PQPAEPAPSG STFESLDPAT GAVVGTFPVH GEAEVRAAVE RARTAAEWWS ALSFKDRKVY LTTWKAAITR RMPELAELMH RETGKPRSDA MLEATLGVDH LGWAAGHAGK VLGRHRVSPG MLMVNQAATV EFRPLGVVGV IGPWNYPVFT PLGSIAYALA AGNAVVFKPS EHTPAVGEWL ARTFGECVGR PVLQVVTGRG ETGAALCRSG VDKVAFTGST GTGKKVMAAC AETLTPVVIE AGGKDPLIVD ADADVPAAAD AALWGACSNA GQTCAGVERV YVHERVYDEF LAEITRKAQG LSAHGGDDAK IGPITMPGQL DVIRRHIDDA LERGGRAVVG GADAVGERFV QPTILVDVPE DSAAVQEETF GPTVTIAKVR DMDEAIELAN GTPYGLGATV FSRSNGMAIA ERIRSGMTAI NAVISFAAIP SLPFGGVGDS GFGRIHGPEG LKEFTYAKAI ARQRFKPALA LTTFERTEQA DRRLAAIVRA LHGRG
|
| |
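The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source database: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the final codon of the CDS is assumed to be a stop codon (the gene sequence above ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


def strip_blocks(seq: str) -> str:
    """Remove the spaces that split the displayed sequence into 10-character blocks."""
    return "".join(seq.split())


# Values taken from the record above (Start bp, End bp).
length = gene_length(1110758, 1112275)
protein = expected_protein_length(length)
print(length, protein)  # 1518 505
```

Under these assumptions the computed values agree with the Gene Length (1518 bp) and Protein Length (505 aa) fields. `strip_blocks` can be applied to the Gene sequence or Protein sequence field before counting characters.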