Gene Information |
Locus tag | Noca_4188 |
Symbol | |
ID | 4596702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4426848 |
End bp | 4428206 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639778794 |
Product | aldehyde dehydrogenase |
Protein accession | YP_925372 |
Protein GI | 119718407 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
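The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 4426848, End bp 4428206.
length = gene_length_bp(4426848, 4428206)
print(length)                     # 1359
print(protein_length_aa(length))  # 452
```

Under these assumptions, both results agree with the Gene Length (1359 bp) and Protein Length (452 aa) fields.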
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGTACGCCG TCACCAACCC GGCCACGGGC GAGCTGATCA GCGAGTTCGA CACGGCAACC GACGCGCAGG TTCGCGAGGC CGTCAGCCGC GCCGACCTCG CCTTCCAGTC CTGGAAGAGC ACGCCGCTGG AGGAGCGCTC CCGGACCCTG GCTCGGGCCG CAGACCTGTT CCTCGAGCGC AGCGACGAGT TGGCCCGCGC CATCACCCAG GAGATGGGCA AGCGCCTCGA GGAGAGCCGG GGGGAGGTAC GCATCGCCTC GGACATCTTC CATTACTACT CCGACAACGC TCCGAAGCTG CTCGCGGACG AAACCATCGC CATCCAGGGC GGCGAGGCCA AGATCCTCAA GCGGCCTGTC GGGGTGCTGC TCGGGATCAT GCCGTGGAAC TACCCGTACT ACCAGGTCGC CCGCTTCGCG GCGCCCAACC TGGTGCTGGG CAACACGATC ATCCTCAAGC ACGCGCCGTC CTGTCCGCAG TCCTCGGCCC TGGTCGAGCA GCTCTTGCAC GACGCCGGGG TACCGGTTGA CGCCTACATC AACGTCTACG CCACCAACGA GCAGGTCGCC TGGGCCCTCG CTGATCCGCG CATCCAGGGC GTATCCGTCA CTGGCAGCGA GCGAGCCGGC GCGGCCGTCG CGGCCGAGGC CGGCAGGAAT CTGAAGAAGG TGGTTCTCGA ACTGGGTGGC TCGGACCCCA TGGTCATCCT CGACACCGAC GATCTCGATG CCCTGGTCGA GACGGCCATG GAGTCACGGA TGGGCAACAC CGGACAGGCG TGCAACGCGC CCAAGCGGAT GATCGTGGTG GACGAGCTCT ACGACGACTT CGTGACGAAG ATGGTCCAGG CTGCCCGCAG ACTCCAGCCA GGGGACCCGC TTGATCCGGA GACGACGCTT GCCCCGCTGT CGTCGGAGCA GGCCGCCGTA CGCTTGATCG GACAGCTCGA CGAGGCGCGC AACCAGGGCG CCACCATCCG CGTGGGCGGT CACCGGGTCG AGCGACCCGG CGCCTACGTC GAGCCAACGG TGATCACTGA CGTCACGCCG GAGATGTCGG CCTATCGGGA CGAACTCTTC GGCCCGGTGG CCATCATCTT CCGAGTCGAT GACGAGGACG ACGCGGTTCG ACTCGCGAAC GACACGCCCT TCGGCCTCGG CGCCAGCGTC TTCTCAGGCG ATTCCGAGCG CGCCGAGCGC GTGGCAGCCC GGATCGACGC CGGCATGGTC TACCTCAACC AGGCTGGCGG CTCGCAGCCC GACCTCCCCT TTGGCGGCAT CAAGCGCTCC GGCATCGGCC GCGAACTCGG TGCCCTCGGC ATCGAGGAGT TCATGAACAA GAAGGTCGTG CGGCTCTGA
Protein sequence | MYAVTNPATG ELISEFDTAT DAQVREAVSR ADLAFQSWKS TPLEERSRTL ARAADLFLER SDELARAITQ EMGKRLEESR GEVRIASDIF HYYSDNAPKL LADETIAIQG GEAKILKRPV GVLLGIMPWN YPYYQVARFA APNLVLGNTI ILKHAPSCPQ SSALVEQLLH DAGVPVDAYI NVYATNEQVA WALADPRIQG VSVTGSERAG AAVAAEAGRN LKKVVLELGG SDPMVILDTD DLDALVETAM ESRMGNTGQA CNAPKRMIVV DELYDDFVTK MVQAARRLQP GDPLDPETTL APLSSEQAAV RLIGQLDEAR NQGATIRVGG HRVERPGAYV EPTVITDVTP EMSAYRDELF GPVAIIFRVD DEDDAVRLAN DTPFGLGASV FSGDSERAER VAARIDAGMV YLNQAGGSQP DLPFGGIKRS GIGRELGALG IEEFMNKKVV RL
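The gene and protein sequences above can be cross-checked against the Translation table (11) and GC content fields. The sketch below is an illustration, not the database's own method. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with GTG and ends with TGA, although the record lists Strand as "-"), and it treats the first codon as an initiator that is read as Met when it is one of the start codons permitted by NCBI translation table 11. All function and variable names are hypothetical.

```python
# Translation table 11 uses the standard codon-to-amino-acid assignments;
# it differs from table 1 in which codons may act as initiators.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiator codons allowed by table 11; each is read as Met at position 0.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS_TABLE_11:
            residue = "M"  # alternative start codon read as Met
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
head = "GTGTACGCCG TCACCAACCC GGCCACGGGC"
print(translate_cds(head))  # MYAVTNPATG
```

The translated prefix matches the first block of the protein sequence, including the leading M produced by the GTG start codon. Passing the full gene sequence to `gc_content` and `translate_cds` gives values to compare with the GC content field and the full protein sequence.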