Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4281 |
Symbol | |
ID | 4596796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4523084 |
End bp | 4524529 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639778888 |
Product | aldehyde dehydrogenase |
Protein accession | YP_925465 |
Protein GI | 119718500 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000626871 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAAC CGGAGCTGCG GGGCAACTTC GTCGGCGGGC ACTGGGTGGG CGCCTCGGGC GGGCGCACCT TCGAGCGCCG CAACCCCGCC GACCCCGCCG ACGTCGTCTC GGTCGCGCCC GACTCCGACG CGACCGACGT CGACCAGGCC GTCGGGCACG TCGCGACCCA CTACCGCGAG TGGGCCGAGC TCGCGCCCGA GGTGCGCGCG GACGTGCTGT GCCGGGCCGC CGACCAGCTC GAGCAGCGGG CCGACACCCT GGTGGCCGAG CTGGTCCGCG AGGAGGGCAA GACCCGGGCC GAGGCCCGGA TGGAGGTGCG CCGGGCGCCA CAGAACCTCC GGTTCTACGC CGGCGAGGCC CAGCGGCTGA CCGGCGAGAC GTTCCCGACC GGGGACGGGA GCATGGTGCT GACCCTGCGG GAGCCGGTCG GCGTGGTCGC GGCGATCACG CCGTGGAACT TCCCGCTCAA CATCCCCTCC CGCAAGCTCG GCCCCGCGCT CGCCGCCGGC AACGGCGTCG TGTTCAAGCC CAGCGAGGTC ACCCCGCTCC TCGGGCAGCG GCTGGTCGAG GCCCTCGTCG AGGCCGGTGT CCCCGGCGGC GCGCTGGCCC TGGTGCACGG TCACGGCGAG GTGGGCAAGG CCCTGGTGTC CGACACCCGG ATCGACGCGG TCACGTTCAC CGGCTCGACG GCGGTCGGCG AGGCGATCCA CGCGAACGTG CCGCCGTGGG TGCGCTGCCA GCTGGAGATG GGCGGCAAGA ACGCGGTCGT CGTCTGCGAC GACGCCGACC TCGACAAGGC CGCCGCCATC GTCGTCCGCG GCGCGTTCGG GCTCAGCGGC CAGGCGTGCA CCGGGACCTC CCGGGTCGTC GTCTACGAGA GCGTGCTCGG CGGCCTGCTC GACCGGGTGA TGGAGGCCGC CCGCGACGCC GTGCTCGGCA ACGGTCTCGA CGACGGCGTG ACCATGGGGC CGCTGGCGAC CGAGGCGCAG CTCGCGAAGT ACCACTCCTA CCTGGCCTGG GGGCGGGGGA GCGACGCCAT GCTCGAGACC CCGCGGTACG GCGCCGACCC GGACGGCGGC TTCTTCGCCC GTCCCGCGAT CTTCTCCGGC GTGCGGCCCG ACAGCCGCCT GGCCCAGGAG GAGATCTTCG GCCCGATCCT CTCCTTCCTC ACCGTGGGCG GGTACGACGA GGCGGTCGAG GTCGTCAACG GCACGCCGTA CGGGCTCTCC TCGGGCATCG TCACGACGAG CATGGCGACC GCGATGGCGT TCGCCCGCGA TGCGCGGACC GGATTGGTCA AGGTCAACCA GCCGACCACC GGGATGGCGA TGAACGCGCC GTTCGGCGGG ATGGGGAGGT CGAGCACGCA GACGCACAAG GAGCAGGCCG GCGCCTCGAT GATGGCGTTC TACACCCACG ACAAGACGAC GTACTTCTCA GCGTGA
|
Protein sequence | MSEPELRGNF VGGHWVGASG GRTFERRNPA DPADVVSVAP DSDATDVDQA VGHVATHYRE WAELAPEVRA DVLCRAADQL EQRADTLVAE LVREEGKTRA EARMEVRRAP QNLRFYAGEA QRLTGETFPT GDGSMVLTLR EPVGVVAAIT PWNFPLNIPS RKLGPALAAG NGVVFKPSEV TPLLGQRLVE ALVEAGVPGG ALALVHGHGE VGKALVSDTR IDAVTFTGST AVGEAIHANV PPWVRCQLEM GGKNAVVVCD DADLDKAAAI VVRGAFGLSG QACTGTSRVV VYESVLGGLL DRVMEAARDA VLGNGLDDGV TMGPLATEAQ LAKYHSYLAW GRGSDAMLET PRYGADPDGG FFARPAIFSG VRPDSRLAQE EIFGPILSFL TVGGYDEAVE VVNGTPYGLS SGIVTTSMAT AMAFARDART GLVKVNQPTT GMAMNAPFGG MGRSSTQTHK EQAGASMMAF YTHDKTTYFS A
|
| |