Gene Information |
Locus tag | Noca_4031 |
Symbol | |
ID | 4596545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4254499 |
End bp | 4256034 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639778637 |
Product | acetaldehyde dehydrogenase (acetylating) |
Protein accession | YP_925215 |
Protein GI | 119718250 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02518] acetaldehyde dehydrogenase (acetylating) |
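The coordinate, gene length, and protein length fields above are mutually consistent. A minimal sketch of that arithmetic, assuming the start and end positions are inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information table.
start_bp = 4254499
end_bp = 4256034

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon, which is not counted in the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1536 511
```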
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.808002 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCACG ACGAGCTCGA CAGCGACCTG CGCTCGATCC AGGAGGCCCG CCGCCTCGCC ACGGCGGCCC GGGCGGCTCA GCGGGAGTTC GCCCACGCCT CGCAGGCCGA GGTGGACCGG ATCTGCGCGG CGATGGCCGA CGCGGTCTAC CGTGAGGCCG CCCGCCTCGG GCAGCTGGCG ACCGACGAGA CCGGGTACGG CGTACCCGCC CACAAGCGGC TCAAGGTCGA GTTCGCCTCG CGCACGGTGT GGGAGTCGAT CCGCGACGTG CCGACCGTGG GCGTGCTGCG CCGAGACGAG GCGAAGGGGA TCGTCGAGAT CGGCTGGCCG GTCGGCGTGA TCGTCGGCCT GTGCCCCTCC ACCAACCCCA ACTCGACGGC GATCTACAAG GTGCTGATCT CGGTCAAGGC GCGCAACGCC TGCATCATCG CCCCGCACCC CTCGGCCAAG GCCGCCACCT ACGAGGCGGT GCGGATCATG ATCGAGGCGG GGGAGCGGGC CGGCATGCCC AAGGGCCTGG TCGGCTGCAT GCAGGAGGTC AGCCTCCCCG GCTCCCAGGA GCTGATGCGG CACTACGCGA CGTCGATGAT CCTGGCCACC GGCGGCACGC CGATGGTGCG CGCGGCCCAC AGCATGGGCA AGCCCGCGCT CGGCGTCGGG CCCGGCAACG TCCCGGCGTA CGTCGACCGC AGTGCGGACG TGCTGGCGGC CGCCACCGCG ATCGTCAACA GCAAGTCCTT CGACTGCTCC ACGATCTGTG CGACCGAGCA GGCGGTCGTA GCGGACGCGC CGATCGCCGG CGCGCTGCGC GCCGAGATGG AGCGCCTCGG CGCCTACTTC GTCTCTGCGG AGGAGAAGGC GGCGCTCGAG CGCACCGTGT TCAACCCGGG CGGCGCGATG AACCCCAAGG CGGTCGGGAA GTCGCCGCAG GCCCTGGCGG CGCTGGCGGG CATCCAGGTC CCCGAGCATG CCCGGATCCT CGTTGCCGAG CTGGGCAGCG TCGGTCCGCA GGAGCCGCTC AGCGCCGAGA AGCTCACCAC CGTGCTCGGC TGGTACGTCG AGGACGGCTG GCGGGCCGGC TGCGAGCGGT CGATCGAGCT GCTGAAGTTC GGCGGCGACG GGCACTCGCT GGTGATCCAC GCGACCGACG AGGAGGTGAT CATGGCGTTC GGGCTAGAGA AGCCCGCCTT CCGGATCCTC GTCAACACCT GGGGCACCCT CGGCGCGATC GGTGCGACGA CCGGCGTGAT GCCGGCGCTG ACGCTCGCCC CGGGCGGGAT CGGCGGTGCC GTGGTCAGCG ACAACATCAC CGTTACGCAC CTGCTCAACG TCAAGCGTCT GGCCTTCAAG CTGCACGAGC CGCCCGCCGC GGCGTACGAG CACGCACCCG ACGTGCGGGG CGCCCCCCGC CACGACGGCC CCCGCTCGGC CGAGGCGACC CCGGCGGCGC GCGTCGCCGA ACCCGCTGCG GTGAGCGGGG ACCAGGTGGA ACGCATCGTC CGCCGGGTGC TCAGCGAGCT CGGAGCCGGC CGATGA
|
Protein sequence | MTHDELDSDL RSIQEARRLA TAARAAQREF AHASQAEVDR ICAAMADAVY REAARLGQLA TDETGYGVPA HKRLKVEFAS RTVWESIRDV PTVGVLRRDE AKGIVEIGWP VGVIVGLCPS TNPNSTAIYK VLISVKARNA CIIAPHPSAK AATYEAVRIM IEAGERAGMP KGLVGCMQEV SLPGSQELMR HYATSMILAT GGTPMVRAAH SMGKPALGVG PGNVPAYVDR SADVLAAATA IVNSKSFDCS TICATEQAVV ADAPIAGALR AEMERLGAYF VSAEEKAALE RTVFNPGGAM NPKAVGKSPQ ALAALAGIQV PEHARILVAE LGSVGPQEPL SAEKLTTVLG WYVEDGWRAG CERSIELLKF GGDGHSLVIH ATDEEVIMAF GLEKPAFRIL VNTWGTLGAI GATTGVMPAL TLAPGGIGGA VVSDNITVTH LLNVKRLAFK LHEPPAAAYE HAPDVRGAPR HDGPRSAEAT PAARVAEPAA VSGDQVERIV RRVLSELGAG R
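The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is illustrative only: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, it handles only uppercase A/C/G/T input, and it does not model the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). It translates the first 30 bases of the gene sequence as a check.

```python
# Codon ordering used by NCBI genetic-code tables (bases in T, C, A, G order).
BASES = "TCAG"
# Amino-acid assignments for translation table 11; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base groups and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, as grouped in the record.
gene_prefix = "ATGACGCACG ACGAGCTCGA CAGCGACCTG"
print(translate(gene_prefix))  # MTHDELDSDL
```

Passing the full gene sequence to `translate` should yield the 511-residue protein, and `gc_fraction` on the full sequence can be compared against the reported GC content.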
|
| |