Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0037 |
Symbol | |
ID | 4598391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 41511 |
End bp | 43124 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639774652 |
Product | aldehyde dehydrogenase |
Protein accession | YP_921274 |
Protein GI | 119714309 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACCG AGACCACCGA GACCACCGAG ACCGTCACGG GCGACCAGCT CGTCGCCGGT GCGGCCACGC GCGGCTCGAG CGGTGTCTTC CACGCAGTCG ATCCGCGCAC CGGCGAGGAG CTGGCGACGG CCTTCGCCGA GGCGACCGTC GCCGAGGTGG ACCGAGCGGT CGAGGCCGCC GTGGATGCGT TCGCCTCCTT CCGTGACTGG GACGACGCGC GTCGCGCCGA CCTCCTCGAC GCGATCGCCG CGGCCCTCGT GCACGACGGT TCGGCGATCC TCTCGGCTGT GGAGGCGGAG ACCGCGCTCC CCCGCGCCCG CGCCGAGGGC GAGCTGGTCC GCACCGCCGA GCAGTTCCGC GCCTTCGCCC GGGTGCTGCG GCAGGGCTGG CACCGCGACG CGCTCGTCGA CCCGCCGGAC CCGGGGGCCG TGCCCGTCCC GCGCCCCGAC GTGCGCCGGA TCAACGTGCC CGTCGGCCCG GTCGCGGTGT TCGGCGCGAG CAACTTCCCC CTGGCATTCA GCACGCCGGG CGGCGACACG GCCGCCGCGC TCGCGGCCGG CTGCCCGGTG GTGGTCAAGG GCCACCCCAG CCATCCCGCG ACCAGCGAGC TGTGCGGACG TGCCATCGTG CGGGCCCTTC GCGAGCACGA CGCCCCTGCC GGCACCTTCT CCCTCCTGCA GAGCACCCGG AACGAGGTGG GCGCCGCGCT CGTGCAGCAC CCGCAGGTGG CCGCGGTCGG CTTCACCGGG TCGGAGGCCG GCGGGCGAGC CTTGTTCGAC CTCGCCTCGC GGCGACCGAC GCCGATCCCG GTGTACGCCG AGATGGGCAG CCTGAACCCC GTCCTGGTGA CCGTGGCCGC TCTCGAGGCG CGCGCGGACG CGATCGCGCA AGGACTCTCC GGCTCCTTCC TCTTCTGCGC CGGGCAGTAC TGCACCAAGC CGGGCCTCGT GCTCGTGCCC GAGGGCCCCG CGGGCGACCG CTTCGTGGGC CTGCTCGCCA CGACGGTCCG CGAGCAGGAG GCGTTGCCGG TGCTGGCCGC CAACATCGGC AGCGCCTTCG ACACCTCGGT CGGCGCGCTC GAGGCTGCTC TCGGAGACGA CGCCGTGGTG CACGGGCAGG CCCGCCGCCG GGGTCTGGAG CGCGAGGCCG CACTCGTGGT CGTGGACGCC GCGCGCGTGC GCGAGGCTCC CGATCTCCTC GTCGAGCACT TCGGGCCGCT GTCGGTCGTG GTGCGATACG CGAGCCCCAC CGACGTGCTG GACGTCATCG CGCAGGTGCC CGGCAGCCTC ACCGCCACCG TGCACGGCGA GCCCGACGAC CACGACCTGG TCCGTCAGCT CCTGCCCGCG CTGGTGGAGA AGGCCGGCCG GGTGCTGTGG AACGGATACC CGACGGGAGT GTCCGTGACG GGCGCGATGA TGCACGGCGG GCCGTACCCC TCCTCCACCT TCCCCGCGCA CACCTCGGTG GGGTGGACCG CCATCCGCCG CTTCCTGCGG CCGGTCACGT TCCAGAACTT CCCCGACGAA CTGCTGCCGG CCCCGCTGCG CGCCGACAAC CCCCTGGCCG CTCCCCGCCT CGTCGACGGG GCGCTGAGCA CCGGTCCCAG CTGA
|
Protein sequence | MTTETTETTE TVTGDQLVAG AATRGSSGVF HAVDPRTGEE LATAFAEATV AEVDRAVEAA VDAFASFRDW DDARRADLLD AIAAALVHDG SAILSAVEAE TALPRARAEG ELVRTAEQFR AFARVLRQGW HRDALVDPPD PGAVPVPRPD VRRINVPVGP VAVFGASNFP LAFSTPGGDT AAALAAGCPV VVKGHPSHPA TSELCGRAIV RALREHDAPA GTFSLLQSTR NEVGAALVQH PQVAAVGFTG SEAGGRALFD LASRRPTPIP VYAEMGSLNP VLVTVAALEA RADAIAQGLS GSFLFCAGQY CTKPGLVLVP EGPAGDRFVG LLATTVREQE ALPVLAANIG SAFDTSVGAL EAALGDDAVV HGQARRRGLE REAALVVVDA ARVREAPDLL VEHFGPLSVV VRYASPTDVL DVIAQVPGSL TATVHGEPDD HDLVRQLLPA LVEKAGRVLW NGYPTGVSVT GAMMHGGPYP SSTFPAHTSV GWTAIRRFLR PVTFQNFPDE LLPAPLRADN PLAAPRLVDG ALSTGPS
|
| |