Gene Information |
Locus tag | Noca_3529 |
Symbol | |
ID | 4595711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 3740200 |
End bp | 3741654 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639778137 |
Product | aldehyde dehydrogenase (acceptor) |
Protein accession | YP_924716 |
Protein GI | 119717751 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
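The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based with both ends included, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 3740200
END_BP = 3741654


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based coordinates with both ends included."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1455 484"
```

Under these assumptions the computed values agree with the listed Gene Length (1455 bp) and Protein Length (484 aa).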
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTCGC TCTTTGAGTA CGCCCCCGCA CCCGAGTCGC GCGCCGTGGT CGACATCAAG CCGTCGTACG GCCTGTTCGT CAACGGTGCT TTCGTCGACG GCCACGGCGC GTCGTTCAAG ACGATCAGCC CCGCGACCGA GGAGGTGCTC GCCGAGATCT CCGAGGCCGA CGAGTCCGAC GTCGATGCGG CGGTGAAGGC CGCGCGTACC GCCTACGACA AGGTCTGGTC GCGGATGCCC GGCCGCGAGC GCGCCAAGTA CCTCTACCGG ATCGCCCGGA TCATCCAGGA GCGCAGCCGT GAGCTCGCCG TGCTGGAGTC GCTCGACAAC GGCAAGCCGA TCAAGGAGTC GCGCGACGTC GACGTGCCGA TCGCGGCGGC GCACTTCTTC TACTACGCCG GCTGGGCGGA CAAGCTCGAG TACGCGGGCC ACGGCCGGGA TCCGCAGCCG CTGGGCGTCG CCGCCCAGGT GATCCCGTGG AACTTCCCGC TGCTGATGCT GTCGTGGAAG ATCGCGCCGG CGCTGGCCTG CGGCAACACC GTGGTGCTCA AGCCCGCGGA GACCACGCCG CTCAGCGCGC TGCTGTTCGC CGAGATCTGC CAGCAGGCCG ACCTGCCGCC GGGCGTGGTC AACATCGTCA CCGGAGCCGG CGGCACCGGC CAGGCGCTCG TCGGCCACCC CGGGGTCGAC AAGGTCGCGT TCACCGGCTC GACCGAGGTC GGCAAGGCGA TCGCCCGGTC GGTCGCCGGC ACCAGCAAGC GGGTCACCCT CGAGCTCGGC GGCAAGGCCG CCAACATCGT CTTCGACGAC GCGCCGATCG ACCAGGCCGT CGAGGGCATC GTCGACGGGA TCTTCTTCAA CCAGGGCCAC GTCTGCTGCG CGGGCTCCCG GCTGCTGGTC CAGGAGAGCA TCGCCGAGGA CCTGCTCGAG CGGCTCAAGG CACGGATGTC CACGCTGCGC ATGGGCGACC CGCTCGACAA GAACACCGAC ATCGGCGCGA TCAACTCCGG CGAGCAGCTC AAGCGGATCC GCGAGCTCTC CGAGGTCGGC GACGCCGAGG GTGCCGAGCG CTGGGAGGTC GCCTGCGACC TGCCCACCAA GGGGTTCTGG TTCCCGCCGA CCATCTTCAC CGGCGTCTCC CAGGCCCACC GGATCGCCCG CGAGGAGATC TTCGGCCCGG TGCTGTCGGT GCTGACCTTC CGCACCCCGG CCGAGGCGCT CGAGAAGGCC AACAACACGC CGTACGGCCT GTCCGCGGGC GTGTGGACCG ACAAGGGCTC GCTGATCCTC AAGATGGCCG CCTCGCTGCG CGCCGGCGTG GTCTGGGCCA ACACGTTCAA CAAGTTCGAC CCGACCAGCC CGTTCGGTGG CTACAAGGAG TCGGGCTACG GCCGCGAGGG CGGCCGCCAC GGGCTGGAGG CGTACCTAAG ATCACCCCAC GAAGGCGCGC GATGA
Protein sequence | MPSLFEYAPA PESRAVVDIK PSYGLFVNGA FVDGHGASFK TISPATEEVL AEISEADESD VDAAVKAART AYDKVWSRMP GRERAKYLYR IARIIQERSR ELAVLESLDN GKPIKESRDV DVPIAAAHFF YYAGWADKLE YAGHGRDPQP LGVAAQVIPW NFPLLMLSWK IAPALACGNT VVLKPAETTP LSALLFAEIC QQADLPPGVV NIVTGAGGTG QALVGHPGVD KVAFTGSTEV GKAIARSVAG TSKRVTLELG GKAANIVFDD APIDQAVEGI VDGIFFNQGH VCCAGSRLLV QESIAEDLLE RLKARMSTLR MGDPLDKNTD IGAINSGEQL KRIRELSEVG DAEGAERWEV ACDLPTKGFW FPPTIFTGVS QAHRIAREEI FGPVLSVLTF RTPAEALEKA NNTPYGLSAG VWTDKGSLIL KMAASLRAGV VWANTFNKFD PTSPFGGYKE SGYGREGGRH GLEAYLRSPH EGAR
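As a rough consistency check, the gene sequence can be translated and compared with the protein sequence. The following is a minimal sketch, not the database's own method. It relies on the fact that NCBI translation table 11 assigns amino acids to codons exactly as the standard code does; the alternative start codons that table 11 permits are not modelled, which is harmless here because this gene starts with ATG. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... (bases cycle as T, C, A, G).
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate in frame 1; optionally stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First seven codons and last six codons of the gene sequence above.
print(translate("ATGCCTTCGC TCTTTGAGTA C"))  # prints "MPSLFEY"
print(translate("CAC GAAGGCGCGC GATGA"))     # prints "HEGAR"
```

Passing the full gene sequence to `translate` should reproduce the protein sequence if the record is internally consistent, and `gc_fraction` on the same string gives a value to compare with the listed GC content.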