Gene Noca_1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1053 
Symbol 
ID4599659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1110758 
End bp1112275 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content74% 
IMG OID639775651 
Productaldehyde dehydrogenase 
Protein accessionYP_922258 
Protein GI119715293 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAGG CCCCCGCCCG CCCGCAGTCC CCTCAGCCCG CCGAGCCGGC CCCGTCCGGC 
TCGACGTTCG AGTCGCTGGA CCCGGCGACC GGGGCGGTCG TCGGGACGTT CCCGGTGCAC
GGCGAGGCGG AGGTGCGCGC GGCCGTCGAG CGCGCCCGCA CGGCCGCCGA GTGGTGGTCG
GCGCTCTCGT TCAAGGACCG CAAGGTCTAC CTGACCACCT GGAAGGCCGC GATCACCCGC
CGGATGCCCG AACTCGCGGA GCTCATGCAC CGCGAGACCG GCAAGCCCCG CTCCGACGCG
ATGCTCGAGG CGACGCTCGG GGTCGACCAC CTCGGCTGGG CCGCCGGCCA CGCGGGCAAG
GTGCTGGGGC GGCACCGGGT CTCGCCCGGG ATGTTGATGG TCAACCAGGC CGCGACCGTG
GAGTTCCGCC CGCTCGGCGT CGTCGGCGTG ATCGGCCCGT GGAACTACCC GGTGTTCACC
CCGCTCGGCT CGATCGCCTA CGCGCTCGCG GCCGGCAACG CCGTGGTGTT CAAGCCCAGC
GAGCACACGC CCGCGGTCGG CGAGTGGCTG GCCCGCACGT TCGGCGAGTG CGTCGGGCGA
CCGGTCCTCC AGGTCGTCAC CGGCCGCGGC GAGACCGGTG CCGCGCTGTG CCGCTCCGGG
GTCGACAAGG TGGCGTTCAC CGGCTCGACC GGCACGGGGA AGAAGGTGAT GGCCGCCTGC
GCCGAGACCC TGACCCCGGT CGTCATCGAG GCCGGTGGCA AGGACCCGCT GATCGTCGAC
GCGGACGCCG ACGTCCCGGC CGCCGCCGAC GCCGCGCTGT GGGGCGCCTG CAGCAACGCC
GGCCAGACCT GCGCGGGCGT CGAGCGGGTC TACGTGCACG AGCGGGTGTA CGACGAGTTC
CTCGCCGAGA TCACCCGCAA GGCGCAGGGC CTGAGCGCCC ATGGCGGCGA CGACGCGAAG
ATCGGCCCGA TCACGATGCC GGGCCAGCTC GACGTGATCC GCCGCCACAT CGACGACGCG
CTCGAACGCG GCGGCCGCGC GGTCGTCGGC GGGGCGGACG CGGTGGGCGA GCGGTTCGTG
CAGCCCACGA TCCTCGTCGA CGTGCCCGAG GACTCCGCGG CGGTCCAGGA GGAGACGTTC
GGCCCGACCG TGACGATCGC GAAGGTGCGC GACATGGACG AGGCGATCGA GCTCGCCAAC
GGGACGCCGT ACGGCCTCGG GGCGACGGTC TTCAGCAGGA GCAACGGGAT GGCCATCGCC
GAGCGGATCC GCTCCGGCAT GACCGCGATC AACGCGGTGA TCTCGTTCGC GGCGATCCCG
AGCCTGCCGT TCGGCGGCGT CGGCGACTCC GGATTCGGAC GGATCCACGG GCCCGAGGGC
CTCAAGGAGT TCACCTACGC GAAGGCGATC GCCCGGCAGC GGTTCAAGCC GGCCCTCGCG
CTGACCACGT TCGAGCGCAC CGAGCAGGCC GACCGGCGGC TCGCCGCGAT CGTCCGGGCG
CTGCACGGCC GCGGCTGA
 
Protein sequence
MTQAPARPQS PQPAEPAPSG STFESLDPAT GAVVGTFPVH GEAEVRAAVE RARTAAEWWS 
ALSFKDRKVY LTTWKAAITR RMPELAELMH RETGKPRSDA MLEATLGVDH LGWAAGHAGK
VLGRHRVSPG MLMVNQAATV EFRPLGVVGV IGPWNYPVFT PLGSIAYALA AGNAVVFKPS
EHTPAVGEWL ARTFGECVGR PVLQVVTGRG ETGAALCRSG VDKVAFTGST GTGKKVMAAC
AETLTPVVIE AGGKDPLIVD ADADVPAAAD AALWGACSNA GQTCAGVERV YVHERVYDEF
LAEITRKAQG LSAHGGDDAK IGPITMPGQL DVIRRHIDDA LERGGRAVVG GADAVGERFV
QPTILVDVPE DSAAVQEETF GPTVTIAKVR DMDEAIELAN GTPYGLGATV FSRSNGMAIA
ERIRSGMTAI NAVISFAAIP SLPFGGVGDS GFGRIHGPEG LKEFTYAKAI ARQRFKPALA
LTTFERTEQA DRRLAAIVRA LHGRG