Gene Noca_3529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3529 
Symbol 
ID4595711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3740200 
End bp3741654 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content70% 
IMG OID639778137 
Productaldehyde dehydrogenase (acceptor) 
Protein accessionYP_924716 
Protein GI119717751 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTCGC TCTTTGAGTA CGCCCCCGCA CCCGAGTCGC GCGCCGTGGT CGACATCAAG 
CCGTCGTACG GCCTGTTCGT CAACGGTGCT TTCGTCGACG GCCACGGCGC GTCGTTCAAG
ACGATCAGCC CCGCGACCGA GGAGGTGCTC GCCGAGATCT CCGAGGCCGA CGAGTCCGAC
GTCGATGCGG CGGTGAAGGC CGCGCGTACC GCCTACGACA AGGTCTGGTC GCGGATGCCC
GGCCGCGAGC GCGCCAAGTA CCTCTACCGG ATCGCCCGGA TCATCCAGGA GCGCAGCCGT
GAGCTCGCCG TGCTGGAGTC GCTCGACAAC GGCAAGCCGA TCAAGGAGTC GCGCGACGTC
GACGTGCCGA TCGCGGCGGC GCACTTCTTC TACTACGCCG GCTGGGCGGA CAAGCTCGAG
TACGCGGGCC ACGGCCGGGA TCCGCAGCCG CTGGGCGTCG CCGCCCAGGT GATCCCGTGG
AACTTCCCGC TGCTGATGCT GTCGTGGAAG ATCGCGCCGG CGCTGGCCTG CGGCAACACC
GTGGTGCTCA AGCCCGCGGA GACCACGCCG CTCAGCGCGC TGCTGTTCGC CGAGATCTGC
CAGCAGGCCG ACCTGCCGCC GGGCGTGGTC AACATCGTCA CCGGAGCCGG CGGCACCGGC
CAGGCGCTCG TCGGCCACCC CGGGGTCGAC AAGGTCGCGT TCACCGGCTC GACCGAGGTC
GGCAAGGCGA TCGCCCGGTC GGTCGCCGGC ACCAGCAAGC GGGTCACCCT CGAGCTCGGC
GGCAAGGCCG CCAACATCGT CTTCGACGAC GCGCCGATCG ACCAGGCCGT CGAGGGCATC
GTCGACGGGA TCTTCTTCAA CCAGGGCCAC GTCTGCTGCG CGGGCTCCCG GCTGCTGGTC
CAGGAGAGCA TCGCCGAGGA CCTGCTCGAG CGGCTCAAGG CACGGATGTC CACGCTGCGC
ATGGGCGACC CGCTCGACAA GAACACCGAC ATCGGCGCGA TCAACTCCGG CGAGCAGCTC
AAGCGGATCC GCGAGCTCTC CGAGGTCGGC GACGCCGAGG GTGCCGAGCG CTGGGAGGTC
GCCTGCGACC TGCCCACCAA GGGGTTCTGG TTCCCGCCGA CCATCTTCAC CGGCGTCTCC
CAGGCCCACC GGATCGCCCG CGAGGAGATC TTCGGCCCGG TGCTGTCGGT GCTGACCTTC
CGCACCCCGG CCGAGGCGCT CGAGAAGGCC AACAACACGC CGTACGGCCT GTCCGCGGGC
GTGTGGACCG ACAAGGGCTC GCTGATCCTC AAGATGGCCG CCTCGCTGCG CGCCGGCGTG
GTCTGGGCCA ACACGTTCAA CAAGTTCGAC CCGACCAGCC CGTTCGGTGG CTACAAGGAG
TCGGGCTACG GCCGCGAGGG CGGCCGCCAC GGGCTGGAGG CGTACCTAAG ATCACCCCAC
GAAGGCGCGC GATGA
 
Protein sequence
MPSLFEYAPA PESRAVVDIK PSYGLFVNGA FVDGHGASFK TISPATEEVL AEISEADESD 
VDAAVKAART AYDKVWSRMP GRERAKYLYR IARIIQERSR ELAVLESLDN GKPIKESRDV
DVPIAAAHFF YYAGWADKLE YAGHGRDPQP LGVAAQVIPW NFPLLMLSWK IAPALACGNT
VVLKPAETTP LSALLFAEIC QQADLPPGVV NIVTGAGGTG QALVGHPGVD KVAFTGSTEV
GKAIARSVAG TSKRVTLELG GKAANIVFDD APIDQAVEGI VDGIFFNQGH VCCAGSRLLV
QESIAEDLLE RLKARMSTLR MGDPLDKNTD IGAINSGEQL KRIRELSEVG DAEGAERWEV
ACDLPTKGFW FPPTIFTGVS QAHRIAREEI FGPVLSVLTF RTPAEALEKA NNTPYGLSAG
VWTDKGSLIL KMAASLRAGV VWANTFNKFD PTSPFGGYKE SGYGREGGRH GLEAYLRSPH
EGAR