Gene Noca_4281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4281 
Symbol 
ID4596796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4523084 
End bp4524529 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content73% 
IMG OID639778888 
Productaldehyde dehydrogenase 
Protein accessionYP_925465 
Protein GI119718500 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000626871 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAC CGGAGCTGCG GGGCAACTTC GTCGGCGGGC ACTGGGTGGG CGCCTCGGGC 
GGGCGCACCT TCGAGCGCCG CAACCCCGCC GACCCCGCCG ACGTCGTCTC GGTCGCGCCC
GACTCCGACG CGACCGACGT CGACCAGGCC GTCGGGCACG TCGCGACCCA CTACCGCGAG
TGGGCCGAGC TCGCGCCCGA GGTGCGCGCG GACGTGCTGT GCCGGGCCGC CGACCAGCTC
GAGCAGCGGG CCGACACCCT GGTGGCCGAG CTGGTCCGCG AGGAGGGCAA GACCCGGGCC
GAGGCCCGGA TGGAGGTGCG CCGGGCGCCA CAGAACCTCC GGTTCTACGC CGGCGAGGCC
CAGCGGCTGA CCGGCGAGAC GTTCCCGACC GGGGACGGGA GCATGGTGCT GACCCTGCGG
GAGCCGGTCG GCGTGGTCGC GGCGATCACG CCGTGGAACT TCCCGCTCAA CATCCCCTCC
CGCAAGCTCG GCCCCGCGCT CGCCGCCGGC AACGGCGTCG TGTTCAAGCC CAGCGAGGTC
ACCCCGCTCC TCGGGCAGCG GCTGGTCGAG GCCCTCGTCG AGGCCGGTGT CCCCGGCGGC
GCGCTGGCCC TGGTGCACGG TCACGGCGAG GTGGGCAAGG CCCTGGTGTC CGACACCCGG
ATCGACGCGG TCACGTTCAC CGGCTCGACG GCGGTCGGCG AGGCGATCCA CGCGAACGTG
CCGCCGTGGG TGCGCTGCCA GCTGGAGATG GGCGGCAAGA ACGCGGTCGT CGTCTGCGAC
GACGCCGACC TCGACAAGGC CGCCGCCATC GTCGTCCGCG GCGCGTTCGG GCTCAGCGGC
CAGGCGTGCA CCGGGACCTC CCGGGTCGTC GTCTACGAGA GCGTGCTCGG CGGCCTGCTC
GACCGGGTGA TGGAGGCCGC CCGCGACGCC GTGCTCGGCA ACGGTCTCGA CGACGGCGTG
ACCATGGGGC CGCTGGCGAC CGAGGCGCAG CTCGCGAAGT ACCACTCCTA CCTGGCCTGG
GGGCGGGGGA GCGACGCCAT GCTCGAGACC CCGCGGTACG GCGCCGACCC GGACGGCGGC
TTCTTCGCCC GTCCCGCGAT CTTCTCCGGC GTGCGGCCCG ACAGCCGCCT GGCCCAGGAG
GAGATCTTCG GCCCGATCCT CTCCTTCCTC ACCGTGGGCG GGTACGACGA GGCGGTCGAG
GTCGTCAACG GCACGCCGTA CGGGCTCTCC TCGGGCATCG TCACGACGAG CATGGCGACC
GCGATGGCGT TCGCCCGCGA TGCGCGGACC GGATTGGTCA AGGTCAACCA GCCGACCACC
GGGATGGCGA TGAACGCGCC GTTCGGCGGG ATGGGGAGGT CGAGCACGCA GACGCACAAG
GAGCAGGCCG GCGCCTCGAT GATGGCGTTC TACACCCACG ACAAGACGAC GTACTTCTCA
GCGTGA
 
Protein sequence
MSEPELRGNF VGGHWVGASG GRTFERRNPA DPADVVSVAP DSDATDVDQA VGHVATHYRE 
WAELAPEVRA DVLCRAADQL EQRADTLVAE LVREEGKTRA EARMEVRRAP QNLRFYAGEA
QRLTGETFPT GDGSMVLTLR EPVGVVAAIT PWNFPLNIPS RKLGPALAAG NGVVFKPSEV
TPLLGQRLVE ALVEAGVPGG ALALVHGHGE VGKALVSDTR IDAVTFTGST AVGEAIHANV
PPWVRCQLEM GGKNAVVVCD DADLDKAAAI VVRGAFGLSG QACTGTSRVV VYESVLGGLL
DRVMEAARDA VLGNGLDDGV TMGPLATEAQ LAKYHSYLAW GRGSDAMLET PRYGADPDGG
FFARPAIFSG VRPDSRLAQE EIFGPILSFL TVGGYDEAVE VVNGTPYGLS SGIVTTSMAT
AMAFARDART GLVKVNQPTT GMAMNAPFGG MGRSSTQTHK EQAGASMMAF YTHDKTTYFS
A