Gene Noca_0037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0037 
Symbol 
ID4598391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp41511 
End bp43124 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content75% 
IMG OID639774652 
Productaldehyde dehydrogenase 
Protein accessionYP_921274 
Protein GI119714309 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACCG AGACCACCGA GACCACCGAG ACCGTCACGG GCGACCAGCT CGTCGCCGGT 
GCGGCCACGC GCGGCTCGAG CGGTGTCTTC CACGCAGTCG ATCCGCGCAC CGGCGAGGAG
CTGGCGACGG CCTTCGCCGA GGCGACCGTC GCCGAGGTGG ACCGAGCGGT CGAGGCCGCC
GTGGATGCGT TCGCCTCCTT CCGTGACTGG GACGACGCGC GTCGCGCCGA CCTCCTCGAC
GCGATCGCCG CGGCCCTCGT GCACGACGGT TCGGCGATCC TCTCGGCTGT GGAGGCGGAG
ACCGCGCTCC CCCGCGCCCG CGCCGAGGGC GAGCTGGTCC GCACCGCCGA GCAGTTCCGC
GCCTTCGCCC GGGTGCTGCG GCAGGGCTGG CACCGCGACG CGCTCGTCGA CCCGCCGGAC
CCGGGGGCCG TGCCCGTCCC GCGCCCCGAC GTGCGCCGGA TCAACGTGCC CGTCGGCCCG
GTCGCGGTGT TCGGCGCGAG CAACTTCCCC CTGGCATTCA GCACGCCGGG CGGCGACACG
GCCGCCGCGC TCGCGGCCGG CTGCCCGGTG GTGGTCAAGG GCCACCCCAG CCATCCCGCG
ACCAGCGAGC TGTGCGGACG TGCCATCGTG CGGGCCCTTC GCGAGCACGA CGCCCCTGCC
GGCACCTTCT CCCTCCTGCA GAGCACCCGG AACGAGGTGG GCGCCGCGCT CGTGCAGCAC
CCGCAGGTGG CCGCGGTCGG CTTCACCGGG TCGGAGGCCG GCGGGCGAGC CTTGTTCGAC
CTCGCCTCGC GGCGACCGAC GCCGATCCCG GTGTACGCCG AGATGGGCAG CCTGAACCCC
GTCCTGGTGA CCGTGGCCGC TCTCGAGGCG CGCGCGGACG CGATCGCGCA AGGACTCTCC
GGCTCCTTCC TCTTCTGCGC CGGGCAGTAC TGCACCAAGC CGGGCCTCGT GCTCGTGCCC
GAGGGCCCCG CGGGCGACCG CTTCGTGGGC CTGCTCGCCA CGACGGTCCG CGAGCAGGAG
GCGTTGCCGG TGCTGGCCGC CAACATCGGC AGCGCCTTCG ACACCTCGGT CGGCGCGCTC
GAGGCTGCTC TCGGAGACGA CGCCGTGGTG CACGGGCAGG CCCGCCGCCG GGGTCTGGAG
CGCGAGGCCG CACTCGTGGT CGTGGACGCC GCGCGCGTGC GCGAGGCTCC CGATCTCCTC
GTCGAGCACT TCGGGCCGCT GTCGGTCGTG GTGCGATACG CGAGCCCCAC CGACGTGCTG
GACGTCATCG CGCAGGTGCC CGGCAGCCTC ACCGCCACCG TGCACGGCGA GCCCGACGAC
CACGACCTGG TCCGTCAGCT CCTGCCCGCG CTGGTGGAGA AGGCCGGCCG GGTGCTGTGG
AACGGATACC CGACGGGAGT GTCCGTGACG GGCGCGATGA TGCACGGCGG GCCGTACCCC
TCCTCCACCT TCCCCGCGCA CACCTCGGTG GGGTGGACCG CCATCCGCCG CTTCCTGCGG
CCGGTCACGT TCCAGAACTT CCCCGACGAA CTGCTGCCGG CCCCGCTGCG CGCCGACAAC
CCCCTGGCCG CTCCCCGCCT CGTCGACGGG GCGCTGAGCA CCGGTCCCAG CTGA
 
Protein sequence
MTTETTETTE TVTGDQLVAG AATRGSSGVF HAVDPRTGEE LATAFAEATV AEVDRAVEAA 
VDAFASFRDW DDARRADLLD AIAAALVHDG SAILSAVEAE TALPRARAEG ELVRTAEQFR
AFARVLRQGW HRDALVDPPD PGAVPVPRPD VRRINVPVGP VAVFGASNFP LAFSTPGGDT
AAALAAGCPV VVKGHPSHPA TSELCGRAIV RALREHDAPA GTFSLLQSTR NEVGAALVQH
PQVAAVGFTG SEAGGRALFD LASRRPTPIP VYAEMGSLNP VLVTVAALEA RADAIAQGLS
GSFLFCAGQY CTKPGLVLVP EGPAGDRFVG LLATTVREQE ALPVLAANIG SAFDTSVGAL
EAALGDDAVV HGQARRRGLE REAALVVVDA ARVREAPDLL VEHFGPLSVV VRYASPTDVL
DVIAQVPGSL TATVHGEPDD HDLVRQLLPA LVEKAGRVLW NGYPTGVSVT GAMMHGGPYP
SSTFPAHTSV GWTAIRRFLR PVTFQNFPDE LLPAPLRADN PLAAPRLVDG ALSTGPS