Gene Noca_0086 details

Gene Information

Locus tag: Noca_0086
Symbol:
ID: 4600059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 96812
End bp: 98341
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 73%
IMG OID: 639774697
Product: aldehyde dehydrogenase
Protein accession: YP_921319
Protein GI: 119714354
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
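
The coordinates, gene length, and protein length listed above are mutually consistent if the coordinates are 1-based and inclusive and the protein length excludes a single terminal stop codon. Both are assumptions, not stated in the record. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(96812, 98341)
print(length)                     # 1530
print(protein_length_aa(length))  # 509
```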


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.932272
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGGAT CCGAGACCCC GGACGACCGC CAGCCGCCCG CGACGGTGGA GCTGATGGCA 
CGAGCCGCGG CCGACGCATT CCCCGCCTGG TCGGCGACGC CTCCCCGGCA GCGTGCCTCC
GCGCTGTCGG CGGTGGCGGA CGCCGTCGCG ACCGCAGGCC CGGAGCTGAT CGCCACCGCC
ATGTCCGAGA CCGGTCTGCC GGAGGCGCGG CTCACCGGTG AGCTGAACCG CACGGTCGTC
CAGATCCGCC TGTTCGCCGA CATCGTCGTC GACGGCGCCT ACCTCGACGT GCGCATCGAC
GAGGCGGACG ACGACTTCGT CCTGGACAGC CGACCGGACC TGCGCCGCTA CCACGTGCCG
GTCGGACCCG TGCTGAACTT CGCCGCGAGC AACTTCCCCT TCGCGTTCTC GGTCATCGGA
GGCGACACCG TCTCGGCGCT GGCTGCCGGG TGCCCGGTCG TCGTCAAGGC GCACCCGGGA
CACCTGGAGC TGTCGAGGCA GACCGCCGCC GTCGTCCGCG CTGCCCTCAC CGGAAGCGGT
GCGCCCGACG CGACACTCCA GCTGCTGGTC GGGCAGGAGC AGGGCGTGGC GATGCTGCTC
GACCCGCGCA TCCGTGCCGC CAGCTTCACG GGCTCGACCC GGGCCGGCCG CATGCTGGCC
GATCTCGCCC TGGGCCGGCC GGCACCGATC CCCTTCTACG GCGAGCTCGG GAGCGTGAAC
CCGGCGTTCG TCACGTCCGA GGCAGCGGCC CAGCACGGTG CGGCGATCGC GCAGGGATTC
CTCACGAGCG TGTGCGGGTC CGCCGGCCAG CTGTGCACCA AGCCGGGCTT CCTCTTCGTG
CCCCGCGGGA GCGGCGTCAC GGCCGATGTC GCGCAAGCGG CTGGTGCGGT GGTCGAACAG
CGTCTTCTCA ACCCGTCCAT CACCGCTGGC TACACCGCGC GGCGGGACGC CATCCTCGGG
ACGCCCGGGG TTCGCGCTCT CGCCGTGGGG GACGTACGTG TCGACGGCGA CGGACAGGGC
TGGGCGACAC CGACGTTGGT GGCGACCGAT GTCGCGACGC TCCACCACCA CCGTGAGTCG
CTGCTCGACG AGGCGTTCGG CCCACTGTCG GTCGTCGTGG AGTACGACGA CGAGGCAGGG
CTGCCCGGAG TCGCCGACGA GCTGTTCGAG GGCAACCTCA CCAGCACCAT CCATGCGGGT
GACGGTGAGG ACACGCCCAC GTTGCGCGCT CTCGTCGACT GGGCCGCGCG GACCACCGGC
CGCATCGTCT TCGGGGGTTG GCCGACCGGC GTGTCCGTGA CTCATGCGAC CCAGCACGGG
GGGCCCTGGC CGGCGACGAC GAACGACGCC GGGACGTCGG TTGGGAGCGC GGCCATCGGG
AGGTTTCTGC GCGCCGTCGC CTACCAGGAC ACGCCGCAAG CACTGCTTCC GGCGCCGTTG
CGCGACGACA ACCCGTGGGG TGTGCCGCAG CTGCGCTCGC CCGCCGGTCG GTCGCGGAGC
TGGGGCGAGG CGTTCCGCGT CGACGGGTGA
 
Protein sequence
MDGSETPDDR QPPATVELMA RAAADAFPAW SATPPRQRAS ALSAVADAVA TAGPELIATA 
MSETGLPEAR LTGELNRTVV QIRLFADIVV DGAYLDVRID EADDDFVLDS RPDLRRYHVP
VGPVLNFAAS NFPFAFSVIG GDTVSALAAG CPVVVKAHPG HLELSRQTAA VVRAALTGSG
APDATLQLLV GQEQGVAMLL DPRIRAASFT GSTRAGRMLA DLALGRPAPI PFYGELGSVN
PAFVTSEAAA QHGAAIAQGF LTSVCGSAGQ LCTKPGFLFV PRGSGVTADV AQAAGAVVEQ
RLLNPSITAG YTARRDAILG TPGVRALAVG DVRVDGDGQG WATPTLVATD VATLHHHRES
LLDEAFGPLS VVVEYDDEAG LPGVADELFE GNLTSTIHAG DGEDTPTLRA LVDWAARTTG
RIVFGGWPTG VSVTHATQHG GPWPATTNDA GTSVGSAAIG RFLRAVAYQD TPQALLPAPL
RDDNPWGVPQ LRSPAGRSRS WGEAFRVDG
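
The protein sequence corresponds to the gene sequence translated with translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. The sketch below is a minimal, dependency-free illustration rather than the method used to produce this record. It assumes the input uses the 10-base grouped layout shown above and that translation starts in frame 1; the names `clean`, `translate`, and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 shares
# these amino acid assignments with the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the grouped layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence.
print(translate("ATGGACGGAT CCGAGACCCC GGACGACCGC"))  # MDGSETPDDR
```

Running `translate` on the full gene sequence and comparing the result with the listed protein sequence is a quick consistency check; the output of `gc_fraction` can likewise be compared with the reported GC content.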