Gene Noca_4031 details


Gene Information

Locus tag: Noca_4031
Symbol:
ID: 4596545
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4254499
End bp: 4256034
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 72%
IMG OID: 639778637
Product: acetaldehyde dehydrogenase (acetylating)
Protein accession: YP_925215
Protein GI: 119718250
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR02518] acetaldehyde dehydrogenase (acetylating)
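
The length fields can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the stop codon; the record does not state either convention, but the reported values are consistent with both. The function names are illustrative only.

```python
def feature_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = feature_length_bp(4254499, 4256034)
print(gene_len)                     # 1536
print(protein_length_aa(gene_len))  # 511
```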


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.808002
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCACG ACGAGCTCGA CAGCGACCTG CGCTCGATCC AGGAGGCCCG CCGCCTCGCC 
ACGGCGGCCC GGGCGGCTCA GCGGGAGTTC GCCCACGCCT CGCAGGCCGA GGTGGACCGG
ATCTGCGCGG CGATGGCCGA CGCGGTCTAC CGTGAGGCCG CCCGCCTCGG GCAGCTGGCG
ACCGACGAGA CCGGGTACGG CGTACCCGCC CACAAGCGGC TCAAGGTCGA GTTCGCCTCG
CGCACGGTGT GGGAGTCGAT CCGCGACGTG CCGACCGTGG GCGTGCTGCG CCGAGACGAG
GCGAAGGGGA TCGTCGAGAT CGGCTGGCCG GTCGGCGTGA TCGTCGGCCT GTGCCCCTCC
ACCAACCCCA ACTCGACGGC GATCTACAAG GTGCTGATCT CGGTCAAGGC GCGCAACGCC
TGCATCATCG CCCCGCACCC CTCGGCCAAG GCCGCCACCT ACGAGGCGGT GCGGATCATG
ATCGAGGCGG GGGAGCGGGC CGGCATGCCC AAGGGCCTGG TCGGCTGCAT GCAGGAGGTC
AGCCTCCCCG GCTCCCAGGA GCTGATGCGG CACTACGCGA CGTCGATGAT CCTGGCCACC
GGCGGCACGC CGATGGTGCG CGCGGCCCAC AGCATGGGCA AGCCCGCGCT CGGCGTCGGG
CCCGGCAACG TCCCGGCGTA CGTCGACCGC AGTGCGGACG TGCTGGCGGC CGCCACCGCG
ATCGTCAACA GCAAGTCCTT CGACTGCTCC ACGATCTGTG CGACCGAGCA GGCGGTCGTA
GCGGACGCGC CGATCGCCGG CGCGCTGCGC GCCGAGATGG AGCGCCTCGG CGCCTACTTC
GTCTCTGCGG AGGAGAAGGC GGCGCTCGAG CGCACCGTGT TCAACCCGGG CGGCGCGATG
AACCCCAAGG CGGTCGGGAA GTCGCCGCAG GCCCTGGCGG CGCTGGCGGG CATCCAGGTC
CCCGAGCATG CCCGGATCCT CGTTGCCGAG CTGGGCAGCG TCGGTCCGCA GGAGCCGCTC
AGCGCCGAGA AGCTCACCAC CGTGCTCGGC TGGTACGTCG AGGACGGCTG GCGGGCCGGC
TGCGAGCGGT CGATCGAGCT GCTGAAGTTC GGCGGCGACG GGCACTCGCT GGTGATCCAC
GCGACCGACG AGGAGGTGAT CATGGCGTTC GGGCTAGAGA AGCCCGCCTT CCGGATCCTC
GTCAACACCT GGGGCACCCT CGGCGCGATC GGTGCGACGA CCGGCGTGAT GCCGGCGCTG
ACGCTCGCCC CGGGCGGGAT CGGCGGTGCC GTGGTCAGCG ACAACATCAC CGTTACGCAC
CTGCTCAACG TCAAGCGTCT GGCCTTCAAG CTGCACGAGC CGCCCGCCGC GGCGTACGAG
CACGCACCCG ACGTGCGGGG CGCCCCCCGC CACGACGGCC CCCGCTCGGC CGAGGCGACC
CCGGCGGCGC GCGTCGCCGA ACCCGCTGCG GTGAGCGGGG ACCAGGTGGA ACGCATCGTC
CGCCGGGTGC TCAGCGAGCT CGGAGCCGGC CGATGA
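
A GC content figure such as the one listed under Gene Information can be recomputed from the gene sequence. This is a minimal sketch that assumes the value is simply the fraction of G and C bases over the full sequence, with the blank-separated 10-base blocks joined first; how the database itself derives and rounds the number is not stated in the record. `gc_content` is a hypothetical helper name.

```python
def gc_content(dna: str) -> float:
    """Fraction of G/C bases in a DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First 10-base block of the gene sequence above.
print(gc_content("ATGACGCACG"))  # 0.6
```

Passing the whole sequence, pasted with its spaces and line breaks, gives the gene-wide fraction.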
 
Protein sequence
MTHDELDSDL RSIQEARRLA TAARAAQREF AHASQAEVDR ICAAMADAVY REAARLGQLA 
TDETGYGVPA HKRLKVEFAS RTVWESIRDV PTVGVLRRDE AKGIVEIGWP VGVIVGLCPS
TNPNSTAIYK VLISVKARNA CIIAPHPSAK AATYEAVRIM IEAGERAGMP KGLVGCMQEV
SLPGSQELMR HYATSMILAT GGTPMVRAAH SMGKPALGVG PGNVPAYVDR SADVLAAATA
IVNSKSFDCS TICATEQAVV ADAPIAGALR AEMERLGAYF VSAEEKAALE RTVFNPGGAM
NPKAVGKSPQ ALAALAGIQV PEHARILVAE LGSVGPQEPL SAEKLTTVLG WYVEDGWRAG
CERSIELLKF GGDGHSLVIH ATDEEVIMAF GLEKPAFRIL VNTWGTLGAI GATTGVMPAL
TLAPGGIGGA VVSDNITVTH LLNVKRLAFK LHEPPAAAYE HAPDVRGAPR HDGPRSAEAT
PAARVAEPAA VSGDQVERIV RRVLSELGAG R
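
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table by hand rather than using a bioinformatics library. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the set of permitted start codons, which does not matter here because this gene begins with ATG. `translate` is an illustrative name, and stopping at the first stop codon is an assumption of this sketch.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGACGCACG ACGAGCTCGA CAGCGACCTG"))  # MTHDELDSDL
```

The output matches the first ten residues of the protein sequence above.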