Gene Svir_20250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSvir_20250 
Symbol 
ID8387352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharomonospora viridis DSM 43017 
KingdomBacteria 
Replicon accessionNC_013159 
Strand
Start bp2160085 
End bp2161545 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content69% 
IMG OID644976088 
Productbenzaldehyde dehydrogenase (NAD+) 
Protein accessionYP_003133870 
Protein GI257056038 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.177841 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.892194 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGCTA GTACAGCCGG TCCGGTCGGT GACAGCCCCA CTCGCTGGTC TTCCTCGGCG 
GCCACGGGCG GTACTTTGAC GGTGACGGCG CCTGCCACGG GCAGTGTCCT CGCCGAGGTC
GACGCGGCTT CCCCCGCCGA TGTCGACCGT GCCGTCGCCA AGGCGAAGCA GGCCCAGCGG
GACTGGGCCG CCACCACCTA CGACCAGCGC GCCGCGGTGC TGCGGCGGGC CGCTCGACTG
TTGGAAGCCG ACCCCGACCG GTTGCGCCGT TGGCTCGTCC CCGAATCGGG TTCGGCGATG
GGCAAGGCGT CGTTCGAGGT CGGCCTGGTG GTCTCGGAAC TCGACGAATG CGCGGCGCTG
GCCTCTCATC CCTACGGTGA ACTGCTGCAC TCCACCAAAC CGCGCCTGTC GCTGGCACGT
CGCGTACCGG TCGGCGTGGT GGGGGTGATC TCGCCGTTCA ACTTCCCCGG AATCCTGTCG
ATGCGGTCCG TCGCGCCCGC GTTGGCGGTG GGCAACGCCG TGGTGTTGAA ACCCGATCCC
CGCACGCCGA TCTCGGGTGG TCTCGCGCTC GCCGAGCTGC TCGCCGAGGC GGGGTTGCCC
GACGGGTTGC TCACCGTGTT GCCCGGCGGC GCGGAGGTCG GACAGGCGCT GGTGGCCCAC
CCCGACGTGC CGTGCATCTC GTTCACCGGA TCCACCCCGG CCGGACGCAA GATCGCGGAG
GCCGCCGCGC CGCTGCTCAA GCGCGTGCAC CTGGAACTCG GGGGCAACAA CGCACTACTC
GTGCTCCCGG ACGCCGACGT CGAAGCCGCC GCCTCGGCCG CCGCGTGGGG TTCGTTCCTG
CACCAGGGAC AGATCTGCAT GACGGCGGGG CGGCACCTCG TGCACTCGTC GATCGCGGAC
GAGTTCACGG CGTTGCTCGC GAAGAAGGCC GAGGCCATCA CCGTCGGAGA CCCGACGGAC
GAGAACAACG CACTCGGCCC GATCATCGAC GAACGGCAGC GGGAGCAGAT CCACCGCATC
GTGACCGACA CGGTGGACGC GGGCGCGAAA CTGCTCGCCG GAGGGAAGTA CGACGGCCTG
TTCTACCGGC CGACGGTCCT GGGGAACGTC CCGGTCGACA GCCCGGCGTT CCGGCAGGAG
ATCTTCGGTC CGGTGGCCCC CGTGGTCACG TACGACACCG TGGACGAGGC GATCGAGCTC
ATCAACGACA GCGAGTTCGG CCTCAGCGTC GGCATCCTGA CCTCCGACGC GTTCCGGGCA
TACGAGCTCG CGGACCGGAT CGAGTCGGGC ATGGTCCACA TCAACGACCA GACGGTCGAC
GACGAGGCCA CGATCCCGTT CGGTGGCGTG AAGGCCTCCG GGGCGGGCGG CCGCTTCGGT
GGTGCGCGGG CCAACCTGGA GTCGTTCACC GAGATCCAGT GGATCACGAT GCAGTCGTCG
ATCGAGCGTT ATCCGTTCTG A
 
Protein sequence
MTASTAGPVG DSPTRWSSSA ATGGTLTVTA PATGSVLAEV DAASPADVDR AVAKAKQAQR 
DWAATTYDQR AAVLRRAARL LEADPDRLRR WLVPESGSAM GKASFEVGLV VSELDECAAL
ASHPYGELLH STKPRLSLAR RVPVGVVGVI SPFNFPGILS MRSVAPALAV GNAVVLKPDP
RTPISGGLAL AELLAEAGLP DGLLTVLPGG AEVGQALVAH PDVPCISFTG STPAGRKIAE
AAAPLLKRVH LELGGNNALL VLPDADVEAA ASAAAWGSFL HQGQICMTAG RHLVHSSIAD
EFTALLAKKA EAITVGDPTD ENNALGPIID ERQREQIHRI VTDTVDAGAK LLAGGKYDGL
FYRPTVLGNV PVDSPAFRQE IFGPVAPVVT YDTVDEAIEL INDSEFGLSV GILTSDAFRA
YELADRIESG MVHINDQTVD DEATIPFGGV KASGAGGRFG GARANLESFT EIQWITMQSS
IERYPF