Gene Ndas_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1074 
Symbol 
ID9244920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1320951 
End bp1322606 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content73% 
IMG OID 
ProductPyridoxal-dependent decarboxylase 
Protein accessionYP_003679022 
Protein GI297560048 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0179371 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.433316 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCCC TCGCGCGGTC CGCACCGGGC CCGCTGCACG GGAACGCCGG TGCCCTCGGC 
ACCAGGCCCA CACCCCCCTC CGGCCAGGAC CCCGACGACC CCCGGACACA GCTGTTCGAC
GCCCGGAGCG CCGAACGCTA CCGCGACCTG ACCGGGGAGG CCGCGGCCCG CGTCGCCCGA
CGCATCGCCG GAGCCGACCG GCCGCTCACC GGAGCCACCG CCGAGGACCT GCGCCCCCAG
ATCGACAAGG TCGACCTCGA CCAGCCGCTC CACGACCCCA CCGCGGCCCT GGACGAACTC
GAACGCGTCT ACCTCGACGA CGCCGTCTAC TTCCACCACC CCCGCTACAT GGGACACCTC
AACTGCCCCG TGGTCCTGCC CGCCCTGCTC GGCGAGACCG TCCTGTCCGC CGTCAACTCC
TCCCTCGACA CCTGGGACCA GAGCGCCGGG GGCACCCTGA TCGAACAGCG GCTCATCGAC
TGGACGTGCG AGCGCGTCGG CTTCGGCGAG ACCGCGGACG GCGTCTTCAC CAGCGGCGGC
AGCCAGTCCA ACCTCCAGGC CCTGCTCATG GCACGCGACG AGGCCCACCA CCGCGCCAAG
GCGCAGGAGG GCCAGGACAC CCGCCTGGCC GAACTCCTGC CCCGGATGCG CGTCCTGACC
TCCGAGGCCG GACACTTCAG CGTCGCCAAG TCCGCCGCCC TCCTCGGACT GGGCTACGAA
TCCGTCATCA CCGTGGCCTG CGACGACAGA CGCCGCATGC GCCCCGACGC CCTCGCCGCC
CAACTGCGCC GATGCCGCGC CGAAGGACTC CTGCCCATAG CCGTCGTCGC CACCGCCGGA
ACCACCGACT TCGGCAGCAT CGACCCCCTG CCCCGCATCG CCGACCTGTG CCGACAGCGC
GGCGTGTGGA TGCACGTCGA CGCCGCCTAC GGCTGCGGCC TGCTGGTCTC GCGCCACCGC
CACCTGCTGG AGGGCGTCGA ACGCGCCGAC TCGGTCACCG TGGACTTCCA CAAGTCCTTC
TTCCAACCGG TCAGCTCCAG CGCGATCGTG GTCCGCGACC GCGACGTCCT GCGCCACGTC
ACCTACCACG CCGACTACCT CAACTCCCGC TCGGACGGCA GCACCCCCCT GCTCTCCCCC
AACCAGGTCG ACAAGAGCCT GCAGACCACA CGCCGCTTCG ACGCCCTCAA ACTGTGGCTC
ACCCTGCGCG TCATGGGCGC CGACGGCGTG GGCGCCCTCT TCGACAGCGT CCTGGACCTG
GCCGCCACCG CCTGGACCCT GCTCGACGCC GACCCGCGCT TCACCGTGGT CACCCGGCCC
AGCCTGAGCA CCCTGGTCTT CCGCTGCGCC GTACCCGGCG CCGACCCCGA CACCGCCGAC
GCCGCCCACC GCTACGCACG CGAGGCGCTG CTGGCCTCGG GCCGCGCCTT CGTGGCCCGC
ACCACCGTCG ACGGCAGGCC CCACCTCAAA CTCACCCTGC TCAACCCCAG GGCCACCCGG
GAGGACGTCG CCGAGGTACT GGACCTGATC GCCGCGCACG TCGACCACTT CATGAACGGA
CGCGACATCC CCGACCGCCC GGCCCCACCC GCCCCGACCA CGACCGCGGC CCACGCGGCC
TCCGCCCTGC CGACCACCAC CGGGAGGTCC CGTTGA
 
Protein sequence
MSSLARSAPG PLHGNAGALG TRPTPPSGQD PDDPRTQLFD ARSAERYRDL TGEAAARVAR 
RIAGADRPLT GATAEDLRPQ IDKVDLDQPL HDPTAALDEL ERVYLDDAVY FHHPRYMGHL
NCPVVLPALL GETVLSAVNS SLDTWDQSAG GTLIEQRLID WTCERVGFGE TADGVFTSGG
SQSNLQALLM ARDEAHHRAK AQEGQDTRLA ELLPRMRVLT SEAGHFSVAK SAALLGLGYE
SVITVACDDR RRMRPDALAA QLRRCRAEGL LPIAVVATAG TTDFGSIDPL PRIADLCRQR
GVWMHVDAAY GCGLLVSRHR HLLEGVERAD SVTVDFHKSF FQPVSSSAIV VRDRDVLRHV
TYHADYLNSR SDGSTPLLSP NQVDKSLQTT RRFDALKLWL TLRVMGADGV GALFDSVLDL
AATAWTLLDA DPRFTVVTRP SLSTLVFRCA VPGADPDTAD AAHRYAREAL LASGRAFVAR
TTVDGRPHLK LTLLNPRATR EDVAEVLDLI AAHVDHFMNG RDIPDRPAPP APTTTAAHAA
SALPTTTGRS R