Gene Ndas_4236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4236 
Symbol 
ID9248110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5053468 
End bp5054502 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content77% 
IMG OID 
Product2-nitropropane dioxygenase NPD 
Protein accessionYP_003682133 
Protein GI297563159 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.637642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTGG CGGAGCTGCT GCGGGAGCGC CCGATCGTAC AGGCGCCCAT GGCGGGCGGG 
GCCGCCACGC CCGCGCTGGT GGCGGCCGTG GCGGGAGCGG GCGGAACGGG TTTCCTCGCC
GCCGGGTACC TGGCCCCCGA GGTCCTCGCC GACCAGCTCG GGGCGGTGCG CGACGCCGGG
GTCGGCGCGT TCGGGGTGAA CGTGTTCGTG CCGGGCCCGC CCTCCGACCC CGACGTGGCG
GCGTCCTACC GTTGCGACCT GGAGTCCGAG GCCGAGCGGT ACGGGACGCC GGTCGGCGCC
CCGGTGCACG ACGACGACGC GTGGGCGGCC AAGATCGACC TGCTGGCGCG GGCGGCCGTG
CCGGTGGTGA GCTTCACCTT CGGCTGCCCG GAGGCCGCCG TGTTGGAGCG GCTGCGCGCG
GCGGGCAGTG CCACGGTGGT CACCGTGACC ACGGTCGGGG AGGCGCGCGA GGCCGTGGCC
CGCGGAGCCG ACGGGGTGTG CGCGCAGGGT ACGGAGGCCG GGGGCCACCG CGGCGCGTTC
GACCCGGTCG GGAACGGAGG TCTGCCGCTG CGGGAGCTGC TGGCGGACGT GGTCGGCGCG
GTGGAGGTAC CGGTGATCGC CGCCGGGGGG ATCATGACCG GGGCCGACGT GGCCGGGGCC
CTGGACGCGG GTGCCGCCGC TGTGCAGCTG GGCACGGCGT TCCTGCGCTG TCCCGAGAGC
GGCGCCAACC CGGTCCACAA GGCCGCGCTG GCCGATCCCG CGTACACCGG GACGGCCGTG
ACGTGGGCTT TCACCGGCCG TCCGGCGCGG GGCCTGGCCA ACAGGTTCAT CGCCGAGCAC
CCGCGGAGGC CCTTCGCCTA CCCCGAGATC CACCACATGA CGAAGCCGCT GCGCGCGGCC
GCCGCCCGGG CCGGAGACCC CGGCGGCATG GCGCTGTGGG CGGGAGAGGG GTTCCGGGCG
GCCAGCGACG ATCCCGCGGC GCTGGTCGTG GAGCGGTTGC GCCGCGAGGC CGCGGAGGCG
GGCCGGAAGG TCTGA
 
Protein sequence
MSLAELLRER PIVQAPMAGG AATPALVAAV AGAGGTGFLA AGYLAPEVLA DQLGAVRDAG 
VGAFGVNVFV PGPPSDPDVA ASYRCDLESE AERYGTPVGA PVHDDDAWAA KIDLLARAAV
PVVSFTFGCP EAAVLERLRA AGSATVVTVT TVGEAREAVA RGADGVCAQG TEAGGHRGAF
DPVGNGGLPL RELLADVVGA VEVPVIAAGG IMTGADVAGA LDAGAAAVQL GTAFLRCPES
GANPVHKAAL ADPAYTGTAV TWAFTGRPAR GLANRFIAEH PRRPFAYPEI HHMTKPLRAA
AARAGDPGGM ALWAGEGFRA ASDDPAALVV ERLRREAAEA GRKV