Gene Ndas_4302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4302 
Symbol 
ID9248177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5118495 
End bp5119715 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content71% 
IMG OID 
Productformaldehyde dehydrogenase, glutathione-independent 
Protein accessionYP_003682197 
Protein GI297563223 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.271716 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCA ACAGGGGTGT GGTGTACAGG GGAGCGGGCC GGGTCGAGGT CGAGGACGTC 
GACTACCCCG AGTTCGTCAT CAAGGACGGC CCCGGGGTCA ACCCGGCGAA CGTCGGACGC
AAGGTGCCGC ACGGGGCGAT CCTGAAGGTC GTCGCCACCA ACATCTGCGG AAGCGACCAG
CACATGGTGC GGGGGCGCAC CACCGCGCCG GAAGGGCTGG TCCTGGGCCA CGAGATCACC
GGCGAGGTGG TCGAGACCGG ACCCGGCGTG GAGTTCGTCA AGGTCGGCGA CCTCGTCTCG
GTCCCGTTCA ACATCTCCTG CGGGCGCTGC CGCAACTGCA AGGCGCGGCG CACGGAGATC
TGCCTGAACG TGAACCCGGA CCGGCCGGGC TCGGCCTACG GCTACGTCGA CATGGGCGGC
TGGGTCGGCG GGCAGGCCAG GTACGCGCTC GTGCCCTACG CGGACTGGAA CCTCCTCGTG
TTCCCCGACC GCGACCAGGC GTTGGAGAAG ATCCTGGACC TGACGATGCT CTCGGACATC
TTCCCGACCG GCTACCACGG CTGCGTCACC GCGGGCGTGG GCGTGGGGTC GAGCGTCTAC
GTCGCGGGGG CCGGGCCCGT CGGGCTGGCC GCCGCCGCGT CGGCCCGACT GCTCGGCGCG
GCGGTGGTGA TCGTCGCGGA CATGAAGGAG GAGCGGCTGG CGCAGGCCCG CAGCTTCGGG
TGCGAGACGG TGAACGTGGC CGAGGGTGAC CTGGCCGGGC AGATCGAGCG GATCCTGGGC
GTCCCCGAGG TGGACTGCGC GGTGGACGCG GTCGGCTTCG AGGCGCACGG GACGGGAGAG
GGAGCCTCGA AGGAGGCGCC CGCCAGCGTG CTCAACACCG CGATGGACGT GACCAGGGCC
GGGGGGTCCA TCGGGATTCC GGGCCTGTAC GTGACCGGCG ACCCAGGCGC GTCCGACGAG
GCCGCCAAGG AGGGTTCCCT GTCGGTCCGG ATCGGGCTGG GCTGGTCGAA GTCGCACGCC
TTCTTCACCG GCCAGTGCCC GGTCATGAAG TACCACCGGG AGCTGATGGA GGCGATCCTC
CACGACCGGG TGCGGATCGC CGAGGCCGTC AACGCCGTGG CGATCCCGCT GGAGGAGGCG
CCCGAGGGGT ACCGGGCCTT CGACGAGGGT GCGGCCAGCA AGTACGTGCT CGACCCGAAC
AACTACCTCG GCACGCGCTG A
 
Protein sequence
MTGNRGVVYR GAGRVEVEDV DYPEFVIKDG PGVNPANVGR KVPHGAILKV VATNICGSDQ 
HMVRGRTTAP EGLVLGHEIT GEVVETGPGV EFVKVGDLVS VPFNISCGRC RNCKARRTEI
CLNVNPDRPG SAYGYVDMGG WVGGQARYAL VPYADWNLLV FPDRDQALEK ILDLTMLSDI
FPTGYHGCVT AGVGVGSSVY VAGAGPVGLA AAASARLLGA AVVIVADMKE ERLAQARSFG
CETVNVAEGD LAGQIERILG VPEVDCAVDA VGFEAHGTGE GASKEAPASV LNTAMDVTRA
GGSIGIPGLY VTGDPGASDE AAKEGSLSVR IGLGWSKSHA FFTGQCPVMK YHRELMEAIL
HDRVRIAEAV NAVAIPLEEA PEGYRAFDEG AASKYVLDPN NYLGTR