Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4302 |
Symbol | |
ID | 9248177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5118495 |
End bp | 5119715 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | formaldehyde dehydrogenase, glutathione-independent |
Protein accession | YP_003682197 |
Protein GI | 297563223 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.271716 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGCA ACAGGGGTGT GGTGTACAGG GGAGCGGGCC GGGTCGAGGT CGAGGACGTC GACTACCCCG AGTTCGTCAT CAAGGACGGC CCCGGGGTCA ACCCGGCGAA CGTCGGACGC AAGGTGCCGC ACGGGGCGAT CCTGAAGGTC GTCGCCACCA ACATCTGCGG AAGCGACCAG CACATGGTGC GGGGGCGCAC CACCGCGCCG GAAGGGCTGG TCCTGGGCCA CGAGATCACC GGCGAGGTGG TCGAGACCGG ACCCGGCGTG GAGTTCGTCA AGGTCGGCGA CCTCGTCTCG GTCCCGTTCA ACATCTCCTG CGGGCGCTGC CGCAACTGCA AGGCGCGGCG CACGGAGATC TGCCTGAACG TGAACCCGGA CCGGCCGGGC TCGGCCTACG GCTACGTCGA CATGGGCGGC TGGGTCGGCG GGCAGGCCAG GTACGCGCTC GTGCCCTACG CGGACTGGAA CCTCCTCGTG TTCCCCGACC GCGACCAGGC GTTGGAGAAG ATCCTGGACC TGACGATGCT CTCGGACATC TTCCCGACCG GCTACCACGG CTGCGTCACC GCGGGCGTGG GCGTGGGGTC GAGCGTCTAC GTCGCGGGGG CCGGGCCCGT CGGGCTGGCC GCCGCCGCGT CGGCCCGACT GCTCGGCGCG GCGGTGGTGA TCGTCGCGGA CATGAAGGAG GAGCGGCTGG CGCAGGCCCG CAGCTTCGGG TGCGAGACGG TGAACGTGGC CGAGGGTGAC CTGGCCGGGC AGATCGAGCG GATCCTGGGC GTCCCCGAGG TGGACTGCGC GGTGGACGCG GTCGGCTTCG AGGCGCACGG GACGGGAGAG GGAGCCTCGA AGGAGGCGCC CGCCAGCGTG CTCAACACCG CGATGGACGT GACCAGGGCC GGGGGGTCCA TCGGGATTCC GGGCCTGTAC GTGACCGGCG ACCCAGGCGC GTCCGACGAG GCCGCCAAGG AGGGTTCCCT GTCGGTCCGG ATCGGGCTGG GCTGGTCGAA GTCGCACGCC TTCTTCACCG GCCAGTGCCC GGTCATGAAG TACCACCGGG AGCTGATGGA GGCGATCCTC CACGACCGGG TGCGGATCGC CGAGGCCGTC AACGCCGTGG CGATCCCGCT GGAGGAGGCG CCCGAGGGGT ACCGGGCCTT CGACGAGGGT GCGGCCAGCA AGTACGTGCT CGACCCGAAC AACTACCTCG GCACGCGCTG A
|
Protein sequence | MTGNRGVVYR GAGRVEVEDV DYPEFVIKDG PGVNPANVGR KVPHGAILKV VATNICGSDQ HMVRGRTTAP EGLVLGHEIT GEVVETGPGV EFVKVGDLVS VPFNISCGRC RNCKARRTEI CLNVNPDRPG SAYGYVDMGG WVGGQARYAL VPYADWNLLV FPDRDQALEK ILDLTMLSDI FPTGYHGCVT AGVGVGSSVY VAGAGPVGLA AAASARLLGA AVVIVADMKE ERLAQARSFG CETVNVAEGD LAGQIERILG VPEVDCAVDA VGFEAHGTGE GASKEAPASV LNTAMDVTRA GGSIGIPGLY VTGDPGASDE AAKEGSLSVR IGLGWSKSHA FFTGQCPVMK YHRELMEAIL HDRVRIAEAV NAVAIPLEEA PEGYRAFDEG AASKYVLDPN NYLGTR
|
| |