Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3834 |
Symbol | |
ID | 9247705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4598823 |
End bp | 4599842 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | Zn-dependent hydrolase including glyoxylase |
Protein accession | YP_003681737 |
Protein GI | 297562763 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.537571 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTCTGGA ATCGGAAGTC GAAGAAGCAG GAGGGCCAGG AGGCCACCGA TTCCGCTGCG GAGGCCGCGG CGGAGCGCGA GGAGACCACC GCCGCCAGCG GGGACGTGGC CGCGGAGGAC GGGGCCAAGG ACCCGGCCGG GACCGCCGAG GCCTCCGCCG AGGCCGCCGA CGACGGCGCC GCCGAGCCCG CCGTCGACGC CGGGGAGCTC ACCGGGGCGG AGAAGGACGG GGCCGCCGAG GGCTCCGCCG CCGACGAGAA GGCGGAGGAG GCCGGGGAGG CGGACGGGTC CGAGGACTCC GACGCGGACG ACGAGTCCGA CGACGAGGAG GACGAGGCGC CGGTGAAGGC CGACGTCTTC GCCGACGAGG GCGAGGAGGC CGGGGTGACC GGCCCCGACG AGGACGGCAT CACGCGCGTG CGCACCGCCG GGACCTTCGA GTTCGACGGC GAGGAGTACC CGGTCGTCAG CAACACCTGG ATCATCGACG CCGACGACGA GGGCGTCGTC GTCATCGACC CCGGCCACGA CGCCGAGGCG ATCCTGGCGG CCATCGGTGA GCGCGAGGTC TACCTGGTGG CCTGCACCAA CGCCTACAGC CCCCACATCG AGTCGGCCAT CAAGGTCGCC GAGGAGTCCG AGGCCCCCAT CGCGCTGCAC CCCCGCGAGA TGCGCGCCTG GCGCCGCCAC CACGGCGCCG AGCACCGCCC CGAGATCGAG GTGGAGGGCC AGGGCGCCCT CGACATCGGC AAGCTGCACA TCGACGTCCT GGCCCTGCCC GGCACCTCCC CCGGCACCGT CGGCTACTAC GTCAGCGAAC GCGGCGTCGT CTTCGGCGGC GACACCCTGC GCAAGGGCGA GCCCGGCATG GTCGGCAACA CCTACATCGA CTACACCACG CAGCTCGCCT CCATCGGCGA GGCGCTGCTG TCCCTGCCCC CGGACACGCG CATCCTGCCC GACCGCGGCC CGGCCACCAC CGCGGCGGCC GAGGGCAAGA ACTTCGACTC CTGGGTCTGA
|
Protein sequence | MFWNRKSKKQ EGQEATDSAA EAAAEREETT AASGDVAAED GAKDPAGTAE ASAEAADDGA AEPAVDAGEL TGAEKDGAAE GSAADEKAEE AGEADGSEDS DADDESDDEE DEAPVKADVF ADEGEEAGVT GPDEDGITRV RTAGTFEFDG EEYPVVSNTW IIDADDEGVV VIDPGHDAEA ILAAIGEREV YLVACTNAYS PHIESAIKVA EESEAPIALH PREMRAWRRH HGAEHRPEIE VEGQGALDIG KLHIDVLALP GTSPGTVGYY VSERGVVFGG DTLRKGEPGM VGNTYIDYTT QLASIGEALL SLPPDTRILP DRGPATTAAA EGKNFDSWV
|
| |