Gene Information |
Locus tag | Ndas_4716 |
Symbol | |
ID | 9248598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5597990 |
End bp | 5599264 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | protein of unknown function DUF1205 |
Protein accession | YP_003682608 |
Protein GI | 297563634 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
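The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 5597990
END_BP = 5599264


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1275 424
```

Under these assumptions the computed values agree with the Gene Length (1275 bp) and Protein Length (424 aa) rows.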
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.397708 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.643651 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGTGTAC TACTCGCCTC CTACGCGGAG AAGACGCACT TCATCGGCAT GGTGTCCCTG GCGTGGGCGC TGCGCGCCGC CGGGCACGAG GTGCGCGTCG CCAGCCAGCC CGGACTGGCG GGGTTCCTGA GGAGTGCGGG TCTGCCCGCC GTCCCGGTCG GGCGGGACCA CCTGCTGCGC GAGCGGTTCG AGCTGGTGAC GCAGTGGGGC GAGGGCGACG CCCCGGGGCT GTTCGACGTG GGGGAGAGCT GGCCCGGCGA CCTGTCCTGG GACGAGATGC GCTGGGGCCT GCGCGACACC GCGGCCTGGT GGTGGCGCAT GGTGAACGAC CCGATGCTGG AGGACCTGGT CGCCTTCTGC CGCGAGTGGC GCCCCGACCT GGTCGTGTGG GAGGCGACGA CGTTCGCGGC CCCCGTCGCC GCGGAGGCGT GCGGTGCGGC GCACGTGCGC TTCCTGTGGA GCCTGGACCT GTTCGCCGCG ATGCGCGAAC AGTACCTGCG CCACATGGAA CGACAGCCCC CACAGGAACG CGACGACCCC CTCGCCGCAT GGCTGGGCGA CCGCGCCGCC CGCCACGGCG TCGACTTCTC CGAAACCCTC GTCCGCGGCC AGGCCACCCT GGACTACCTG CCCGCCTCCC TGGGCGTGCC CGCCCCCACC GGAGCCCGCC GCCTGCCCAT CCGCTACGTG CCCTACAACG GACGCGCCGT CGTCCCCGAC TGGCTGCGCA CACCCCCCAC CCGCCCCCGC GTCTGCCTCA GCGTGGGGAG CAGTACGACT GAGTGGTTCG GCGGGTACAC GTTCTCCCTG GCCGAGGTGG TGCGCGGCCT CGGCGAACTG GACGCGGAGG TGGTCGCGAC CCTGCCCCCC GAGGAGGAGG CCGCACTCGG CGCGGTCCCG GACAACGTGC GGCTGGTGGG GTACGCCCCC CTGCACGTCC TGGCCCCCAC CTGCGACGTC ATGATCACCC ACGCGGGGCC GGGGACCCTG TGCTCCGGGC TCTCCCACGG CGTCCCCCAG CTCCTCGTCC CCGGCCCCCG CCTCGACGCC CCCCTGCTCG CACGGCTGGT GGAGCGGGAG GGGGCCGGGC TGGTGGTGCC GTCGGGCGAG GCGGGGGCCG ACAGCGTCCG CGACGCGACC CGGCGCCTGC TGGAGGACCC CTCCCACGCC GAGGCGGCGC GGCGCCTGCG CGGGGAGATG GCCGCCATGC CCTCGCCCGC GGAGGCGGTG CGCGGCCTGC CCCGCGTCCT GGAGGGTCTG GGCGCCTCCG TCTGA
|
Protein sequence | MRVLLASYAE KTHFIGMVSL AWALRAAGHE VRVASQPGLA GFLRSAGLPA VPVGRDHLLR ERFELVTQWG EGDAPGLFDV GESWPGDLSW DEMRWGLRDT AAWWWRMVND PMLEDLVAFC REWRPDLVVW EATTFAAPVA AEACGAAHVR FLWSLDLFAA MREQYLRHME RQPPQERDDP LAAWLGDRAA RHGVDFSETL VRGQATLDYL PASLGVPAPT GARRLPIRYV PYNGRAVVPD WLRTPPTRPR VCLSVGSSTT EWFGGYTFSL AEVVRGLGEL DAEVVATLPP EEEAALGAVP DNVRLVGYAP LHVLAPTCDV MITHAGPGTL CSGLSHGVPQ LLVPGPRLDA PLLARLVERE GAGLVVPSGE AGADSVRDAT RRLLEDPSHA EAARRLRGEM AAMPSPAEAV RGLPRVLEGL GASV
|
| |
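The relationship between the gene sequence, the protein sequence, and the Translation table and GC content fields can be illustrated with a short script. This is a minimal sketch, not the pipeline that produced the record: the helper names are hypothetical, the codon table is built from the standard NCBI amino-acid string (shared by tables 1 and 11), and an initiator codon from the table 11 start set is rendered as M. Only the first 30 and last 15 bases of the gene sequence are used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); tables 1 and 11 share these amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq):
    """Translate a CDS fragment, stopping at the first stop codon."""
    seq = clean(seq)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # An alternative start codon such as TTG is read as methionine at position 1.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues).split("*")[0]


head = "TTGCGTGTAC TACTCGCCTC CTACGCGGAG"  # first 30 bases of the gene sequence
tail = "GGCGCCTCCG TCTGA"                  # last 15 bases of the gene sequence
print(translate_cds(head))  # MRVLLASYAE
print(translate_cds(tail))  # GASV
```

The two fragments translate to the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give the value that the GC content row reports in rounded form.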