Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2564 |
Symbol | |
ID | 9246415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3055754 |
End bp | 3057778 |
Gene Length | 2025 bp |
Protein Length | 674 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | protein of unknown function DUF839 |
Protein accession | YP_003680489 |
Protein GI | 297561515 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.200844 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0537939 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGAAT CCGCGCCCCG CCGCCGCCAG CTGCCGTTGA TCGGCCAGGT CGGTGGCGGT CGTTCCCGTG CGACATGCAG ACTGCGCTGC GGCGACGCCT GTTTCCACCC GGCTCCCAAC ACCAGCGACA ACCCCTACTT CGGGGACATC TTCGCCAAGG CGCTCTCCCG CCGCTCCGTC ATCCAGGCCG GTGCCGTGAG CGCCGGAGCG GGCGCGCTCG GACTCGCCGC GCTCAGCCCG GCCTCGGCCG ACCGCCGCGG CCCCGACCGC CCCCACCCCT CGCCGCGGCC CACCTTCACC AGCGTCCAGC CCAACAACGA CGACAAGATC ACCATCCCCC GGGGCTACGA CCAGCACGTC ATCATCCGCT GGGGCGAGCC GGTCCTGCCC GGCGCTCCCG AGTTCGACGT GTACGACCAG ACCGCCGAGG CGCAGGCCCA GCAGTTCGGC TACAACTGCG ACTACGTCGG CTTCCACCAG CTCGACGACG ACCACGCCCT GCTGTGGGTC AACCACGAGT ACACCAACGA GGAACTCATG TTCCCGGGGT ACGCCGGGGG CGACGCCGCC ACCGAGGAGC AGGTCCGCAT CGCCATGGCC GCCCACGGCG GTTCCATCGT CGAGCTCCAG CGCGAGGGCC GCACCGGCAA GTGGGTGCTC TCCACGGGCG AGCGCTCCTT CAACCGCCGC ATCACCGCCG ACACCCCCAT GCGCCTCACC GGCCCCGCCG CCGGGCACGA CCTGCTCAAG ACGGCGGAGG ACCCCACCGG CACCCTGGTC AGGGGCATGC TCAACAACTG CGCGGGCGGC ATGACCCCGT GGGGCACCTT CCTCACCGCC GAGGAGAACT TCAACCAGTA CTTCGCCAAC GGCTCCGGGT CCGCCGAGAC CAGGCGCTAC GGCGTCGGCA CCGGTGCCAC GGGGCGCCGC TGGGAGCGTT TCGAGGAGCG CTTCGACCTC TCCAAGCACC CCAACGAGAT CAACCGCTTC GGCTACATCG TCGAGGTCGA CCCGCTCGAC CCCGGGTCCG AGCCGCTCAA GCGCACCATG CTCGGCCGCT TCAAGCACGA GGGCGCCACC ACCCGCCTGG CCGACGACGG CCGGGTCGTG GCCTACATGG GCGACGACGA GCGCTTCGAC TACATGTACA AGTTCGTCAG CGCCAAGAAG TACGTCGAGG GCTCCCGGCG CCACAACCTC TCCCTGCTGG ACACCGGCAC GCTGTACGTG GCGCGCCTGT CCGGCAACAG CCCCGCCGAG GAGTTCGACG GCTCCGGCGC GCTGCCCGCC GACGGCGAGT TCGACGGTTC CGGCGAGTGG GTCGCGCTGT GCACCGACAC CGAGAGCTTC GTCCCCGGCT TCAGCGTCGC CGAGGTGCTC ATCCACACCC GTCTGGCCGC CGACGCCGTG GGCCCCACCA AGATGGACCG CCCCGAGGAC TTCGAGCCCA GCCCGGTCAC CGGCAAGGTC TACTGCGCGC TGACCAACAA CTCCGCCCGC GAGCCCGGCC AGGCCGACGA GCCCAACCCG CGCGGCCCCA ACCGGCACGG CCACGTCCTG GAGATCGTGG AGTCCGGCAA CGACGCGGCC GCCACCACCT TCGCCTGGAA CGTGCCGCTG GTGTGCGGCG ACCCCGAGGA CGACGACACC TACTACGCGG GCTTCGACAA GTCCAAGGTC ATGCCGATCT CCGCGCCGGA CAACCTGACC TTCGACAAGG ACGGCAACCT GTGGATCTCC ACGGACGGCC AGCCGGGCGC CCTGGGGATC AACGACGGCC TGCACGTCAT GCCGGTCGAG GGCCGCTTCC GAGGTGAGCT GAAGACCTTC GCCACCGTCC CGGTCGGCGC GGAGGCCTGC GGCCCCTTCG TCACCGAGGA CAGCAAGACG GTGTTCCTGG CCCCCCAGCA CCCCGGTGAC GGCGGCAGCT TCGAGGCTCC CACCAGCACC TGGCCCGACG GCGAGTTCCC GCGCCCGTCC GTGGTGTGCA TCTGGCACAC CGCGGGCCGC GAGGTCGGCA GGTAG
|
Protein sequence | MPESAPRRRQ LPLIGQVGGG RSRATCRLRC GDACFHPAPN TSDNPYFGDI FAKALSRRSV IQAGAVSAGA GALGLAALSP ASADRRGPDR PHPSPRPTFT SVQPNNDDKI TIPRGYDQHV IIRWGEPVLP GAPEFDVYDQ TAEAQAQQFG YNCDYVGFHQ LDDDHALLWV NHEYTNEELM FPGYAGGDAA TEEQVRIAMA AHGGSIVELQ REGRTGKWVL STGERSFNRR ITADTPMRLT GPAAGHDLLK TAEDPTGTLV RGMLNNCAGG MTPWGTFLTA EENFNQYFAN GSGSAETRRY GVGTGATGRR WERFEERFDL SKHPNEINRF GYIVEVDPLD PGSEPLKRTM LGRFKHEGAT TRLADDGRVV AYMGDDERFD YMYKFVSAKK YVEGSRRHNL SLLDTGTLYV ARLSGNSPAE EFDGSGALPA DGEFDGSGEW VALCTDTESF VPGFSVAEVL IHTRLAADAV GPTKMDRPED FEPSPVTGKV YCALTNNSAR EPGQADEPNP RGPNRHGHVL EIVESGNDAA ATTFAWNVPL VCGDPEDDDT YYAGFDKSKV MPISAPDNLT FDKDGNLWIS TDGQPGALGI NDGLHVMPVE GRFRGELKTF ATVPVGAEAC GPFVTEDSKT VFLAPQHPGD GGSFEAPTST WPDGEFPRPS VVCIWHTAGR EVGR
|
| |