Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4348 |
Symbol | |
ID | 9248223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5181057 |
End bp | 5182313 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | domain of unknown function DUF1727 |
Protein accession | YP_003682243 |
Protein GI | 297563269 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0229746 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.964565 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGC TTCCCCTGCG CGCCCAACTG GCATCGGTTC TGGGAAGGAG CGCGGCCAGC CTGTCCCGTG CCACCGGACG CGGAGACGGC TCCGTCATCG GCGGCCGGGT GGCGCTCAAG GTCGAACCCG ACCTGCTCGC CAAGCTCGCC CGGGGCCGCA GGCTCGCCCT GGTCAGCGCC ACCAACGGCA AGACCACCAC CACCAGGCTC ATCTCGCACG CCCTGCGCGA GTTCGGCGAC GTGGCCACCA ACGAGCACGG CGCGAACATG CCCACCGGGC ACATCACGGC TCTGTCGAAC AACCAGTCCG CCGTCAACGG CGTGCTGGAG GTGGACGAGA AGTACCTCCC GCAGGTGCTG CTCGCCACGC AGCCCGCGTT CGTGGTGCTG ATGAACCTCA GCCGCGACCA GATGGACCGC GCCTCCGAGA TCAACCTGCT CGCCAAGAAG TGGCGCCTCG CGCTGGGCAA GAGCAACGCC CACGTCATCG CCAACGCCGA CGACCCGCTC GTGGCCTGGG CGGGCCTGGG CGCGCCCAAC GCCACCTGGG TGTCCGCGGG TCAGCGCTGG AAGGAGGACT CCTGGTGCTG CCCCGAGTGC GGCGGCCACC TCAAGCGCGA CGTGGACCCG CACTGGGCCT GCCCCGAGTG CGGGCTGGCC CGCCCCGCGA CCACCTGGGC GGTGGACAAC GCCTCCGACT CCCTCCTCAC CCCGGAGGGC CAGAGCATCA AGCTGCGGCT GAACCTGCCC GGTGACGCCA ACCGCTCCAA CGCCGCGATC GCCGCGGCCA CCGCCGCCGG GTACGGCATC CACCCCGAGC GCACCGTGGA GCGGCTGCGC GAGATCACCT CCGTCGCGGG CCGCTACACC TCCGTGGTGA CCATGGGCGT CGAGGTGCGG CTGCTGCTCT CCAAGAACCC CGCCGGATGG CTGGAGTCCT TCGCCGTCCT CGACCCGCCC CACACCCCGG TGATCCTCTC GGTCAACGCG CAGGTCCCCG ACGGCAAGGA CACCTCCTGG CTGTGGGACG TGGACTACAC CGTCCTGCGC GGACGCCGCG TGTTCGTCAT GGGCGAGCGC CGCACCGATC TCGCGCTGCG CCTGGAGACC GACGGCGTGC GGTTCGAGGT GGCCGACCGG GTCGACGAGG TCCTGGGCCG CATCAAGGCC GACCAGCCGG GCATCACCAA GGTCGACCTC ATCGCCAACT ACACCGCCTT CCAGCAGATC CGCACGGCGT ACGGCCGCGT CCAGTAG
|
Protein sequence | MSELPLRAQL ASVLGRSAAS LSRATGRGDG SVIGGRVALK VEPDLLAKLA RGRRLALVSA TNGKTTTTRL ISHALREFGD VATNEHGANM PTGHITALSN NQSAVNGVLE VDEKYLPQVL LATQPAFVVL MNLSRDQMDR ASEINLLAKK WRLALGKSNA HVIANADDPL VAWAGLGAPN ATWVSAGQRW KEDSWCCPEC GGHLKRDVDP HWACPECGLA RPATTWAVDN ASDSLLTPEG QSIKLRLNLP GDANRSNAAI AAATAAGYGI HPERTVERLR EITSVAGRYT SVVTMGVEVR LLLSKNPAGW LESFAVLDPP HTPVILSVNA QVPDGKDTSW LWDVDYTVLR GRRVFVMGER RTDLALRLET DGVRFEVADR VDEVLGRIKA DQPGITKVDL IANYTAFQQI RTAYGRVQ
|
| |