Gene Information |
Locus tag | Ndas_4707 |
Symbol | |
ID | 9248589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5585980 |
End bp | 5587266 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | protein of unknown function DUF1205 |
Protein accession | YP_003682599 |
Protein GI | 297563625 |
COG category | |
COG ID | |
TIGRFAM ID | |
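The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
# Values copied from the record above.
START_BP = 5585980
END_BP = 5587266


def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_length = feature_length(START_BP, END_BP)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)
```

Under these assumptions the result matches the Gene Length (1287 bp) and Protein Length (428 aa) fields.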
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.806542 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGCGTGC TCGTCGTCAG CCAGGCGGAG AAGACCCATC TGCTGGGCCT CATCCCGCAG GCGTGGGCTC TGCGCGCCGC CGGGCACGAG GTGCGGGTGG CCAGCCAGCC CGCGCTGGTC CCCGTCGCGG CGCGCACCGG GCTGCCCGCC GTCCAGGTGG GCCGGGACCA CCTCTTCCAC CAGCTGCTGA CCACGTTGAA GGGCCTGGGC TTCGGCGACA GCCGGGGCTT CGACATGACC AGGAGCGATC CGGAGGCGCT GGGCTGGGAC TACCTGCTCG ACGGCTACCG CGAGTTCGTC CGGCTGTGGT GGCATCCGGT CAACACGCCC ATGCTGGACG ACCTCACCGA CCTGTGCCGC TCGTGGCGCC CCGACCTGGT CCTGTGGGAG CCGACCACCT TCGCCGCGCC GGTGGCGGCC CGGGCCTCGG GCGCGGCGCA CGTCCGGGTC CTGTGGGGGC TGGACGTGTT CTCCCGCACC CGCCGCCGGT TCCTGGAGCG CGCCGCCGCG CTGTCCGCCG CCGACCGCGA GGACCCCCTC GCCGACTGGC TGGAGCGCAG CGCCCGGCGC GTGGGCGCCG ACTTCTCCGA GGACCTGGTC CGGGGCCAGG CCACCCTGGA CCCCTATCCG CCGGGTGTGC GCCTGGACCC CGAGGAGGGC GTGCGCCACA TCCCCCTGCG CTACGTGCCC TACAACGGGA CCGCGGTGGT GCCCGACTGG CTGCGCTCCC CGGGCGGGCG CAGGCGGGTC TGCCTGACCC TCGGCTCGGC CGTGCCGGAG AAGTTCGACG ACCGCTACCG GCTGCCCCTC GCGGAGCTGC TGGAGTCGGT CGCCGGACTG GACGTGGAGG TCGTCGCCAC CCTGTCCGCC GAGCAGAGCG CGCGGGCCGG AACCCTGCCG GACAACGTCC GGGTGGTGGA GCACGTGCCG TTGCACGCGC TCATGCCCCA CTGCGACGCG GTGGTCCACC ACGGGGGAGC CGGGACCTTC TGCACCGCGG TGTTCCACGG TGTCCCGCAG CTCGTCCTCC CCGAGTTCTC CATGGCCCAG TACGTCTTCG ACGAACCGCT GCTCGCCGAG CGGATCACCG GGTTGGGGGC GGGCCTCGCG CTGGCGGGGG CCGGGATGAC CGGCGGGGAG GTCGCCCTCC AGGTCGGGCG CCTGCTCGAC GAGCCCCGCT TCGCCGAGGG CGCGCGCGTG CTCCGCGACA GGGCGCACGG GATGACCAGC CCGGCCGGAC TCGTCCCGGT GCTGGAGGAG CTGGCCGCCG AGGGCCGAAG CGCCTGA
Protein sequence | MRVLVVSQAE KTHLLGLIPQ AWALRAAGHE VRVASQPALV PVAARTGLPA VQVGRDHLFH QLLTTLKGLG FGDSRGFDMT RSDPEALGWD YLLDGYREFV RLWWHPVNTP MLDDLTDLCR SWRPDLVLWE PTTFAAPVAA RASGAAHVRV LWGLDVFSRT RRRFLERAAA LSAADREDPL ADWLERSARR VGADFSEDLV RGQATLDPYP PGVRLDPEEG VRHIPLRYVP YNGTAVVPDW LRSPGGRRRV CLTLGSAVPE KFDDRYRLPL AELLESVAGL DVEVVATLSA EQSARAGTLP DNVRVVEHVP LHALMPHCDA VVHHGGAGTF CTAVFHGVPQ LVLPEFSMAQ YVFDEPLLAE RITGLGAGLA LAGAGMTGGE VALQVGRLLD EPRFAEGARV LRDRAHGMTS PAGLVPVLEE LAAEGRSA
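The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how that might be done, together with a GC-fraction calculation and a simple translation. It assumes the standard codon assignments, which translation table 11 shares for elongation (the table differs from the standard code in its permitted start codons). The helper names are hypothetical, and only the first three blocks of the gene sequence are used as input to keep the example short.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence from the record above.
fragment = clean_sequence("ATGCGCGTGC TCGTCGTCAG CCAGGCGGAG")
peptide = translate(fragment)
print(peptide, gc_fraction(fragment))
```

The translated fragment can be compared against the first block of the protein sequence in the record. Because the record lists the gene on the minus strand but prints the sequence starting with ATG, the sketch treats the printed sequence as already being in coding orientation.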