Gene Information |
Locus tag | Ndas_5349 |
Symbol | |
ID | 9249252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 525690 |
End bp | 526736 |
Gene length | 1047 bp |
Protein length | 348 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | LPXTG-motif cell wall anchor domain protein |
Protein accession | YP_003683235 |
Protein GI | 297564262 |
COG category | |
COG ID | |
TIGRFAM ID | |
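The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the gene length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(525690, 526736)
print(length)                     # 1047
print(protein_length_aa(length))  # 348
```

Both values agree with the Gene length and Protein length rows of the record.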
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.584645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.183228 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAACA CACTCCGTTA CTCCTTCGCC ACCGTCATGA CCGCCGGGCT GGTCGTGGCG CCCGTGGCGG CGGCCTTCGC CGACACCACC ACCAGCGGCT CCGGCGGGAT CGGCAGCGGG AACCAGGTCG TCGTGCCCGT GGACGTCGAA GCCGAGCTGT GCGGCAACTC GATCGCGATC CTCGGCATCT CCAGCGCCAC GTGCACCCAG GTCTCGGAGG TCCTCTACGA GGCCAGCGGC CAGGGCGGGG CCTCCACGGA CGGCTCCGGC GGTGTGGCCA GCGGCAACCA GATCATCGTC CCCGTGGACG CCGCCATCGA CGCCTGCGGC AACGCGGCCG CGGTCGGCGG CATCAGCCAG GCCGAGTGCG TCGAGGTGGT CGAGGTCCTG GAGGAGGAGA GCGCCGACGC GCCCACCACC AGGACCGACG GCTCCGGCGG TGTGGCCAGC GGCAACCAGA TCATCGTCCC CGTGGACGCC GCCATCAACG TCTGCGGCAA CTCGGTGGCC GTCCTGGGCG GCTCCAGCGC CAAGTGCACC ACCATCATCA ACATCATCCA GGCCTCCCCC GAGAACGAGG GCGCCCCCGA CGCCGCCACC AGCGGCGCGG GCGGGATCGG CAGCGGCAAC CAGGTCGTGG TCCCGGTGGA CGCGGCCGTC GACATCTGCG GCAACGCCGT GTCCGTGCTC GGCCTGGCCG AGGGCTCCTG CATGGAGATC ATCTCCGAGG AGGAGCGGCC GGAGGAGCCC GGCGAGGAGA AGCCCGAGGA GCCCGGCCAG CCCGAGGAGG AGAAGCCGGA GGAGGAGCAG CCCGAGGAGC CGGGTGAGGA GAAGCCCGAG GAGCCCCGCG AGGAGGACAA GGGCGAGGAC GACTCCAGCA CCGGCGAGGA GCCGACCGAG CCCCAGGCCG ACGAGCAGCT CCCCGTGACC GGTGGCGCCC TGGCCGGTCT GGTCGCCGCG GGCGTCGCCG CGGTCGGCGC GGGCGGTGCC GGGCTGTACT TCGCCCGCAA GCGCAAGGCC GCCGCCGTGA CCGGCGACGA CGAGTAG
|
Protein sequence | MRNTLRYSFA TVMTAGLVVA PVAAAFADTT TSGSGGIGSG NQVVVPVDVE AELCGNSIAI LGISSATCTQ VSEVLYEASG QGGASTDGSG GVASGNQIIV PVDAAIDACG NAAAVGGISQ AECVEVVEVL EEESADAPTT RTDGSGGVAS GNQIIVPVDA AINVCGNSVA VLGGSSAKCT TIINIIQASP ENEGAPDAAT SGAGGIGSGN QVVVPVDAAV DICGNAVSVL GLAEGSCMEI ISEEERPEEP GEEKPEEPGQ PEEEKPEEEQ PEEPGEEKPE EPREEDKGED DSSTGEEPTE PQADEQLPVT GGALAGLVAA GVAAVGAGGA GLYFARKRKA AAVTGDDE
|
| |
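The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following code removes that spacing, computes GC content, and translates a nucleotide sequence so the result can be compared with the listed protein. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs mainly in which codons may initiate translation, which does not matter here because the gene starts with ATG). All function names are hypothetical; only the first three blocks of the gene sequence are used as an example.

```python
from itertools import product

# Codon table in the conventional TCAG order; amino-acid assignments
# assumed identical to the standard code for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
excerpt = clean_sequence("ATGCGCAACA CACTCCGTTA CTCCTTCGCC")
print(translate(excerpt))  # MRNTLRYSFA
```

Running `gc_fraction` and `translate` on the full cleaned gene sequence would allow a comparison with the GC content and Protein sequence rows of the record.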