Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0755 |
Symbol | |
ID | 9244597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 925470 |
End bp | 926993 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | alpha-L-arabinofuranosidase domain protein |
Protein accession | YP_003678706 |
Protein GI | 297559732 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.426486 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCGAG CCTCCGTCAC CGTCGACCCG GCGGCCATCG TCTCCCCCGT GCACCGCCGC ACCTTCGGCT CGTTCGTCGA GCACATGGGC CGCTGCGTCT ACACCGGCAT CTACGAACCC GGGCACCCGA CGGCCGACGC CGACGGCTTC CGACGCGACG TCGCGGACCT GGTCCGGGAA CTGGGCGTCA CCACCGTGCG CTACCCGGGC GGCAACTTCG TGTCCGGGTA CCGGTGGGAG GACGGCGTCG GCCCCCGGGA CCGCCGACCG GTCCGCCGCG ACCTGGCCTG GCACAGCATC GAGACCAACC AGTTCGGCCT CGACGAGTTC ACCGCCTGGT GCCGCGGCCT GGACATCGAG CCGATGATGG CGGTCAACCT CGGCACCCGC GGCCTGGCCG AGGCCCTGGA CCTGCTGGAG TACTGCAACC ACCCCGGCGG CACCCACCTG TCCGACCAGC GCGCCGCCAA CGGGCACCCC GAACCGCACG GCATCCGCAT GTGGTGCCTG GGCAACGAGA TGGACGGCCC CTGGCAGATC GGCCACCTCG ACGCGCGCTC CTACGGACGC AAGGCCGGGC AGGTGGCGCG CGCCATGAGG ATGGCCGACC GCGACCTCGA ACTGGTCGTC TGCGGCAGCT CCGGATCGGC CATGCCCACC TTCGGCCAGT GGGAGGCCAC CGTCCTGGAG GAGACCTACG ACGCGGTGGA CCACATCTCG CTGCACGCGT ACTACGAGGA GCGCGACGGG GACCTGGCCG ACTTCCTCGG CTCGTCGACC GACATGGACC GCTTCATCGA CTCCGTCGTC TCCACCGCCG ACGCCGTCGG CGCGCGCCTG CGCGACCCCA AACGCATCCA GCTCTCCTTC GACGAGTGGA ACGTGTGGTA CCTGAGCCGC CACCAGGCCC GGGCCGCCGC GCAGCCCGCA GACGACTGGC AGGTGGCGCC CCGCGTCATC GAGGACCGCT ACAGCGTCGC CGACGCCGTC GTCGTGGGCA ACATGCTCAT CAGCCTGCTC CGGCACGGCG ACCGGGTCAC CGCCGCCAGC CAGGCCCAGC TCGTCAACGT CATCGCGCCG ATCATGACCG AGCCCGGCGG CCCCGCGTGG CGCCAGACCG TCTTCCACCC CTTCGCCCTG ACCGCTCGGG CCGCACGGGG CCGGGTGCTG CACACCGGCG TCACCGCGCC CCGGTACACG ACCGCCAGCC ACGGCGAGAT CCCGCTCCTG GACGCCGTCG TCACCTTCGA CGAGGAGGAG GGCACCGCGT CGCTGTTCGC GGTCAACCGC TCCACCGACC AGCACCTCGC CCTCGCCGCC GACCTGCGCG GCCTGGCCCC GACGGCCGTC ACGGACGCCC GGACCCTCAG CGACGAGGAC CCCTACGCCC ACAACACCAT GGACGCCCCC GACCGGGTCG TCCCGCGACC GGCCGGGGGC GTGACCCTGG ACGGCGGCAG GCTCTCCGCC GTCCTGCCCC CCGTGTCCTG GTCCGTCATC ACCCTCTCGA CCAGTCGGAA CTGA
|
Protein sequence | MLRASVTVDP AAIVSPVHRR TFGSFVEHMG RCVYTGIYEP GHPTADADGF RRDVADLVRE LGVTTVRYPG GNFVSGYRWE DGVGPRDRRP VRRDLAWHSI ETNQFGLDEF TAWCRGLDIE PMMAVNLGTR GLAEALDLLE YCNHPGGTHL SDQRAANGHP EPHGIRMWCL GNEMDGPWQI GHLDARSYGR KAGQVARAMR MADRDLELVV CGSSGSAMPT FGQWEATVLE ETYDAVDHIS LHAYYEERDG DLADFLGSST DMDRFIDSVV STADAVGARL RDPKRIQLSF DEWNVWYLSR HQARAAAQPA DDWQVAPRVI EDRYSVADAV VVGNMLISLL RHGDRVTAAS QAQLVNVIAP IMTEPGGPAW RQTVFHPFAL TARAARGRVL HTGVTAPRYT TASHGEIPLL DAVVTFDEEE GTASLFAVNR STDQHLALAA DLRGLAPTAV TDARTLSDED PYAHNTMDAP DRVVPRPAGG VTLDGGRLSA VLPPVSWSVI TLSTSRN
|
| |