Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1245 |
Symbol | |
ID | 9245095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1546613 |
End bp | 1548259 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_003679190 |
Protein GI | 297560216 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.267383 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0561993 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCA CCGGAACCGA GGCCGCCGCC TTCGAGACCC CCGGCCCGGA CGCACTCGCC GAGGGGGAGC GCCTCTACCG CCGACTGCGC TCCGAGCAGC GCTGGGAGGT GCCCGAGCGG TACAACATGG CCGTCGACGT CGCCGACCAC CACCCCGGGG ACCGGGTCGC CCTCGTCTTC GAGGACCACA CCGGGCACCG CGACGAGGTC ACCTGGGACC GCGTCCGCGA CCGGGCGGCC CGCGTCGCCG CCCACCTGCA CGCCCTGGGC GTGCGCGAGG GCGACCGCGT CGCGGTCCTG CTGCCCCAGC GCCCCGACAC CCCCGCCACC TACCTGGGCG TGCTGCGCAC CGGTGCCGTC CTGGTCACCA TGTCCCTGCT GTGGGCCGAC GACCCCATCG CCTACCGGCT CGCCGACTCC GGCAGCACCG TCCTGATCAC CGAGGACGCC GCCCTGGAAC GCGCCGCGGG AGCCGTCCGC GCCGCCCGCG CCTCCCACGG GCTCGACGCC GACGTCCGCG TCGTGAGCGT GGACGACCCG GCCGTGGCCG CCCGCGAGCC CCGCCACGAG GCCGCCGACA CCGCCGCCGA CGACCCCGCG CTCATCTTCT ACACCTCGGG GACCACCGGC GCGGCCAAGG GCATCGTGCA CGCCCACCGC ACCCTGCTGG GCCACAACGA GTTCCGCTAC TGCCACGACC TGCGCCCCGG CGACGTGTTC TACGGGGCGG GCGACTGGGC CTGGTCCATG GCCAAGCTCA TGGGCCCGCT GCGAGCCGGG GCCACCCACC TGGTCCACCG CCCCCGCGCG GGCTTCGATC CCGAGGGGCT GCTCGCCGCG ATGGCGCGCA ACCGGGTCAC CACCGCCCTG GTCAACCCCA CCCTGCTGCG CAAGCTCCGC GCCGGGGTGC CCGACGCCGG ACGCCGCCAC CCCCAGTCCC TGCGCGTGGT GTGCTGCTCC AACGAGCCGC TGACCACCGA CCTCATCACC TGGTTCCGGG AGCAGTTCGG CGTCACCCCG TTCGACTACT ACGGGTCCAC CGAGTCCTAC CCCCTGCTCG GGAACATGCC CGGAATCCCG GTCAAGCCCG GCTCGGTGGG CCGCCCGCTG CCCGGCTGGG ACGTCGTCCT GCTGGACGCA CAGGACCGCG AGGTCCCCGA CGGGGAACCC GGCGAGATCT GCCTGCGCGC CCGCAGCAAC CCGCAGTTCC CGCTGGGCTA CTGGAACCGG CCCGAGGCCT CCGCCGAGAC CTTCGGCGGC ACCTGGTACC GCACCAAGGA CCAGGCCACC CGCGACGCCG ACGGCTACTT CTGGTTCCTG GGCCGCACCG ACGACGTCAT CAAGACCTCC GGCTACCGGG TCGGACCCTA CGAGGTCGAG GCGGCCCTGC GCGAGCACCC GGCCGTCGCC GACGCCGGGG TCGTGGGCCT GCCCGACCCG CTGCGCGGAC AGGTCGTCAA GGCATGGATC GAACTGGCCG ACGGGTACGA GCCGAGCGAG GAGCTCGCCG ACCGGATCCG CGCCTTCGCC CGCGAGCACC ACTCCCTCTT CGCCTACCCG CGCCTGATCG CCTTCGAGGA GCGCCTGCCC CGCTCCGCCA CCGGCAAGGT CCAACGCGCC GAGCTGCGCC GACGCGACAC GGCCTGA
|
Protein sequence | MTTTGTEAAA FETPGPDALA EGERLYRRLR SEQRWEVPER YNMAVDVADH HPGDRVALVF EDHTGHRDEV TWDRVRDRAA RVAAHLHALG VREGDRVAVL LPQRPDTPAT YLGVLRTGAV LVTMSLLWAD DPIAYRLADS GSTVLITEDA ALERAAGAVR AARASHGLDA DVRVVSVDDP AVAAREPRHE AADTAADDPA LIFYTSGTTG AAKGIVHAHR TLLGHNEFRY CHDLRPGDVF YGAGDWAWSM AKLMGPLRAG ATHLVHRPRA GFDPEGLLAA MARNRVTTAL VNPTLLRKLR AGVPDAGRRH PQSLRVVCCS NEPLTTDLIT WFREQFGVTP FDYYGSTESY PLLGNMPGIP VKPGSVGRPL PGWDVVLLDA QDREVPDGEP GEICLRARSN PQFPLGYWNR PEASAETFGG TWYRTKDQAT RDADGYFWFL GRTDDVIKTS GYRVGPYEVE AALREHPAVA DAGVVGLPDP LRGQVVKAWI ELADGYEPSE ELADRIRAFA REHHSLFAYP RLIAFEERLP RSATGKVQRA ELRRRDTA
|
| |