Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1751 |
Symbol | |
ID | 9245601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2138402 |
End bp | 2140018 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_003679685 |
Protein GI | 297560711 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.748681 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0123673 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCGAGG GATGCGTCCC CTGGCCGGAG GAGTTCGCCC GACGGTACCG GCGGGAGGGG TACTGGCTCG GCCGGTCCCT GGCCGAACTC TTCGACACCT GGTGCTCCGC GCACCCGGAG CGCACCGCGC TGGTCTGCGG CTCCCGGCGC TGGACCTACC GGGAACTGCA CGAGCGGGTC GCCCGCACGG CCGGGGGCCT GCGGCAGCGC GGGCTGGGCG GCGGCGACCG GGTCCTGGTG CAGCTGCCCA ACACCGCCGA GTTCGTGACC GCCCTGTGCG CGCTGCTGAG GATCGGCGCC ATCCCCGTCC TGGCGCTCAC CTCCCACCGC CGCGCCGAGC TGCTGGAGCT GTGCCGGGTC TCGGAGGCGG TCGCCCACCT CGTCCCGGAC CGGCACCGCG GCCACGACCA CCGGGAGGAG GCCGCGCGGG TACGGGCGGA CGCCGGGGGC GGCCTGGACG TCATCGTCGA CGGGGACCCG GGCGCCTTCA CCCGCCTGGC CGACGTGACC GGCCCGCCGG CTCCGGCGGC CGCGACCGAC CCGGGGGAGG TCGCCCTCTT CCTGCTCTCG GGCGGCACCA CCGGCCGGTC CAAGCTCATC CCGCGGACCC ACGACGACTA CGCCTACAAC GTCCGCATCA CCTCCGACAA CGCCGGACTC ACCCCCGACG ACGTCTACCT GTGCGTCCTG CCCGCCTCGC ACAACTACGC CCTCGGCTGC CCCGGGGTGC TCGGCGCCCT CTCCCGCGGC GCCACGGTCG TGCTCAGCGA CAGCGCGGAC GCCGAGGACG CCTTCGCCCT GGTCGAGGAG GAGGGCGTGA CCGTCACCGC CCTGGTGCCC TCCCTGGCCG CGCTGTGGAC GGAGGCCGCG GACCTCACCC ACCGCGACCT CTCCACCCTG CGGCTGGTCC AGGTCGGCGG CAGCAGGGCC TCGGCCGACG ACGTCGCCGA GACCGAACTC GCCCTGGGGT GCCGCGTCCA GCAGTCCTTC GGCATGGGCG AGGGCATCCT CAGCCAGTCC GGTCCCGACG ACTCCTTCGG CATGCGCACC ACCACGCAGG GCCGCCCCCT GTCACCGGCC GACGAGGTGC GCGTGGTGGA CGAGGACGAC CGCCCCCTGC CCGCGTGCAC CACCGGACGC CTCCAGGTCC GCGGCCCCTA CACGATCCGG GGCTACTACC GGGCCGAGGA GGCCAACGCC GCCGCCTTCA CCCCGGACGG CTTCCTGCGC ACCGGAGACC TGGCCCGCCT GACCAGCTAC GGCCACCTGG TCGTGGAGGG CAGGACCAAG GACGTCATCA ACCGGGCGGG CGACAAGATC GCCGCGGCCG AGGTCGAGGA CGCCCTCACG GCCCTTCCGT CCGTCCGCTC CTGCGCGGTC GTGTCCGTGC CCGACCCCCT CCTGGGCGAG GCCTCCTGCG CCTTCCTCGT CTGCTCGGGG CCGCCGCCCT CCTCCGAGGA GGTCTCCCGG CACCTGCGCT CGCTGGGGCT GGCCGCCTTC AAGGTCCCCG ACCGGATCGA GAACGTGGCG GAGCTCCCGC TGACCAGGGT CGGCAAGGTG GACAAGGAAT GGCTGCGCCG CGGCCTGGAG GAAGCCGCGG ACCGGGGGAC CCGGTGA
|
Protein sequence | MLEGCVPWPE EFARRYRREG YWLGRSLAEL FDTWCSAHPE RTALVCGSRR WTYRELHERV ARTAGGLRQR GLGGGDRVLV QLPNTAEFVT ALCALLRIGA IPVLALTSHR RAELLELCRV SEAVAHLVPD RHRGHDHREE AARVRADAGG GLDVIVDGDP GAFTRLADVT GPPAPAAATD PGEVALFLLS GGTTGRSKLI PRTHDDYAYN VRITSDNAGL TPDDVYLCVL PASHNYALGC PGVLGALSRG ATVVLSDSAD AEDAFALVEE EGVTVTALVP SLAALWTEAA DLTHRDLSTL RLVQVGGSRA SADDVAETEL ALGCRVQQSF GMGEGILSQS GPDDSFGMRT TTQGRPLSPA DEVRVVDEDD RPLPACTTGR LQVRGPYTIR GYYRAEEANA AAFTPDGFLR TGDLARLTSY GHLVVEGRTK DVINRAGDKI AAAEVEDALT ALPSVRSCAV VSVPDPLLGE ASCAFLVCSG PPPSSEEVSR HLRSLGLAAF KVPDRIENVA ELPLTRVGKV DKEWLRRGLE EAADRGTR
|
| |