Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0051 |
Symbol | |
ID | 9243878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 62377 |
End bp | 64017 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_003678009 |
Protein GI | 297559035 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00256371 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGCCCA ACGCCGAAGG CCCGCACGCC ACCTCCCGCA CGCGACACGT CCTGGAAACG GTGCGGGTGC TCGTCCGCGC CGGTGTGCTC AGCCCGGGAC GTCCCGACAG GGTCGTCCGC CAGCTCCGCG CGCTGCGCAC GTGGGGCGCC ACGATCGCCG GGGGCTACGC CGCCGCGAAC GAGCGCGTGC CCGACGCCGT CGCGGTCGTC GACGAGCTGG GCCCCACGAC CTTCCGCGAG ATGGACCTGC GCGGCCGCGA GCTGGCCAGG GACCTGCGCC GCGCGGGTGT GGGCGCGGGT GCGCGCGTGG GCGTGCTGTG CCGCAACCAC AGCGGTTTCG TGCAGACGAT GGTGGCCTGC GGGCGCTTGG GCGCCGACAC CGTGCTGCTG AACACGGGGC TGTCGGCGGG CCAGCTGGCC GCGGCCGTGG GGGGCAACGG GGTGTCGGTG GTCGTCGCCG ACGCCGAGTT CGAGGGGATG TGCGCCGAGC TCGATCCCGG TGTCGTGCGG GTGACGGCCT GGGGGGAGCC CTCCGACGGG GGCCCGGGGG TGCCGTGGCG GGCGCGGGAG GTCTCGGAGT CCGATCCGGA GCCGCCGGAG CGCCCCGGCC GTCTGGTCGT GCTGACGTCG GGGACGACGG GCGCGCCCAA GGGGGCGCGG AGGCCCACTC CGAGGGGGCC GCAGGACGCG GCGTCGGTGC TGTCGCGCAT CCCGCTGCGG TCCTCGGACC GCATCCTGGT GTCGGCGCCG ATCTTCCACA CGTGGGGGCT GGCGGGTGTG CAGCTGGGCA TGTCGATGCG GGCGACGTTG GTGATGAGGC GGCTGTTCGA GCCGGAGGAC GCTCTGCGCA CGGTCCAGGA GGAGCGGTGC ACCGCGCTGT TCGCGGTGCC GGTGATGCTG CGGCGGATCA TGGACCTGCC CCGGTCCACG CGCGAGCGGT ACGACACGTC GTCGCTGCGG ATCGTGGCGT GCAGCGGCTC GGCGATGAGC CCGCAGCTGA TCACGGGTTT CATGGACGCG TTCGGGGACG TGCTCTACAA CCTGTACGGG TCGACCGAGG TGTCGTGGGC GACGATCGCG ACTCCCGGGG AGATGCGGCT GTCCCCGACG ACGGCGGGCC GTCCGCCCCT GGGGACGCGG ATCGCGGTCC TGGACCCCGG CGGGGTGGAG GTGCCTCCGG GGACGGAGGG GTCGATCCAC GTCGGCAACG ACATGCTCTT CGACGGCTAC ACGGACGGGC GGAGCAGGAG GGTCGCGGAC GGGTTGATGG AGACCGGCGA CCGGGGTCAC GTGGACGCGA ACGGACTGCT GCACGTGGCG GGCCGGGACG ACGACATGAT CGTCTCGGGC GGGGAGAACG TGTTCCCCCG GCCGGTCGAG GAGGCGATCG TGACGCTTCC CGAGGTGCGT GAGGTCGTGG TGACGGGTGT GCCCGACGAG GAGTTCGGGC AGCGGTTCGC GGCGTACGTG GTGCCGCACG AGGGCCGGGG CGTGGATCCG AATCAGGTGC GGACGCACGT GCGGCGGGTG CTCGGGCGGT TCTCGGTGCC GCGCGACGTG GTGGTCGTGG ACGAGCTGCC GCGCAACGCC ACGGGCAAGG TGGTCAAGCG CCTGCTGCCC CGTCGGGAGG AGCGCGGATA G
|
Protein sequence | MAPNAEGPHA TSRTRHVLET VRVLVRAGVL SPGRPDRVVR QLRALRTWGA TIAGGYAAAN ERVPDAVAVV DELGPTTFRE MDLRGRELAR DLRRAGVGAG ARVGVLCRNH SGFVQTMVAC GRLGADTVLL NTGLSAGQLA AAVGGNGVSV VVADAEFEGM CAELDPGVVR VTAWGEPSDG GPGVPWRARE VSESDPEPPE RPGRLVVLTS GTTGAPKGAR RPTPRGPQDA ASVLSRIPLR SSDRILVSAP IFHTWGLAGV QLGMSMRATL VMRRLFEPED ALRTVQEERC TALFAVPVML RRIMDLPRST RERYDTSSLR IVACSGSAMS PQLITGFMDA FGDVLYNLYG STEVSWATIA TPGEMRLSPT TAGRPPLGTR IAVLDPGGVE VPPGTEGSIH VGNDMLFDGY TDGRSRRVAD GLMETGDRGH VDANGLLHVA GRDDDMIVSG GENVFPRPVE EAIVTLPEVR EVVVTGVPDE EFGQRFAAYV VPHEGRGVDP NQVRTHVRRV LGRFSVPRDV VVVDELPRNA TGKVVKRLLP RREERG
|
| |