Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1745 |
Symbol | |
ID | 9245595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2123254 |
End bp | 2125131 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | amino acid adenylation domain protein |
Protein accession | YP_003679679 |
Protein GI | 297560705 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.825969 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACCA ACAGTGCGGT GAGCGGCCTG GAGGGGAAGA CCTTCCTCGC CGGCCTTCTC GAACGCCAGG CCAGGAACCA GCCGGACGCG ACCGGCCTCG TGTATGCGGG CCGCGCCCAC ACCTACGCCG ACCTCAACGC GAGGGCCAAC CGGCTCGCGC GTGCGCTCAT CGACGCCGGG GTGGGGCCGG AGACGCGCGT GGCGGTGTCG ATGCGGCGCT GCCCGGGTGC GATCGTCGCC CTCTTCGCCG TGCTCAAGGC GTGCGGCGTC TACCTTCCGA TCGACGCCAC CCACCCCCGA GAGCGCATCG GATACGTCCT GGCCGACAGC GCCCCGAAGG TCGCCGTCAC CGATGACCGG GGCGCCGACG CCCTCGGCCG CCACGGTGTG CCGATGACGT TCCTGCGCCT GGGAGAGGAC GGCGACACCG GCGGACACCC CGCCGACCAC GACGTCCGCG ACGAGGAGCG CCGGTCACCG CTGCGCCCCG ACAACCTCGC GTACGTCATG TACACCTCCG GCTCCACGGG CAGACCCAAG GGCGTGCAGA TCTCCCAGTC CAGCCTGGGC CTGTACAGCC GCCACTACGG CCGGTTCTTC GACGAGGTGG ACGCCGGACG GCGCCTGCGC ATCGCCCACA CCGCCGCGCT GACCTTCGAC GTCGGCTGGA ACTCCGTCAT CGGCTTGGCC GCGGGCCACG AGATGCACCT CTACGCGGAG GAGGACTACC GGGACGTCGA TCGCTTCGTG CGGATCATGA GCCGACACCG ACTCGACTGC GTCGTGTTCA CCGCGTCCTA CTGGGGGGCG CTGGTGCAGT CCGCGGAGTG GGGCAGGGGG GAGCACACGC CGCGGGTGCT GCTCTCCTGC GGGGAGGCGT TCCCGAACGC CCTCTGGCAG CGGCTCCGGG GGATCGAGGG AACCCGCGTG ATGAACACCT ACGGGCCGAC CGAGGCCACC GTGGAGGCGG TGGCCACCGA CACCGACGCC ACCCCCCGCC CCACGCTGGG CACACCGATC CCGGACACGG GCATCCACGT CCTGGACGAC GCGCTCGCGC CCGCGCCGAC CGGGTCGCCC GGCGAGCTCT ACATCACCGG CGCCCGCCTC GCCCGCGGGT ACCTCAACCG ACCCGGGCTG ACCGCCGAGC GGTTCGTCGC CTCGCCGTTC GCCCCGGGAG AGCGCATGTA CCGGACGGGT GACGTCGTCC GGCGGAACGA CGTCGGCGAT CTGGAGTTCC TCGGGCGCGT CGACGACCAG GTGAAGATCC GCGGCTTCCG TGTCGAGCCG GGGGAGGTCG AGGCGGTGCT CGCCTCCCAC CCCGCCGTCT CCCGGGCCGC GGTCGTGGTG CGGGAGGACC GGGACGGGGC GCGCGGTCTC GTCGGCTACT TCGTCGTGGA CGGTGGCGGC GTGGACGAGG CCGAACTGCG CCGGCACCTG GGCAGGGCAC TGCCGGACTA CATGGTGCCC TCCGCCCTGT TGAGGGTGGA CGAGATGCCG CTCAACGCCA ACGGCAAGCT CGACAGGGGA GCGCTCCCGG AGCCGACGAG GAACGCGGAG AGCGCGCCGG ACCGGGCGGA CACGGTCGAG GAGGTCCTCC TCCACATCCT TCGCGAGGTG CTGGAGGAAC CCGGGCTCGG GCCCGGGGAC CTCTTCACCG AACGCGGAGG CGACAGCATC CGGGCGTTCC GCGTGGTCAC CCGGGCCCGG GACTCCGGAG TCGTGGTCTC CACGACCGAC GTCCTCAGGC ACCAGTCCGC GACGGCGATC GCAGGGGCGG CCACCGTGGA CGCCGGGGCC GGGTCGGACG GGGCCGGGCC GACCACGCGT GTCCCGGACC GCGAGGTCAG CGAACTCCAG AAGGAACTCG GCCTGTGA
|
Protein sequence | MATNSAVSGL EGKTFLAGLL ERQARNQPDA TGLVYAGRAH TYADLNARAN RLARALIDAG VGPETRVAVS MRRCPGAIVA LFAVLKACGV YLPIDATHPR ERIGYVLADS APKVAVTDDR GADALGRHGV PMTFLRLGED GDTGGHPADH DVRDEERRSP LRPDNLAYVM YTSGSTGRPK GVQISQSSLG LYSRHYGRFF DEVDAGRRLR IAHTAALTFD VGWNSVIGLA AGHEMHLYAE EDYRDVDRFV RIMSRHRLDC VVFTASYWGA LVQSAEWGRG EHTPRVLLSC GEAFPNALWQ RLRGIEGTRV MNTYGPTEAT VEAVATDTDA TPRPTLGTPI PDTGIHVLDD ALAPAPTGSP GELYITGARL ARGYLNRPGL TAERFVASPF APGERMYRTG DVVRRNDVGD LEFLGRVDDQ VKIRGFRVEP GEVEAVLASH PAVSRAAVVV REDRDGARGL VGYFVVDGGG VDEAELRRHL GRALPDYMVP SALLRVDEMP LNANGKLDRG ALPEPTRNAE SAPDRADTVE EVLLHILREV LEEPGLGPGD LFTERGGDSI RAFRVVTRAR DSGVVVSTTD VLRHQSATAI AGAATVDAGA GSDGAGPTTR VPDREVSELQ KELGL
|
| |