Gene Information |
Locus tag | Ndas_0429 |
Symbol | |
ID | 9244268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 519424 |
End bp | 521142 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_003678382 |
Protein GI | 297559408 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.312463 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAGAG AGACGGCAGC CGCGGGCGCG GCCGAGTTCC GCGCCGCCCG GGACCTGCTC CTGGACCTGC GCGAGGACCA GGGGAGGGCG CACCGGGAGT TCACCTGGCC CCGGTCGGAC CGGTTCAACT GGGCGCTGGA CTACTTCGAC GAGGTGGCCG GGCGCCGTCC GGACCGGACG GCGCTGTGGA TCGTGGAGGA GGACGGCGGC CAGGCCAGGT ACACCTACCG GCGGATGTCG GAGCGCTCCG CCCAGGTGGC CAACTGGCTG AACAACCAGG GCGTGCACCC GGGCGACCGG ATCCTGCTGA TGCTGGGCAA CCAGGTGGAG CTGTGGGAGA CCCTGCTCGC CGCGACCAAG CTCGGCGCCG TGGTCTGCCC CACCCCCACC TCCCTGTCCG AGGGCGACCT GCTGGACCGG CTGGAGCGCG GCGAGATCGC GCACGTGGTG TGCTCGGCCG CCGAGACCGA GAAGTTCGCC CCGCTGCGGG GGCACTGGAC GCGGATCTGC ACGGGCTACA TGGAGGGGTG GCTGAACTAC GCCGACTCCG AGCACGCCGG TCTGGACTTC CACCCCCCGC ACACCACCCA CCCCGACGAC CCCCTGCTGC TGTACTTCAC CTCCGGCACC GCCACCCTGC CCAAGCTGGT CGTGCACACC CAGCGCTCGT ACCCGGTGGG GCACCTGTCG ACCATGTACT GGCTGGGTGT GCGGCCGGGC GACGTCCACC TCAACGTGTC CGAGCCGGGG TGGGCCAAGC ACGCCTACGG CAGCGTGTTC GCGCCGTGGA ACGCCGAGGC GACGGTGCTG GTCGTCAACC AGGAGCGCTT CGACGCGGCG GGGCTGCTGG ACGCGATCGT GCGCTGCGGG GTGGACACGC TGTGCGCGCC GCCGACGGTG TGGCGGACTC TGGCGCAGGC CGACCCCGCC GCGTGGGACG TGGGGCTGCG CGAGGCGGTG GCCGCCGGGG AGCCGCTCAA CCCCGAGGTG GTGGACCGCG TGCGCGAGGC ATGGGGCGTG ACCGTGCGCG ACGGGTTCGG GCAGACCGAG ACGACCGTAC TGCTGGGCAA CGGCCCCGGT CAGCGCGTGG TGCGCGGGTC GATGGGGCGG GAGATGCCGG GCTACGACGT GGTGCTGACG GACCCGGCCA CCGACGAGCC CGCCGACACC GGGCAGATCT GCGTGGACCT GGCCCGGGAG CCGGTGGGGG TGATGAAAGG CTACGCCGAC AACCCCGGCC TCAGCCGCGA GGTGGTCCGC GGGGACCGCT ACCGCACCGG TGACATCGCC AGCCGGGATT CGAACGGTTA CATTACCTAT ATCGGTCGAT CCGACGATGT GTTCAAGGCC TCAGATTACC GCATCTCACC ATTCGAACTC GAAAGCGTTC TGGTCGAGCA CGAATACGTG GTCGAGGCGG CCGTCGTTCC CTCCCCCGAC CCGCTGCGGC TGGCGGTGGC CAAGGCGTAC GTGGCCCTGG CCGAGGGGGT GGCCCCCGAC GCCGAGACCG CGCGGTCGAT CCTGGCCCAC GCGCGCGAGC GCCTGTCACC GCACCAGAGG GTTCGCCGCC TGGAGTTCGG TGAGCTGCCC AAGACCGTCT CCGGTAAGAT CCGTCGCGTG CAGCTGCGTC GGGCCGAGGC CGAGCGCGGC ACGGTCGCCG ACGGCGCGCG CAACCCGCGC GAGTACTGGG AGGAGGACCT CCCGGGGTTG GAGCGGTGA
|
Protein sequence | MSRETAAAGA AEFRAARDLL LDLREDQGRA HREFTWPRSD RFNWALDYFD EVAGRRPDRT ALWIVEEDGG QARYTYRRMS ERSAQVANWL NNQGVHPGDR ILLMLGNQVE LWETLLAATK LGAVVCPTPT SLSEGDLLDR LERGEIAHVV CSAAETEKFA PLRGHWTRIC TGYMEGWLNY ADSEHAGLDF HPPHTTHPDD PLLLYFTSGT ATLPKLVVHT QRSYPVGHLS TMYWLGVRPG DVHLNVSEPG WAKHAYGSVF APWNAEATVL VVNQERFDAA GLLDAIVRCG VDTLCAPPTV WRTLAQADPA AWDVGLREAV AAGEPLNPEV VDRVREAWGV TVRDGFGQTE TTVLLGNGPG QRVVRGSMGR EMPGYDVVLT DPATDEPADT GQICVDLARE PVGVMKGYAD NPGLSREVVR GDRYRTGDIA SRDSNGYITY IGRSDDVFKA SDYRISPFEL ESVLVEHEYV VEAAVVPSPD PLRLAVAKAY VALAEGVAPD AETARSILAH ARERLSPHQR VRRLEFGELP KTVSGKIRRV QLRRAEAERG TVADGARNPR EYWEEDLPGL ER
|
| |
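The coordinate, length, and GC fields above are related by simple arithmetic, which the following minimal sketch makes explicit. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon (the gene sequence shown ends in TGA). The function names are hypothetical helpers written for this illustration and are not part of any database API.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of 10."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values copied from the record for Ndas_0429.
length_bp = gene_length(519424, 521142)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)  # 1719 572
```

Both results match the Gene Length and Protein Length fields. To check the GC content field, paste the Gene sequence value into `clean_sequence` and pass the result to `gc_percent`; if the record is internally consistent, the cleaned string should be 1719 characters long.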