Gene Information |
Locus tag | Ndas_3018 |
Symbol | |
ID | 9246871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3602954 |
End bp | 3605800 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | excinuclease ABC, A subunit |
Protein accession | YP_003680934 |
Protein GI | 297561960 |
COG category | |
COG ID | |
TIGRFAM ID | |
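The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Coordinates copied from the record above.
START_BP = 3602954
END_BP = 3605800


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "2847 948"
```

Under these assumptions the computed values agree with the Gene Length (2847 bp) and Protein Length (948 aa) fields in the record.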
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.244171 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.490973 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCTCACT CGATGGTCGA ACAACTGGTA GTCCGGGGAG CCCGGGAGCA CAACCTCAAG GACGTCTCGC TGGACCTGCC GCGCGACTCC ATGATCGTGT TCACCGGCCT GTCCGGTTCG GGCAAGTCGT CGCTGGCCTT CGACACGATC TTCGCCGAGG GCCAGCGGCG CTACGTGGAG TCGCTGTCGG CCTACGCCCG CCAGTTCCTG GGGCAGATGG ACAAGCCGGA CGTGGACTTC ATCGAGGGCC TGTCCCCGGC GGTGTCCATC GACCAGAAGT CCACCAGCCG CAACCCCCGC TCCACAGTCG GCACCATCAC CGAGGTCTAC GACTACCTGC GGCTGCTGTG GGCGCGCGTC GGCGTCCCGC ACTGTCCCGA GTGCCGCCGC GAGATCGCCC GCCAGACCCC GCAGCAGATC GTGGACCGCG TCCTGGAGAT GGAGGAGGGC ACCCGCTTCC AGGTGCTCGC CCCCGTCGTG CGCGGACGCA AGGGCGAGTA CGTCGAGCTG TTCAAGGACC TGCAGTCCAA GGGCTACACC CGCGCGGTGG TGGACGGGCA GGCCGTGCGC CTGGACGAGG CGCCCAAGCT GGGCCGCTAC GACAAGCACG ACATCGCCGT GGTCGTGGAC CGCCTCAGCG TCAAGCCCTC CTCCCGCGGG CGCCTGACCG ACTCGGTGGA GACGGCGCTC AAGCTGGCCG GGGGCACCAT CATCCTGGAC TTCGTGGACG TGGAGGCCGG GGACCCCGAC CGCGAGAAGG TCTTCTCCGA GCACCTGTAC TGCCCTTACG ACGACCTGTC CTTCGAACAG CTCGAACCGC GCTCCTTCTC CTTCAACGCC CCCTACGGCG CCTGCGCCGA GTGCTCGGGC CTGGGCACCC GCATGGAGGT CGACCCCGAA CTCCTGGTCC CCGACCCCGA GAAGACGCTG GCCGAGGGCG CCATCGGCCC CTGGTCCGGC GGGCCCAACA GCGGCTACTG GGAGCGCATC CTCAAGGCGG TGGGCGAGGC GATCGGCTTC GACCTGGACA CCCCCTGGGA GCGGCTGCCG CGCCGCGCGC GCAAGGCCCT GCTGGAGGGG CACGACACCC AGGTCCACGT CAGCTACCGC AACAGGTACG GGCGCAACCG CTCCTACTAC ACCGAGTTCG AGGGCGTCAT CCCCTGGGTC AAGCGCCGCC ACTCCGAGAC CGAGAGCGAC TACGGCCGCG AACGGCTGGA GGGGTACATG CGCACCGTGC CCTGCCCGAC CTGCGAGGGC ACCCGCCTCA AGCCGGTCGT GCTCGCGGTG ACCGTGGGCG GCAAGTCCAT CGCCGAGGTG GCCCAGATGC CGCTCAGCGA CAGCGCGGCC TTCCTGGCCG GGCTCACGCT CTCCGAGCGC GACGCGGTCA TCGCGGCCCA GGTGCTCAAG GAGATCAACG CCCGGCTCGG CTTCCTGCTC GACGTCGGCC TGGACTACCT GAGCCTGGCG CGCTCCTCGG GCTCGCTCTC GGGCGGGGAG GCCCAGCGCA TCCGCCTGGC CACCCAGATC GGCTCCGGCC TGGTGGGCGT GCTGTACGTG CTGGACGAGC CCTCCATCGG CCTGCACCAG CGCGACAACG CGCGCCTGCT GGAGACCCTC CAGCGCCTGC GCGACATCGG CAACACGCTC ATCGTCGTGG AGCACGACGA GGACACCATC CGCGCCGCCG ACTGGGTCGT GGACATCGGC CCCGGCGCGG GTGAGCACGG CGGCCACGTC GTGGTCTCGG GGATCGTGGA CGAGCTGCTC ACCTCCGAGG ACTCCCTCAC CGGCGAGTAC 
CTGTCCGGCA AGCGCGGCAT CGAGGTGCCC GTGGAGCGCC GCCCCCTCAC CCGGGGGCAC GAGCTGGTGG TCCGGGGAGC GCGCGAGAAC AACCTCCACG GGGTCGACGT CGCCTTCCCG CTGGGCGTGT TCACCGCCGT CACCGGCGTG TCCGGCTCGG GCAAGTCCAC CCTGGTCAAC GAGATCCTCT ACAAGGCGCT GGCCAAGGAG CTCAACGGGG CGCGCGACGT GCCCGGCCGC CACCTGCGGG TCAACGGCAT GAACAAGGTC GACAAGGTCG TGCACGTGGA CCAGAGCCCC ATCGGGCGGA CCCCGCGCTC CAACCCGGCC ACCTACTCGG GGGTCTTCGA CCACATCCGC AAGCTGTTCG CGCAGACCAC CGACGCCAAG ACGCGCGGCT ACCAGCCGGG CCGGTTCTCC TTCAACGTCA AGGGCGGCCG CTGCGAGGCC TGCTCCGGCG ACGGCACGCT CAAGATCGAG ATGCAGTTCC TGCCCGACGT CTACGTGCCC TGCGAGGTGT GCCACGGCGC CCGGTACAAC CGGGAGACCC TCCAGGTCCG CTACAAGGGC AAGAACATCT CCGAGGTCCT CAACATGCCG ATCTCGGAGG CCCTGGAGTT CTTCGAGCCG ATCAACGCCA TCCGCCGCCA CCTCCAGACC CTGGCCGACG TCGGCCTGGG CTACGTGCGG CTGGGCCAGC CCGCCACGAC GCTGTCGGGC GGTGAGGCGC AGCGGGTCAA GCTCGCCGCC GAACTCCAGC GCCGCTCCAC CGGGCGGACG GTGTACGTGC TCGACGAGCC CACGACCGGC CTGCACTTCG AGGACATCCG CAAGCTGCTG GGCGTGCTCA ACCGCCTGAC CGACACCGGC AACACGGTGA TCGTCATCGA GCACAACCTC GACGTCATCA AGACGGCCGA CCACGTCATC GACATGGGCC CCGAGGGCGG CTCCGGTGGC GGCACCGTGG TCGCGCAGGG AACCCCGGAG GAGGTCGCCG CGGTGGCCGA GTCCTACACC GGGCGTTTCC TGGCCAAGAT GCTCTGA
Protein sequence | MSHSMVEQLV VRGAREHNLK DVSLDLPRDS MIVFTGLSGS GKSSLAFDTI FAEGQRRYVE SLSAYARQFL GQMDKPDVDF IEGLSPAVSI DQKSTSRNPR STVGTITEVY DYLRLLWARV GVPHCPECRR EIARQTPQQI VDRVLEMEEG TRFQVLAPVV RGRKGEYVEL FKDLQSKGYT RAVVDGQAVR LDEAPKLGRY DKHDIAVVVD RLSVKPSSRG RLTDSVETAL KLAGGTIILD FVDVEAGDPD REKVFSEHLY CPYDDLSFEQ LEPRSFSFNA PYGACAECSG LGTRMEVDPE LLVPDPEKTL AEGAIGPWSG GPNSGYWERI LKAVGEAIGF DLDTPWERLP RRARKALLEG HDTQVHVSYR NRYGRNRSYY TEFEGVIPWV KRRHSETESD YGRERLEGYM RTVPCPTCEG TRLKPVVLAV TVGGKSIAEV AQMPLSDSAA FLAGLTLSER DAVIAAQVLK EINARLGFLL DVGLDYLSLA RSSGSLSGGE AQRIRLATQI GSGLVGVLYV LDEPSIGLHQ RDNARLLETL QRLRDIGNTL IVVEHDEDTI RAADWVVDIG PGAGEHGGHV VVSGIVDELL TSEDSLTGEY LSGKRGIEVP VERRPLTRGH ELVVRGAREN NLHGVDVAFP LGVFTAVTGV SGSGKSTLVN EILYKALAKE LNGARDVPGR HLRVNGMNKV DKVVHVDQSP IGRTPRSNPA TYSGVFDHIR KLFAQTTDAK TRGYQPGRFS FNVKGGRCEA CSGDGTLKIE MQFLPDVYVP CEVCHGARYN RETLQVRYKG KNISEVLNMP ISEALEFFEP INAIRRHLQT LADVGLGYVR LGQPATTLSG GEAQRVKLAA ELQRRSTGRT VYVLDEPTTG LHFEDIRKLL GVLNRLTDTG NTVIVIEHNL DVIKTADHVI DMGPEGGSGG GTVVAQGTPE EVAAVAESYT GRFLAKML
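The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below is an illustration rather than the database's own method. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the strand is "-"), and it relies on the fact that table 11 assigns amino acids to codons the same way the standard code does. The names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping shared by the standard code and table 11,
# built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record above.
fragment = "ATGTCTCACT CGATGGTCGA ACAACTGGTA"
print(translate(fragment))  # prints "MSHSMVEQLV"
```

The fragment translates to the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the rounded GC content field.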