Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2679 |
Symbol | |
ID | 9246530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3190003 |
End bp | 3193149 |
Gene Length | 3147 bp |
Protein Length | 1048 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Superfamily I DNA and RNA helicase |
Protein accession | YP_003680600 |
Protein GI | 297561626 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.357877 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0252741 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAGCA ACGCCCGCCA ACGGATCATC GAGCAGGAAC AGACCGCTGT GGATCGGGCC CACCGATGTC TGGAACGGCA GCGCGGCCAG ACCACACGGC TGGCCACCGC CGACGCCGCC GCCAGCGCCA AGGACAGCGT GGCCCAGCAC GAGGAGTACG CACGCTGGGT CGCGCAGTAC GAACTCGGCG GACAGCAGCT CGTCGTCCAA CGTGTGGATC TTCAGGAGGA GAGCGGCGAC GAGACCTTCT ACGTGGGCCG CAGAAGTGTC CGGGATGAGG ACGGGAACGT CTTCGTCGTC AAGTGGTCCA GCCCGGCGGC GGTCCGCTGG CGCCGTGAGC GGGGAACCGA GAAGGGCGCC GTGACGCTTC GCCGACGCCT GCGCTGCCAC GGCGAACGCG TGGTGGACTA CCACGACGAA CTCGTCCGCA AGGCCGACGA GCCTTCCTCG GAGCAGGCTT CCCCGGCCCA GGTGGCGGTC GCTATCCGGG CCCGGAAGGA GCAGGAGGCG GGCACGGCAC AGGACCCGTT CCTCCTCCGG GAACTGGACC GCTCCCGCGA CGGTCTCATG CGCGACATCG TCGAGACGAT CCACCGCGAC CAGCTCGACC TGGTCTGCCA CGACCGGCCC GGGGCGCTGG TGGTCCAGGG CGGTCCGGGG ACCGGCAAGA CCGCGATCGG CCTGCACCGT GTCACCTGGC TGCTGGACAA CGACCACTTC ACCCCGGGCC AGATCCTCGT CGTCGGCCCC AGCCAGTACT TCCTCGACTA CGTCAGCGAG GTGCTGCCCT CACTGGGCAC ACGCGGCGTG ACCTCACTGC GCGTCGACGA CCTGTGCCCG GGGCGAAGCG GAGGCCACGA CACCGCCGAG CAGCACCGGA TCAAGTCCGA TGCCCGGATG GCCGGGGTCC TGCGCCAGGC GGTCCGCCAG ACGGTACGCG CGGGGGCCTG GTCGAAGTTC CTCGACGGCG ATCAGCTCAG AATCCGGGTC GACGAGAGCC CCTTGGCCGT GGCCGAGGCC GACATCGAGC GGATCTTCGA CGAGGTGCTG GAGTCGGACG CACCCCTCAA CACCCGCCGC CAGCGCTTCA CCGACCTGCT CGTGGACCGC ATGCTCGACC AGGTCTCCTA CCGGCACCGC CGCCAGGGAC AGGCGCTGCG CCGTCGCATC ACGTCCTCCC TGGCCGGACC CGTCAACGCC ACCTGGCCCC GGATCTCCGA GAACCGCCTC TACCGCAGGC TGCTCGGCGA CGCCCAGGTC CTGGGACAGG CCTCCGAGGG CGTCCTCACC GCCGAGGAGC AGTCCGCCCT GTACCGGCTC ACGGCCGCCC AGACCGGGCA GGAGTCCTGG ACCAGCGCCG ACCTGCTCTG TCTGGAGGAA CTGCGGATCC TCCTCACCGG AGACACCCCC GACCGCTACC GGCACATCGT GGTGGACGAG GCACAGAACC TCACCCCCAT GCAGCTGCGT GCCCTGGCCC GCCGCTGCCC GAGCGGTTCC CTCACCATCC TGGGCGACCT CGCCCAGTCC ACGGGCACGC ACAGCCACAC CGACTGGGCG GCCCTGACCG ACCACCTGGA ACTGCCCGAC GGCTGGGAAC TACAGGAACT CACCCTCGGA TACCGCATCC CCAGCCAGGT GATGCACACG GCCGTCCCCG CCGCCGTGGC CGCCTCCGAA CTGACCACCT TCCCCGAAAC GCTCCGCGAG CCCCGGGACG GGGAGCTGAC CATGGCCCGT CTCGCCCCCG AGGACCTGAT CGATGGTGTC CGTGAGCGCG CGACCGAACT GCTGGCCAAG GGAGGCGAGC GCTCCGTGGC GGTCATCGCC GACGACGCCT CCCCGCACCT GAAGTCGATC ACGGACGCCT TGGCGACCGG CCCCGCACCC GCCGAGGGCA CCGTCCGAGC CCTGGCGGCC TCCGACGTCA GCGGCCTGGA GTTCGACCAC GTCATCCTGG TGGAGCCGCG CCAGATCTCC GACGCCGGCC CCGGAGGCCA CGGACGGCTC TACGTCGCTC TGACCCGGTG CACCCAGACC CTGACCGTCC TGCACACCGG ATCTCTGCCC GACACCCTCG TCGACCCGTT CGCCCCCGTG ACAGACCACG AGAGAACCTG CACCCGCCAC CACGCCGACG GTCAGCGGTG CCGCAACCGC ACGAGTTCCC CGGACGGCTG GTGCCGGCAG CCCGGATGCG GCGGTTACCG CACCAGGCGG GCCCGGCGCC CGGAGGGCCA GCAGACCGTG CTGGGTTCGC CCGCCGGGCT GGACACCGGA GCCCGGTTGG AGCCCTCCGT GCGGGCTGCC ACGATCACCG TCAGCGCCGC GGCGCGCGCC CGCTTCGCGG TCCGCCACAG GGCCACGGCC AGGGAGGCCG AGGTCGAGAT CCGGGCCATG CTCGGCGACT TCCTGACCGA GGGCCGACAG GCCCGGCGCA CGGACGGGTA CTGGCACCTG GAGCGCGACG GCTACCGGTT GGTCCTGGAC CGGTCCGCCG CGTCGGTGGT CGACTACCAG ACCGTGCACG CCGAACGCAG CTGGGCCCAG CACAGGGCGG GGATCGACTC CCGGATCTCC CAGCGCACAA GGCACAACGA CACCACCACA AGGCACCGTG ACACCGCGAC CGAGCGGATC GAGGAGCAGC CGATGAGCGC GCCCACACCA CCGGTTCCGC CGCAGCCCGG GCCCCAGGAC CCCACCGCGG CCGAGCACCT GCGCCTCTTC CTGGCCGGAA CCGCACAGCG GGAGGAGGCC CGGGACCAGA GCGTGTACGG CTTCCTGCGC CACAGCCTCA TCGCCGACCT GTACCGGGCG GGGAGCCGAC CGGACGACCA AGAGAACGGC GACGTACTCT GCCATCTCTC CGGCCTGTCC GTGCTGTACC GCGTTCTGCC CGAGGAGGAC ACGGGTTACG AAAGGCTGCG CCGCGAGGCC CTGGAGTTGC TGGAGGCGCG CTGGGCCCGG GGAGCGGAAG CCGACCGGGT GTGCCTCGTC CTCCCCGCTC CTCCTGAGGA GGACTGGTCG GCCCCGGCGC TGCTCGGCGC CCTGGGGGTG TCGGTGATCT GGCGGGAGGC GGACACGTGG CGCGGCGAGA ACGCGGCCCT GATCGCCGGG GAGCGCACGC ACGCCGACCG GGCCTGA
|
Protein sequence | MGSNARQRII EQEQTAVDRA HRCLERQRGQ TTRLATADAA ASAKDSVAQH EEYARWVAQY ELGGQQLVVQ RVDLQEESGD ETFYVGRRSV RDEDGNVFVV KWSSPAAVRW RRERGTEKGA VTLRRRLRCH GERVVDYHDE LVRKADEPSS EQASPAQVAV AIRARKEQEA GTAQDPFLLR ELDRSRDGLM RDIVETIHRD QLDLVCHDRP GALVVQGGPG TGKTAIGLHR VTWLLDNDHF TPGQILVVGP SQYFLDYVSE VLPSLGTRGV TSLRVDDLCP GRSGGHDTAE QHRIKSDARM AGVLRQAVRQ TVRAGAWSKF LDGDQLRIRV DESPLAVAEA DIERIFDEVL ESDAPLNTRR QRFTDLLVDR MLDQVSYRHR RQGQALRRRI TSSLAGPVNA TWPRISENRL YRRLLGDAQV LGQASEGVLT AEEQSALYRL TAAQTGQESW TSADLLCLEE LRILLTGDTP DRYRHIVVDE AQNLTPMQLR ALARRCPSGS LTILGDLAQS TGTHSHTDWA ALTDHLELPD GWELQELTLG YRIPSQVMHT AVPAAVAASE LTTFPETLRE PRDGELTMAR LAPEDLIDGV RERATELLAK GGERSVAVIA DDASPHLKSI TDALATGPAP AEGTVRALAA SDVSGLEFDH VILVEPRQIS DAGPGGHGRL YVALTRCTQT LTVLHTGSLP DTLVDPFAPV TDHERTCTRH HADGQRCRNR TSSPDGWCRQ PGCGGYRTRR ARRPEGQQTV LGSPAGLDTG ARLEPSVRAA TITVSAAARA RFAVRHRATA REAEVEIRAM LGDFLTEGRQ ARRTDGYWHL ERDGYRLVLD RSAASVVDYQ TVHAERSWAQ HRAGIDSRIS QRTRHNDTTT RHRDTATERI EEQPMSAPTP PVPPQPGPQD PTAAEHLRLF LAGTAQREEA RDQSVYGFLR HSLIADLYRA GSRPDDQENG DVLCHLSGLS VLYRVLPEED TGYERLRREA LELLEARWAR GAEADRVCLV LPAPPEEDWS APALLGALGV SVIWREADTW RGENAALIAG ERTHADRA
|
| |