Gene Ndas_2679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2679 
Symbol 
ID9246530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3190003 
End bp3193149 
Gene Length3147 bp 
Protein Length1048 aa 
Translation table11 
GC content72% 
IMG OID 
ProductSuperfamily I DNA and RNA helicase 
Protein accessionYP_003680600 
Protein GI297561626 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.357877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0252741 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAGCA ACGCCCGCCA ACGGATCATC GAGCAGGAAC AGACCGCTGT GGATCGGGCC 
CACCGATGTC TGGAACGGCA GCGCGGCCAG ACCACACGGC TGGCCACCGC CGACGCCGCC
GCCAGCGCCA AGGACAGCGT GGCCCAGCAC GAGGAGTACG CACGCTGGGT CGCGCAGTAC
GAACTCGGCG GACAGCAGCT CGTCGTCCAA CGTGTGGATC TTCAGGAGGA GAGCGGCGAC
GAGACCTTCT ACGTGGGCCG CAGAAGTGTC CGGGATGAGG ACGGGAACGT CTTCGTCGTC
AAGTGGTCCA GCCCGGCGGC GGTCCGCTGG CGCCGTGAGC GGGGAACCGA GAAGGGCGCC
GTGACGCTTC GCCGACGCCT GCGCTGCCAC GGCGAACGCG TGGTGGACTA CCACGACGAA
CTCGTCCGCA AGGCCGACGA GCCTTCCTCG GAGCAGGCTT CCCCGGCCCA GGTGGCGGTC
GCTATCCGGG CCCGGAAGGA GCAGGAGGCG GGCACGGCAC AGGACCCGTT CCTCCTCCGG
GAACTGGACC GCTCCCGCGA CGGTCTCATG CGCGACATCG TCGAGACGAT CCACCGCGAC
CAGCTCGACC TGGTCTGCCA CGACCGGCCC GGGGCGCTGG TGGTCCAGGG CGGTCCGGGG
ACCGGCAAGA CCGCGATCGG CCTGCACCGT GTCACCTGGC TGCTGGACAA CGACCACTTC
ACCCCGGGCC AGATCCTCGT CGTCGGCCCC AGCCAGTACT TCCTCGACTA CGTCAGCGAG
GTGCTGCCCT CACTGGGCAC ACGCGGCGTG ACCTCACTGC GCGTCGACGA CCTGTGCCCG
GGGCGAAGCG GAGGCCACGA CACCGCCGAG CAGCACCGGA TCAAGTCCGA TGCCCGGATG
GCCGGGGTCC TGCGCCAGGC GGTCCGCCAG ACGGTACGCG CGGGGGCCTG GTCGAAGTTC
CTCGACGGCG ATCAGCTCAG AATCCGGGTC GACGAGAGCC CCTTGGCCGT GGCCGAGGCC
GACATCGAGC GGATCTTCGA CGAGGTGCTG GAGTCGGACG CACCCCTCAA CACCCGCCGC
CAGCGCTTCA CCGACCTGCT CGTGGACCGC ATGCTCGACC AGGTCTCCTA CCGGCACCGC
CGCCAGGGAC AGGCGCTGCG CCGTCGCATC ACGTCCTCCC TGGCCGGACC CGTCAACGCC
ACCTGGCCCC GGATCTCCGA GAACCGCCTC TACCGCAGGC TGCTCGGCGA CGCCCAGGTC
CTGGGACAGG CCTCCGAGGG CGTCCTCACC GCCGAGGAGC AGTCCGCCCT GTACCGGCTC
ACGGCCGCCC AGACCGGGCA GGAGTCCTGG ACCAGCGCCG ACCTGCTCTG TCTGGAGGAA
CTGCGGATCC TCCTCACCGG AGACACCCCC GACCGCTACC GGCACATCGT GGTGGACGAG
GCACAGAACC TCACCCCCAT GCAGCTGCGT GCCCTGGCCC GCCGCTGCCC GAGCGGTTCC
CTCACCATCC TGGGCGACCT CGCCCAGTCC ACGGGCACGC ACAGCCACAC CGACTGGGCG
GCCCTGACCG ACCACCTGGA ACTGCCCGAC GGCTGGGAAC TACAGGAACT CACCCTCGGA
TACCGCATCC CCAGCCAGGT GATGCACACG GCCGTCCCCG CCGCCGTGGC CGCCTCCGAA
CTGACCACCT TCCCCGAAAC GCTCCGCGAG CCCCGGGACG GGGAGCTGAC CATGGCCCGT
CTCGCCCCCG AGGACCTGAT CGATGGTGTC CGTGAGCGCG CGACCGAACT GCTGGCCAAG
GGAGGCGAGC GCTCCGTGGC GGTCATCGCC GACGACGCCT CCCCGCACCT GAAGTCGATC
ACGGACGCCT TGGCGACCGG CCCCGCACCC GCCGAGGGCA CCGTCCGAGC CCTGGCGGCC
TCCGACGTCA GCGGCCTGGA GTTCGACCAC GTCATCCTGG TGGAGCCGCG CCAGATCTCC
GACGCCGGCC CCGGAGGCCA CGGACGGCTC TACGTCGCTC TGACCCGGTG CACCCAGACC
CTGACCGTCC TGCACACCGG ATCTCTGCCC GACACCCTCG TCGACCCGTT CGCCCCCGTG
ACAGACCACG AGAGAACCTG CACCCGCCAC CACGCCGACG GTCAGCGGTG CCGCAACCGC
ACGAGTTCCC CGGACGGCTG GTGCCGGCAG CCCGGATGCG GCGGTTACCG CACCAGGCGG
GCCCGGCGCC CGGAGGGCCA GCAGACCGTG CTGGGTTCGC CCGCCGGGCT GGACACCGGA
GCCCGGTTGG AGCCCTCCGT GCGGGCTGCC ACGATCACCG TCAGCGCCGC GGCGCGCGCC
CGCTTCGCGG TCCGCCACAG GGCCACGGCC AGGGAGGCCG AGGTCGAGAT CCGGGCCATG
CTCGGCGACT TCCTGACCGA GGGCCGACAG GCCCGGCGCA CGGACGGGTA CTGGCACCTG
GAGCGCGACG GCTACCGGTT GGTCCTGGAC CGGTCCGCCG CGTCGGTGGT CGACTACCAG
ACCGTGCACG CCGAACGCAG CTGGGCCCAG CACAGGGCGG GGATCGACTC CCGGATCTCC
CAGCGCACAA GGCACAACGA CACCACCACA AGGCACCGTG ACACCGCGAC CGAGCGGATC
GAGGAGCAGC CGATGAGCGC GCCCACACCA CCGGTTCCGC CGCAGCCCGG GCCCCAGGAC
CCCACCGCGG CCGAGCACCT GCGCCTCTTC CTGGCCGGAA CCGCACAGCG GGAGGAGGCC
CGGGACCAGA GCGTGTACGG CTTCCTGCGC CACAGCCTCA TCGCCGACCT GTACCGGGCG
GGGAGCCGAC CGGACGACCA AGAGAACGGC GACGTACTCT GCCATCTCTC CGGCCTGTCC
GTGCTGTACC GCGTTCTGCC CGAGGAGGAC ACGGGTTACG AAAGGCTGCG CCGCGAGGCC
CTGGAGTTGC TGGAGGCGCG CTGGGCCCGG GGAGCGGAAG CCGACCGGGT GTGCCTCGTC
CTCCCCGCTC CTCCTGAGGA GGACTGGTCG GCCCCGGCGC TGCTCGGCGC CCTGGGGGTG
TCGGTGATCT GGCGGGAGGC GGACACGTGG CGCGGCGAGA ACGCGGCCCT GATCGCCGGG
GAGCGCACGC ACGCCGACCG GGCCTGA
 
Protein sequence
MGSNARQRII EQEQTAVDRA HRCLERQRGQ TTRLATADAA ASAKDSVAQH EEYARWVAQY 
ELGGQQLVVQ RVDLQEESGD ETFYVGRRSV RDEDGNVFVV KWSSPAAVRW RRERGTEKGA
VTLRRRLRCH GERVVDYHDE LVRKADEPSS EQASPAQVAV AIRARKEQEA GTAQDPFLLR
ELDRSRDGLM RDIVETIHRD QLDLVCHDRP GALVVQGGPG TGKTAIGLHR VTWLLDNDHF
TPGQILVVGP SQYFLDYVSE VLPSLGTRGV TSLRVDDLCP GRSGGHDTAE QHRIKSDARM
AGVLRQAVRQ TVRAGAWSKF LDGDQLRIRV DESPLAVAEA DIERIFDEVL ESDAPLNTRR
QRFTDLLVDR MLDQVSYRHR RQGQALRRRI TSSLAGPVNA TWPRISENRL YRRLLGDAQV
LGQASEGVLT AEEQSALYRL TAAQTGQESW TSADLLCLEE LRILLTGDTP DRYRHIVVDE
AQNLTPMQLR ALARRCPSGS LTILGDLAQS TGTHSHTDWA ALTDHLELPD GWELQELTLG
YRIPSQVMHT AVPAAVAASE LTTFPETLRE PRDGELTMAR LAPEDLIDGV RERATELLAK
GGERSVAVIA DDASPHLKSI TDALATGPAP AEGTVRALAA SDVSGLEFDH VILVEPRQIS
DAGPGGHGRL YVALTRCTQT LTVLHTGSLP DTLVDPFAPV TDHERTCTRH HADGQRCRNR
TSSPDGWCRQ PGCGGYRTRR ARRPEGQQTV LGSPAGLDTG ARLEPSVRAA TITVSAAARA
RFAVRHRATA REAEVEIRAM LGDFLTEGRQ ARRTDGYWHL ERDGYRLVLD RSAASVVDYQ
TVHAERSWAQ HRAGIDSRIS QRTRHNDTTT RHRDTATERI EEQPMSAPTP PVPPQPGPQD
PTAAEHLRLF LAGTAQREEA RDQSVYGFLR HSLIADLYRA GSRPDDQENG DVLCHLSGLS
VLYRVLPEED TGYERLRREA LELLEARWAR GAEADRVCLV LPAPPEEDWS APALLGALGV
SVIWREADTW RGENAALIAG ERTHADRA