Gene Ndas_2898 details


Gene Information

Locus tag: Ndas_2898
Symbol:
ID: 9246749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3461219
End bp: 3464503
Gene Length: 3285 bp
Protein Length: 1094 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: transcriptional regulator, winged helix family
Protein accession: YP_003680815
Protein GI: 297561841
COG category:
COG ID:
TIGRFAM ID:
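
The start, end, and length fields above agree with one another under the usual convention of 1-based inclusive coordinates. The following is a minimal Python sketch of that arithmetic, assuming that convention and a single terminal stop codon; the helper name cds_lengths is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and exactly one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Coordinates taken from the record above.
gene_len, protein_len = cds_lengths(3461219, 3464503)
print(gene_len, protein_len)  # 3285 1094
```

Both values match the Gene Length and Protein Length fields reported for Ndas_2898.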


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.343101
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGTTCT CCATCCTCGG GCCCCTGGCG GTCCACGACG CGACCGGGCG ACCCGTCGCC 
ATCGGCGGTG CGCGCCTGCG CACGCTGCTC ACCCTCCTCC TGCTCCGGCC CGGCCAGCGG
ATCGCCAACG ACGAGCTCAC CGACGCCGTC TGGGCGGGGA GCCCGCCCGC TGCGGCGGGC
AATGCCCTCC AGGCCCTGGT CTCCCGCCTG CGCCGCGCGC TGGGCGAGGG CGCGCGCATC
GACGGGGACG CGTCGGGGTA CCGGTTGGCG GTCGAGCCCG GCCAGGTGGA CCTGGCCGAG
TTCGAGTCCC TGGTCAGGCG GGGGCGGGCC GGGCTCGTCG CGGGCAGGGC CGCCGACGCC
GCCCGCGACC TGGGCGAGGC CCTCGCCCTG TGGCGCGGAC CGGCCCTGTC GGACCTGACC
GCGCACGGTC TGGCCGAGGA CACGGCGCTG CGCCTGGCCG AGACCCGCCG AGCGGCCCTG
GAGGACCGCC TGACCGCCCT GGCCGACCTC GGGCTGTACG CCGAGGTCCT GCCCGAGGCG
GAGGCCCTGT GCCGAAGCGA GCCGCACCGC GAGGGCCCCC TCGCGCTCCT CGTGCGCGCG
CTGGCCGCCA CCGGCCGCAC GGCCGACGCC CTGGCCGCCT ACGAGCGCTT CCGCTCCCAC
CTGGCCGACG AGCTGGGCCT GGACCCCTCG CCCCAGCTGC GCGACCTGCA CCTGAGGCTG
CTGCGCGGCG AACTCGACGC GTCCCCTGCC GCCGCGCCCT CCGCCCCGCC CCCGGCTCCG
GCCCCGCCCC TGCGCCTGCC CGCCTCCCTG ACCAGCTTCG TGCCGCGCGA CACCGAGGTG
GACACCGCCG TCGACCTGCT CATCCGCGAA CGCCTGGTCA CCCTGCTGGG CCCCGGCGGC
GCGGGCAAGA CCCGCCTGGC GATCGAGAGC GCCTCCGCGC TGGCCGCGCG GGCTCCCTCC
CTGCTCTCCC GCGGCGGCTG GTTCGTCGAA CTCGCCTCCA GGGCGGCCGC GGACGTCCCC
CAGGCACTGG CCTCCGCGCT GGAGCTGCGC GAGCACGCCG TGCTCCAGGC GCGCTCCGCG
GCCCCCAACG CCCCCGCCGC CCTCGTTCCG CTCCTGGAAC GGGTGGTCTC CTTCGTCGGC
GACCGCCACG TCCTCCTCGT CCTGGACAAC TGCGAGCACA TCGTCGAGGA GGTCGCCTCC
GCCGTGGCGA CGCTGCTGGC CCGCTGCCCC GGCCTGCGGA TCCTGGCCAC CTCCCGCGAA
CCCCTGGGGG TGCCCGGCGA ACAGCTCCTG ACCGTCCCCT CCCTGGACAT GCCGCCCGAG
GGGGCCTCCG CCGACCGGGC CGCCGCCTGC TCCTCGGTCG TCCTGTTCGC CGAACGCGCC
GCCGCCGTGC GTCCGGGCTT TCGCGTCACC CCCGACAACG CCGCCCACGT CGTCCGCGTC
GTCCGCGAGC TGGACGGCCT GCCGCTGGCC CTGGAGCTGG CCGCCGCGCG CCTGCGCTCC
ATGAGCACCG CCCAGCTCGC CGACCGGCTC CGCGACCGCT TCCGGCTCCT CACCGGGGGC
GCCCGGTCGA CGCTGCCGCG CCACCGCACC CTGCGCGCCG TCGTCGACTG GAGCTGGGAC
CTGCTCGACG AACCCGAGCG CCGCCTGCTG CGCCGCCTGT CGATCTTCGC CGGGGGAGCC
ACCCTGGAGG CCGTCGAACG GGTCTGCGCC GACCCCGGCA CCGAGGGGGA GATCGGCGGC
CACGACGCGT GGACCGTCCT GTTCGCCCTG GTCGACAAGT CCCTGGTGAT CGCCGAGAAC
CCCGACCGTG ACGACACCCC GCCCCGCTAC CGGCAGCTGG AGACCGTGCG TGCCTACGCC
GCCGAACGCC TGGCGAGCAG CGGGGAGGAG GAGCGCGTGC GCGACGCGCA CGCCCGCCAC
GTCCGCGACC TGTGGCGCTG GGCCGACCCG CTGCTGCGCG GCCCGCGCCA GGGGGAGCTG
CTCGCCCGGC TGGCCGCGGA GGCCGACAAC TGCGGCGCCG CCGTGCGCTG GGCCGTCGAG
CGGCGCGACG CCGGACTGGC CCTGGACCTG GTCGAGTGCA CCCAGTGGTA CTGGACCCTG
TGCGGCTCCT GGCGCCAGCT CCACCAGTGG GCCGTGGACG TCCTGGACAT GGTCGGCGAC
CGGGTGCCCG AGGGGCGCGC CGTGGCCTAC GCCAGCTGCC TGTTCCAGCG GGCCGACACG
ACCACCGACC ACGAGTCGGT GCTGGAGCGC ATACGCGAGG TCGAGGCCGT CCTGGAAGAG
GCCGGACAGC GGGCCGAGGA GCACCCCATG CTCGTCTACG GCCTGGTGTA CAGGGCGCTG
CTGGAGGGGA CGACCGGCGC CGCCCACGAA CGCCTCGCCG CCGCGGCCGA CCAGGCCGAC
CCGTGGATGC GGGCCCTCGT GGGGGTGCTG CTGTCGCTGT TCGACGCGGT CAACGGGCGC
ACCGGGCGGT CCATGGAGCG CGCGAACGCC GCCCTGGAGC AGTTCCGCGC GTGCGGCGAC
ACCTGGGGCG AGTGCCAGGC GCTCGTCCAG GCCGTGGACC TGTACCGGTT CGAGGACCTC
GACCGCTGCC GCGACCTGCT CACCCTCGGC GTGCGCAGGA CCGAGGAGGC GGGGCTGGAG
GCGCTGGACT GGATGTTCCG CGTCCGCCGG GCCCAGGTCC TCACCGACCT CGGCGACCTG
GAGGCCGCGC GTGAGGACCT GCGGGGGCTC CTCGGCTCCG AGCGGCCCGT GGAGAAGGAA
CAGATGGTGC TGCTGCGCCT GGCCGAGGGC CAGTGGCTGC GGGAGGCGGG GGAGCCGGAC
GCGGCCCGCG AGGTCCTGGA CCGGGCGGGC GAGGACCTCA AGGGCCTGGG CGGGTTCTCG
CCCGTCTACG TGGAGGCGGG CTGGCGGACC CTGTACACGA CCGTCGCCTG GAGGGCCGGT
GACACCGGGG AGGCCTGGGA GCACGCCCGG CGCGCCTGGC GGCTGGCCGA CCACGGCCTG
GGCCCGGTGT GCGCGGAGGT CCTGGACACC TTCGCCGTGA TGGCCGTCGG ACACGACCCG
AGGCGCGGCG CCTGGCTGCT GGCCTGCGCC GAGGTGCTGC GCGGCATGCC CGACACCGCC
ACGCCCCTCG TGGTGCGGGC CCGCGAGAGC GCGCGCCGGG AGCTGGGCGG ACGGGAGTAC
GACCTCGTCC TCGCCGGGGT CCGGGATGTG GGCGCCGACC GGATCCGCGG GCTCGTGGAC
GCCTGGCTGG CCGAGGGCGC GCCCGGTGGC GCGGAGCGCC CCTGA
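
The GC content field can be checked directly against the gene sequence. The following is a minimal Python sketch, assuming the sequence is pasted as plain text in the space-separated 10-base blocks shown above; gc_percent is a hypothetical helper, not a function provided by the source database.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq.

    Whitespace and any other non-ACGT characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence above, as an illustration.
print(gc_percent("GTGCGGTTCT CCATCCTCGG"))  # 65.0
```

Applying the same function to the full 3285 bp sequence gives a value that can be compared with the 77% reported in the Gene Information section.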
 
Protein sequence
MRFSILGPLA VHDATGRPVA IGGARLRTLL TLLLLRPGQR IANDELTDAV WAGSPPAAAG 
NALQALVSRL RRALGEGARI DGDASGYRLA VEPGQVDLAE FESLVRRGRA GLVAGRAADA
ARDLGEALAL WRGPALSDLT AHGLAEDTAL RLAETRRAAL EDRLTALADL GLYAEVLPEA
EALCRSEPHR EGPLALLVRA LAATGRTADA LAAYERFRSH LADELGLDPS PQLRDLHLRL
LRGELDASPA AAPSAPPPAP APPLRLPASL TSFVPRDTEV DTAVDLLIRE RLVTLLGPGG
AGKTRLAIES ASALAARAPS LLSRGGWFVE LASRAAADVP QALASALELR EHAVLQARSA
APNAPAALVP LLERVVSFVG DRHVLLVLDN CEHIVEEVAS AVATLLARCP GLRILATSRE
PLGVPGEQLL TVPSLDMPPE GASADRAAAC SSVVLFAERA AAVRPGFRVT PDNAAHVVRV
VRELDGLPLA LELAAARLRS MSTAQLADRL RDRFRLLTGG ARSTLPRHRT LRAVVDWSWD
LLDEPERRLL RRLSIFAGGA TLEAVERVCA DPGTEGEIGG HDAWTVLFAL VDKSLVIAEN
PDRDDTPPRY RQLETVRAYA AERLASSGEE ERVRDAHARH VRDLWRWADP LLRGPRQGEL
LARLAAEADN CGAAVRWAVE RRDAGLALDL VECTQWYWTL CGSWRQLHQW AVDVLDMVGD
RVPEGRAVAY ASCLFQRADT TTDHESVLER IREVEAVLEE AGQRAEEHPM LVYGLVYRAL
LEGTTGAAHE RLAAAADQAD PWMRALVGVL LSLFDAVNGR TGRSMERANA ALEQFRACGD
TWGECQALVQ AVDLYRFEDL DRCRDLLTLG VRRTEEAGLE ALDWMFRVRR AQVLTDLGDL
EAAREDLRGL LGSERPVEKE QMVLLRLAEG QWLREAGEPD AAREVLDRAG EDLKGLGGFS
PVYVEAGWRT LYTTVAWRAG DTGEAWEHAR RAWRLADHGL GPVCAEVLDT FAVMAVGHDP
RRGAWLLACA EVLRGMPDTA TPLVVRARES ARRELGGREY DLVLAGVRDV GADRIRGLVD
AWLAEGAPGG AERP