Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2898 |
Symbol | |
ID | 9246749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3461219 |
End bp | 3464503 |
Gene Length | 3285 bp |
Protein Length | 1094 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | transcriptional regulator, winged helix family |
Protein accession | YP_003680815 |
Protein GI | 297561841 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.343101 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGTTCT CCATCCTCGG GCCCCTGGCG GTCCACGACG CGACCGGGCG ACCCGTCGCC ATCGGCGGTG CGCGCCTGCG CACGCTGCTC ACCCTCCTCC TGCTCCGGCC CGGCCAGCGG ATCGCCAACG ACGAGCTCAC CGACGCCGTC TGGGCGGGGA GCCCGCCCGC TGCGGCGGGC AATGCCCTCC AGGCCCTGGT CTCCCGCCTG CGCCGCGCGC TGGGCGAGGG CGCGCGCATC GACGGGGACG CGTCGGGGTA CCGGTTGGCG GTCGAGCCCG GCCAGGTGGA CCTGGCCGAG TTCGAGTCCC TGGTCAGGCG GGGGCGGGCC GGGCTCGTCG CGGGCAGGGC CGCCGACGCC GCCCGCGACC TGGGCGAGGC CCTCGCCCTG TGGCGCGGAC CGGCCCTGTC GGACCTGACC GCGCACGGTC TGGCCGAGGA CACGGCGCTG CGCCTGGCCG AGACCCGCCG AGCGGCCCTG GAGGACCGCC TGACCGCCCT GGCCGACCTC GGGCTGTACG CCGAGGTCCT GCCCGAGGCG GAGGCCCTGT GCCGAAGCGA GCCGCACCGC GAGGGCCCCC TCGCGCTCCT CGTGCGCGCG CTGGCCGCCA CCGGCCGCAC GGCCGACGCC CTGGCCGCCT ACGAGCGCTT CCGCTCCCAC CTGGCCGACG AGCTGGGCCT GGACCCCTCG CCCCAGCTGC GCGACCTGCA CCTGAGGCTG CTGCGCGGCG AACTCGACGC GTCCCCTGCC GCCGCGCCCT CCGCCCCGCC CCCGGCTCCG GCCCCGCCCC TGCGCCTGCC CGCCTCCCTG ACCAGCTTCG TGCCGCGCGA CACCGAGGTG GACACCGCCG TCGACCTGCT CATCCGCGAA CGCCTGGTCA CCCTGCTGGG CCCCGGCGGC GCGGGCAAGA CCCGCCTGGC GATCGAGAGC GCCTCCGCGC TGGCCGCGCG GGCTCCCTCC CTGCTCTCCC GCGGCGGCTG GTTCGTCGAA CTCGCCTCCA GGGCGGCCGC GGACGTCCCC CAGGCACTGG CCTCCGCGCT GGAGCTGCGC GAGCACGCCG TGCTCCAGGC GCGCTCCGCG GCCCCCAACG CCCCCGCCGC CCTCGTTCCG CTCCTGGAAC GGGTGGTCTC CTTCGTCGGC GACCGCCACG TCCTCCTCGT CCTGGACAAC TGCGAGCACA TCGTCGAGGA GGTCGCCTCC GCCGTGGCGA CGCTGCTGGC CCGCTGCCCC GGCCTGCGGA TCCTGGCCAC CTCCCGCGAA CCCCTGGGGG TGCCCGGCGA ACAGCTCCTG ACCGTCCCCT CCCTGGACAT GCCGCCCGAG GGGGCCTCCG CCGACCGGGC CGCCGCCTGC TCCTCGGTCG TCCTGTTCGC CGAACGCGCC GCCGCCGTGC GTCCGGGCTT TCGCGTCACC CCCGACAACG CCGCCCACGT CGTCCGCGTC GTCCGCGAGC TGGACGGCCT GCCGCTGGCC CTGGAGCTGG CCGCCGCGCG CCTGCGCTCC ATGAGCACCG CCCAGCTCGC CGACCGGCTC CGCGACCGCT TCCGGCTCCT CACCGGGGGC GCCCGGTCGA CGCTGCCGCG CCACCGCACC CTGCGCGCCG TCGTCGACTG GAGCTGGGAC CTGCTCGACG AACCCGAGCG CCGCCTGCTG CGCCGCCTGT CGATCTTCGC CGGGGGAGCC ACCCTGGAGG CCGTCGAACG GGTCTGCGCC GACCCCGGCA CCGAGGGGGA GATCGGCGGC CACGACGCGT GGACCGTCCT GTTCGCCCTG GTCGACAAGT CCCTGGTGAT CGCCGAGAAC CCCGACCGTG ACGACACCCC GCCCCGCTAC CGGCAGCTGG AGACCGTGCG TGCCTACGCC GCCGAACGCC TGGCGAGCAG CGGGGAGGAG GAGCGCGTGC GCGACGCGCA CGCCCGCCAC GTCCGCGACC TGTGGCGCTG GGCCGACCCG CTGCTGCGCG GCCCGCGCCA GGGGGAGCTG CTCGCCCGGC TGGCCGCGGA GGCCGACAAC TGCGGCGCCG CCGTGCGCTG GGCCGTCGAG CGGCGCGACG CCGGACTGGC CCTGGACCTG GTCGAGTGCA CCCAGTGGTA CTGGACCCTG TGCGGCTCCT GGCGCCAGCT CCACCAGTGG GCCGTGGACG TCCTGGACAT GGTCGGCGAC CGGGTGCCCG AGGGGCGCGC CGTGGCCTAC GCCAGCTGCC TGTTCCAGCG GGCCGACACG ACCACCGACC ACGAGTCGGT GCTGGAGCGC ATACGCGAGG TCGAGGCCGT CCTGGAAGAG GCCGGACAGC GGGCCGAGGA GCACCCCATG CTCGTCTACG GCCTGGTGTA CAGGGCGCTG CTGGAGGGGA CGACCGGCGC CGCCCACGAA CGCCTCGCCG CCGCGGCCGA CCAGGCCGAC CCGTGGATGC GGGCCCTCGT GGGGGTGCTG CTGTCGCTGT TCGACGCGGT CAACGGGCGC ACCGGGCGGT CCATGGAGCG CGCGAACGCC GCCCTGGAGC AGTTCCGCGC GTGCGGCGAC ACCTGGGGCG AGTGCCAGGC GCTCGTCCAG GCCGTGGACC TGTACCGGTT CGAGGACCTC GACCGCTGCC GCGACCTGCT CACCCTCGGC GTGCGCAGGA CCGAGGAGGC GGGGCTGGAG GCGCTGGACT GGATGTTCCG CGTCCGCCGG GCCCAGGTCC TCACCGACCT CGGCGACCTG GAGGCCGCGC GTGAGGACCT GCGGGGGCTC CTCGGCTCCG AGCGGCCCGT GGAGAAGGAA CAGATGGTGC TGCTGCGCCT GGCCGAGGGC CAGTGGCTGC GGGAGGCGGG GGAGCCGGAC GCGGCCCGCG AGGTCCTGGA CCGGGCGGGC GAGGACCTCA AGGGCCTGGG CGGGTTCTCG CCCGTCTACG TGGAGGCGGG CTGGCGGACC CTGTACACGA CCGTCGCCTG GAGGGCCGGT GACACCGGGG AGGCCTGGGA GCACGCCCGG CGCGCCTGGC GGCTGGCCGA CCACGGCCTG GGCCCGGTGT GCGCGGAGGT CCTGGACACC TTCGCCGTGA TGGCCGTCGG ACACGACCCG AGGCGCGGCG CCTGGCTGCT GGCCTGCGCC GAGGTGCTGC GCGGCATGCC CGACACCGCC ACGCCCCTCG TGGTGCGGGC CCGCGAGAGC GCGCGCCGGG AGCTGGGCGG ACGGGAGTAC GACCTCGTCC TCGCCGGGGT CCGGGATGTG GGCGCCGACC GGATCCGCGG GCTCGTGGAC GCCTGGCTGG CCGAGGGCGC GCCCGGTGGC GCGGAGCGCC CCTGA
|
Protein sequence | MRFSILGPLA VHDATGRPVA IGGARLRTLL TLLLLRPGQR IANDELTDAV WAGSPPAAAG NALQALVSRL RRALGEGARI DGDASGYRLA VEPGQVDLAE FESLVRRGRA GLVAGRAADA ARDLGEALAL WRGPALSDLT AHGLAEDTAL RLAETRRAAL EDRLTALADL GLYAEVLPEA EALCRSEPHR EGPLALLVRA LAATGRTADA LAAYERFRSH LADELGLDPS PQLRDLHLRL LRGELDASPA AAPSAPPPAP APPLRLPASL TSFVPRDTEV DTAVDLLIRE RLVTLLGPGG AGKTRLAIES ASALAARAPS LLSRGGWFVE LASRAAADVP QALASALELR EHAVLQARSA APNAPAALVP LLERVVSFVG DRHVLLVLDN CEHIVEEVAS AVATLLARCP GLRILATSRE PLGVPGEQLL TVPSLDMPPE GASADRAAAC SSVVLFAERA AAVRPGFRVT PDNAAHVVRV VRELDGLPLA LELAAARLRS MSTAQLADRL RDRFRLLTGG ARSTLPRHRT LRAVVDWSWD LLDEPERRLL RRLSIFAGGA TLEAVERVCA DPGTEGEIGG HDAWTVLFAL VDKSLVIAEN PDRDDTPPRY RQLETVRAYA AERLASSGEE ERVRDAHARH VRDLWRWADP LLRGPRQGEL LARLAAEADN CGAAVRWAVE RRDAGLALDL VECTQWYWTL CGSWRQLHQW AVDVLDMVGD RVPEGRAVAY ASCLFQRADT TTDHESVLER IREVEAVLEE AGQRAEEHPM LVYGLVYRAL LEGTTGAAHE RLAAAADQAD PWMRALVGVL LSLFDAVNGR TGRSMERANA ALEQFRACGD TWGECQALVQ AVDLYRFEDL DRCRDLLTLG VRRTEEAGLE ALDWMFRVRR AQVLTDLGDL EAAREDLRGL LGSERPVEKE QMVLLRLAEG QWLREAGEPD AAREVLDRAG EDLKGLGGFS PVYVEAGWRT LYTTVAWRAG DTGEAWEHAR RAWRLADHGL GPVCAEVLDT FAVMAVGHDP RRGAWLLACA EVLRGMPDTA TPLVVRARES ARRELGGREY DLVLAGVRDV GADRIRGLVD AWLAEGAPGG AERP
|
| |