Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1372 |
Symbol | |
ID | 9245222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1683854 |
End bp | 1687537 |
Gene Length | 3684 bp |
Protein Length | 1227 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | |
Product | transcriptional regulator, winged helix family |
Protein accession | YP_003679310 |
Protein GI | 297560336 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.291577 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTTCG GTGTCCTGGG TCCCGTCCGC GCGTTCACGG ACGGCGGCCG GGCCGTGCCG ATCCCCGAAC GCAAGGCCCG CGTCCTGCTG GCGGCCCTGC TCGCCCACCG GGGCTGGCCG GTGTCGGCGG ACCGGCTCGT GGAGTGGCTG TGGGCGGACG GGCCCCGGCC CGGCAGCCCC GAACGCGCCC TGCAACGCAA GGTGTGGGCC CTGCGCCGGG CGCTGGAGGA GGCCGAACCC GGCGCCCGCG ACCTGGTCCG CCACCGCCCG CCCGGCTACC TCCTCGACGT GCCTCCCGCC TCCGTGGACG CCGAGCGCTT CCACCTGCTG GCGGACGCGG CCCAGGACGC CGCCGGTCCC CGCGAACGGG CCCGGCTGCT CACCGAGGCC CTGGAACTGT GGCGGGGCCC CGCCTACGCC GACGTGGCCG ACGAGGAGTT CGCCCGGCCG ACGGCCGCCC GGCTGGAGGA GCGCCGCCTG GGCGCGCTGG AGGAACACGC CCTCACCCGG CTCGACCTGG GCGAGCACCG CGGGGTGACC GGTCCGCTGG GCGCCCTCGT GGCCGAGCAC CCCCTGCGCG AGCGGCTGGT GGCCGCGTAC ATGCGCGCCC TGTACGAGGG CGGACGCCAG GCCGAGGCAC TGGCCGCGCA CACGGCCCTG GCCGGGCGCC TGCGGGAGGA GCTGGGGGTG GACCCCGGCC CCGAGGTGGC CGAGCTGCAC GGCCGCATCC TGCGGCACCA GCCGACCTCA CCGGTTTCCG GGGGCGTCTC CGAGTCCGGA ACACCGGCGG AAATCGTGCC CGCGACGGCA CGCGCCTCCC GAGTTCCGAC GGAACCCGCG CCGCGCGCGG GGGCCGTCCG CGCGACGCCG GGAGCGCCTG TGCCGGGGCC GGGAGGCGCC GCCGTGGCAC CCGGGGATCC GTTCCCGCAG GCAGGGAGCT TCCACGGGAC GCCGGAGGAG CCCTTTTCCG CGCCGGGGAC CGCGTCCGGA CCGCCGGGGG AGCCGGTGCC GCAGGTGGAG GGCGTCCACG GGGCGCCGGA GGCACCTGCC TCCGGTACGG GGAACACGCC CGGAACGGAC TCGAACGTTC CTCCCCGGGC GGGGGACGCC CGTGGAACAC CGGAGGCGCC CGCTCCCGGA CCGGGGACCC CGCCCGGAAC CCGGGCCGCG CCCTCCGCCG GCGCCACCCC GGCGCACGCG CTCACCGGCC GCGCCTTCCC GCCCCTGCCG CTCACCGCGC TCGTGGGCCG CGGGGAGGAG CAGCGGCGGG TGGCACGGCT GCTGGCGGAG GTCCGCCTGG TCACCCTCAC CGGGATCGGC GGGGTCGGCA AGACCCGCCT GGCCCTCGCG ATCGCGCACG AGCTCGCCCC CGGCTTCGGC GGCGGGGCGC ACATGGTCGA GTTCGCGGCC CAGCGGGCCG CGCCGGGCTC CCCCGCGTCC GAACCGGACC CGGTCACGGC CCTGGCCCGG GCACTCGGGG TCCGCGACGG CGGCGGGACG GACCCCCTGC GCCACGTGCT GGGCGCGCTG GAGGGCAGGC ACGCCCTGCT CGTCCTGGAC AACTGCGAGC ACCTCGCGGG CGAGGTCGCC GACCTGGTCT CGGCCCTGCT CGGCCGCCTG CCGGACCTGC GCGTCCTGAC CACCAGCCGC GAACCCCTCG GCGTGCCCGG AGAGGTCCTG TTCGGCGTCG AACCCCTGCC CGTGCCCTCC GCCGACACAC CGGCCGACCG CGTCGGCGAC TCCGGCGCGG TGCGCCTGTT CGCCGAACGC GCCGCCGCCT CCGCCCCCGG GTTCGCCCTG ACCCCGGACA ACGCCGCCGA CGTCGCCCTG CTGTGCCGCC GCCTCGACGG CATCCCCCTG GCCCTGGAGC TGGCCGCCAC CCGCGTGCGC GCCCTCGGCG TCACCGGTGT GCTCTCGCGC CTGGACGACC GCTTCCGGCT GCTGGCCACC GCCCGCCGCC ACCTGCCCCC GCGCCAGCGC ACCCTGCGCG CCATGGTCGA CTGGAGCTGG GAACTGCTCG ACGAGCGCGA GCGCGTCCTG CTGCGCCGCC TCGCCGTGTT CACCGGCGGG TGCGCCCCCG AGGACGCCGA GGCCGTCTGC TCGGGCGGGG GCATCGACGC CGCCGACGTC GTGGACCTGC TCACGAGCCT GGTCGACCGC TCCCTCGTCG CGGCCGTGGA CGACCCCCTC ACCGGCCGCC GCCACCGGCT CCTGGAGTCC GTCGCCGACT ACGCCTGCCA GCGCCTGTCC GAGGCCGGTG AGGCGCACGT CCTGCGCGGG CGCCACCTGG ACCACTACAC CGCCCTCGCC CGCCGGTACG CGGGACGGAT ACTGGGCCCC TCCCAGGGGG AGGCGCTCCG GCGCCTGGAC GCGGAGGCGT CCAACCTGCG CGCGGCCCTG GACGAGGCGG TGGTGCGCGG AGCCGGAGGG CACGCCGCAC GCCTCGTCAA CTCCCTGGGC TGGTACTGGT ACATGCGCGG CCGCTACCGG GAGGGCCGCA GCCTGACCCG GCGCGTCCGC GACGCGGTGC GCGGGTCCGC CTCCACGGAG GCGGCCATGG CCGGGGCGAC CGCCGCGGTG TTCGAGATCC TGGCCGGGGA CGGGGGTGAC CACACCGCCC CGGCCCGCAC CGCGCTCGGG GCCTTCGACG GCCTGCCCGG GGCGTCCGGC GACGCCGTGC TCGAACGCGC CCGCGCCGCC TGGATGCTCG GGTTCGTGCT GTACAGCCGG GGGGACCGGG CCGTCAGCGA GGACCTGGTC ACCCGGGCGC TGGCCGTCTT CCGCGAGCGC GACGACCGCT GGGGGCTGGC CGCCGCCCTG ACCGCACGGG CCTCGCACGC CCTGGGACGC GGGGACCTGG ACGCGGCGGG CGTCCGCGGG CGTGAGGCGC TGGAACTCTT CCGCGGACTC GGGGACCGCT GGGGCCAGCT GAACGCCCTG ACCGTCCTGG CGACGCCGGC CGAGGTCACC GGCCGCCTCG CCGACGCGGC CCGCTTCCGC CGGGAGGCCC TGGGCATGGC CGAGGAGCTG GAGCTGTGGT CCGAGGCCGC CGGGGCCACG GCCGGGCTGG GCCGGATCGC CCTGCTGGAG GGGGACCTCG ACCGCGCCGA CGAACTCCAC CGCAGGGCGC TGGACCTGGT GCGGGGGCAG GGGGACGTGC CGGGGGAGCA GTACGCCCGG CTCGGTCTGG GGCTCAGCGC GCGGCGCCGG GGAAGACTGG AGGAGGCCGA GCGGTACGTG CGCCCGATCG CGGAGTGGTC GGCCCGGGTG GGCTGGCTGC CGGGCGCCGC CCTGGCCCTG GCCGAACTGG GCTTCTCGGC GGAACTGCGG GGCGACGCGG CCGAGGCGCT GCGCCTGCAC CGGGAGGGGC TGGCCGCGGC CCGCCTCAGC GGCGACCCGC GCGCGCTGGC GCTGGCGCTG GAGGGCGTGG CCGCCGCGCA CACGCTCACC GGCCGCCACG GCGAGGCCGC CGGGCTCCTG GGCGCCGCCG AGGCCCTGCG CGAGGGCGCG GGCGCGCCGG CGCCCGCCGC TGAACGGGGC GACGTGGACC GCGCGACGGC GCGCGCACGG GCGGCCCTGG GCGAGGCGGA GTTCGCGCGG GCGTTCGCCT GGGGGCGGAC GCGCCCGCCC GCGGACCTCG CGGACCTCCT CGACGGCGGG GAGCAGCGCC CCGACCCGGC CTGA
|
Protein sequence | MRFGVLGPVR AFTDGGRAVP IPERKARVLL AALLAHRGWP VSADRLVEWL WADGPRPGSP ERALQRKVWA LRRALEEAEP GARDLVRHRP PGYLLDVPPA SVDAERFHLL ADAAQDAAGP RERARLLTEA LELWRGPAYA DVADEEFARP TAARLEERRL GALEEHALTR LDLGEHRGVT GPLGALVAEH PLRERLVAAY MRALYEGGRQ AEALAAHTAL AGRLREELGV DPGPEVAELH GRILRHQPTS PVSGGVSESG TPAEIVPATA RASRVPTEPA PRAGAVRATP GAPVPGPGGA AVAPGDPFPQ AGSFHGTPEE PFSAPGTASG PPGEPVPQVE GVHGAPEAPA SGTGNTPGTD SNVPPRAGDA RGTPEAPAPG PGTPPGTRAA PSAGATPAHA LTGRAFPPLP LTALVGRGEE QRRVARLLAE VRLVTLTGIG GVGKTRLALA IAHELAPGFG GGAHMVEFAA QRAAPGSPAS EPDPVTALAR ALGVRDGGGT DPLRHVLGAL EGRHALLVLD NCEHLAGEVA DLVSALLGRL PDLRVLTTSR EPLGVPGEVL FGVEPLPVPS ADTPADRVGD SGAVRLFAER AAASAPGFAL TPDNAADVAL LCRRLDGIPL ALELAATRVR ALGVTGVLSR LDDRFRLLAT ARRHLPPRQR TLRAMVDWSW ELLDERERVL LRRLAVFTGG CAPEDAEAVC SGGGIDAADV VDLLTSLVDR SLVAAVDDPL TGRRHRLLES VADYACQRLS EAGEAHVLRG RHLDHYTALA RRYAGRILGP SQGEALRRLD AEASNLRAAL DEAVVRGAGG HAARLVNSLG WYWYMRGRYR EGRSLTRRVR DAVRGSASTE AAMAGATAAV FEILAGDGGD HTAPARTALG AFDGLPGASG DAVLERARAA WMLGFVLYSR GDRAVSEDLV TRALAVFRER DDRWGLAAAL TARASHALGR GDLDAAGVRG REALELFRGL GDRWGQLNAL TVLATPAEVT GRLADAARFR REALGMAEEL ELWSEAAGAT AGLGRIALLE GDLDRADELH RRALDLVRGQ GDVPGEQYAR LGLGLSARRR GRLEEAERYV RPIAEWSARV GWLPGAALAL AELGFSAELR GDAAEALRLH REGLAAARLS GDPRALALAL EGVAAAHTLT GRHGEAAGLL GAAEALREGA GAPAPAAERG DVDRATARAR AALGEAEFAR AFAWGRTRPP ADLADLLDGG EQRPDPA
|
| |