Gene Ndas_1372 details


Gene Information

Locus tag: Ndas_1372
Symbol:
ID: 9245222
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1683854
End bp: 1687537
Gene Length: 3684 bp
Protein Length: 1227 aa
Translation table: 11
GC content: 79%
IMG OID:
Product: transcriptional regulator, winged helix family
Protein accession: YP_003679310
Protein GI: 297560336
COG category:
COG ID:
TIGRFAM ID:
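
The Start bp, End bp, Gene Length and Protein Length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1683854, 1687537
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 3684 1227
```

Both results agree with the Gene Length (3684 bp) and Protein Length (1227 aa) fields.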


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.291577
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTTCG GTGTCCTGGG TCCCGTCCGC GCGTTCACGG ACGGCGGCCG GGCCGTGCCG 
ATCCCCGAAC GCAAGGCCCG CGTCCTGCTG GCGGCCCTGC TCGCCCACCG GGGCTGGCCG
GTGTCGGCGG ACCGGCTCGT GGAGTGGCTG TGGGCGGACG GGCCCCGGCC CGGCAGCCCC
GAACGCGCCC TGCAACGCAA GGTGTGGGCC CTGCGCCGGG CGCTGGAGGA GGCCGAACCC
GGCGCCCGCG ACCTGGTCCG CCACCGCCCG CCCGGCTACC TCCTCGACGT GCCTCCCGCC
TCCGTGGACG CCGAGCGCTT CCACCTGCTG GCGGACGCGG CCCAGGACGC CGCCGGTCCC
CGCGAACGGG CCCGGCTGCT CACCGAGGCC CTGGAACTGT GGCGGGGCCC CGCCTACGCC
GACGTGGCCG ACGAGGAGTT CGCCCGGCCG ACGGCCGCCC GGCTGGAGGA GCGCCGCCTG
GGCGCGCTGG AGGAACACGC CCTCACCCGG CTCGACCTGG GCGAGCACCG CGGGGTGACC
GGTCCGCTGG GCGCCCTCGT GGCCGAGCAC CCCCTGCGCG AGCGGCTGGT GGCCGCGTAC
ATGCGCGCCC TGTACGAGGG CGGACGCCAG GCCGAGGCAC TGGCCGCGCA CACGGCCCTG
GCCGGGCGCC TGCGGGAGGA GCTGGGGGTG GACCCCGGCC CCGAGGTGGC CGAGCTGCAC
GGCCGCATCC TGCGGCACCA GCCGACCTCA CCGGTTTCCG GGGGCGTCTC CGAGTCCGGA
ACACCGGCGG AAATCGTGCC CGCGACGGCA CGCGCCTCCC GAGTTCCGAC GGAACCCGCG
CCGCGCGCGG GGGCCGTCCG CGCGACGCCG GGAGCGCCTG TGCCGGGGCC GGGAGGCGCC
GCCGTGGCAC CCGGGGATCC GTTCCCGCAG GCAGGGAGCT TCCACGGGAC GCCGGAGGAG
CCCTTTTCCG CGCCGGGGAC CGCGTCCGGA CCGCCGGGGG AGCCGGTGCC GCAGGTGGAG
GGCGTCCACG GGGCGCCGGA GGCACCTGCC TCCGGTACGG GGAACACGCC CGGAACGGAC
TCGAACGTTC CTCCCCGGGC GGGGGACGCC CGTGGAACAC CGGAGGCGCC CGCTCCCGGA
CCGGGGACCC CGCCCGGAAC CCGGGCCGCG CCCTCCGCCG GCGCCACCCC GGCGCACGCG
CTCACCGGCC GCGCCTTCCC GCCCCTGCCG CTCACCGCGC TCGTGGGCCG CGGGGAGGAG
CAGCGGCGGG TGGCACGGCT GCTGGCGGAG GTCCGCCTGG TCACCCTCAC CGGGATCGGC
GGGGTCGGCA AGACCCGCCT GGCCCTCGCG ATCGCGCACG AGCTCGCCCC CGGCTTCGGC
GGCGGGGCGC ACATGGTCGA GTTCGCGGCC CAGCGGGCCG CGCCGGGCTC CCCCGCGTCC
GAACCGGACC CGGTCACGGC CCTGGCCCGG GCACTCGGGG TCCGCGACGG CGGCGGGACG
GACCCCCTGC GCCACGTGCT GGGCGCGCTG GAGGGCAGGC ACGCCCTGCT CGTCCTGGAC
AACTGCGAGC ACCTCGCGGG CGAGGTCGCC GACCTGGTCT CGGCCCTGCT CGGCCGCCTG
CCGGACCTGC GCGTCCTGAC CACCAGCCGC GAACCCCTCG GCGTGCCCGG AGAGGTCCTG
TTCGGCGTCG AACCCCTGCC CGTGCCCTCC GCCGACACAC CGGCCGACCG CGTCGGCGAC
TCCGGCGCGG TGCGCCTGTT CGCCGAACGC GCCGCCGCCT CCGCCCCCGG GTTCGCCCTG
ACCCCGGACA ACGCCGCCGA CGTCGCCCTG CTGTGCCGCC GCCTCGACGG CATCCCCCTG
GCCCTGGAGC TGGCCGCCAC CCGCGTGCGC GCCCTCGGCG TCACCGGTGT GCTCTCGCGC
CTGGACGACC GCTTCCGGCT GCTGGCCACC GCCCGCCGCC ACCTGCCCCC GCGCCAGCGC
ACCCTGCGCG CCATGGTCGA CTGGAGCTGG GAACTGCTCG ACGAGCGCGA GCGCGTCCTG
CTGCGCCGCC TCGCCGTGTT CACCGGCGGG TGCGCCCCCG AGGACGCCGA GGCCGTCTGC
TCGGGCGGGG GCATCGACGC CGCCGACGTC GTGGACCTGC TCACGAGCCT GGTCGACCGC
TCCCTCGTCG CGGCCGTGGA CGACCCCCTC ACCGGCCGCC GCCACCGGCT CCTGGAGTCC
GTCGCCGACT ACGCCTGCCA GCGCCTGTCC GAGGCCGGTG AGGCGCACGT CCTGCGCGGG
CGCCACCTGG ACCACTACAC CGCCCTCGCC CGCCGGTACG CGGGACGGAT ACTGGGCCCC
TCCCAGGGGG AGGCGCTCCG GCGCCTGGAC GCGGAGGCGT CCAACCTGCG CGCGGCCCTG
GACGAGGCGG TGGTGCGCGG AGCCGGAGGG CACGCCGCAC GCCTCGTCAA CTCCCTGGGC
TGGTACTGGT ACATGCGCGG CCGCTACCGG GAGGGCCGCA GCCTGACCCG GCGCGTCCGC
GACGCGGTGC GCGGGTCCGC CTCCACGGAG GCGGCCATGG CCGGGGCGAC CGCCGCGGTG
TTCGAGATCC TGGCCGGGGA CGGGGGTGAC CACACCGCCC CGGCCCGCAC CGCGCTCGGG
GCCTTCGACG GCCTGCCCGG GGCGTCCGGC GACGCCGTGC TCGAACGCGC CCGCGCCGCC
TGGATGCTCG GGTTCGTGCT GTACAGCCGG GGGGACCGGG CCGTCAGCGA GGACCTGGTC
ACCCGGGCGC TGGCCGTCTT CCGCGAGCGC GACGACCGCT GGGGGCTGGC CGCCGCCCTG
ACCGCACGGG CCTCGCACGC CCTGGGACGC GGGGACCTGG ACGCGGCGGG CGTCCGCGGG
CGTGAGGCGC TGGAACTCTT CCGCGGACTC GGGGACCGCT GGGGCCAGCT GAACGCCCTG
ACCGTCCTGG CGACGCCGGC CGAGGTCACC GGCCGCCTCG CCGACGCGGC CCGCTTCCGC
CGGGAGGCCC TGGGCATGGC CGAGGAGCTG GAGCTGTGGT CCGAGGCCGC CGGGGCCACG
GCCGGGCTGG GCCGGATCGC CCTGCTGGAG GGGGACCTCG ACCGCGCCGA CGAACTCCAC
CGCAGGGCGC TGGACCTGGT GCGGGGGCAG GGGGACGTGC CGGGGGAGCA GTACGCCCGG
CTCGGTCTGG GGCTCAGCGC GCGGCGCCGG GGAAGACTGG AGGAGGCCGA GCGGTACGTG
CGCCCGATCG CGGAGTGGTC GGCCCGGGTG GGCTGGCTGC CGGGCGCCGC CCTGGCCCTG
GCCGAACTGG GCTTCTCGGC GGAACTGCGG GGCGACGCGG CCGAGGCGCT GCGCCTGCAC
CGGGAGGGGC TGGCCGCGGC CCGCCTCAGC GGCGACCCGC GCGCGCTGGC GCTGGCGCTG
GAGGGCGTGG CCGCCGCGCA CACGCTCACC GGCCGCCACG GCGAGGCCGC CGGGCTCCTG
GGCGCCGCCG AGGCCCTGCG CGAGGGCGCG GGCGCGCCGG CGCCCGCCGC TGAACGGGGC
GACGTGGACC GCGCGACGGC GCGCGCACGG GCGGCCCTGG GCGAGGCGGA GTTCGCGCGG
GCGTTCGCCT GGGGGCGGAC GCGCCCGCCC GCGGACCTCG CGGACCTCCT CGACGGCGGG
GAGCAGCGCC CCGACCCGGC CTGA
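
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of that clean-up, together with a GC-fraction calculation and a stop-codon check. The helper names are hypothetical, and the stop codons are those of NCBI translation table 11 (TAA, TAG, TGA). Only the first and last lines of the sequence are used here, so the printed GC value describes that 60-base fragment, not the whole gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text):
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq):
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence above.
first_line = "ATGCGCTTCG GTGTCCTGGG TCCCGTCCGC GCGTTCACGG ACGGCGGCCG GGCCGTGCCG"
last_line = "GAGCAGCGCC CCGACCCGGC CTGA"

fragment = clean_sequence(first_line)
print(len(fragment), round(gc_fraction(fragment), 2))
print(ends_with_stop(clean_sequence(last_line)))
```

Running `gc_fraction` on the full joined sequence would give a value to compare against the GC content field.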
 
Protein sequence
MRFGVLGPVR AFTDGGRAVP IPERKARVLL AALLAHRGWP VSADRLVEWL WADGPRPGSP 
ERALQRKVWA LRRALEEAEP GARDLVRHRP PGYLLDVPPA SVDAERFHLL ADAAQDAAGP
RERARLLTEA LELWRGPAYA DVADEEFARP TAARLEERRL GALEEHALTR LDLGEHRGVT
GPLGALVAEH PLRERLVAAY MRALYEGGRQ AEALAAHTAL AGRLREELGV DPGPEVAELH
GRILRHQPTS PVSGGVSESG TPAEIVPATA RASRVPTEPA PRAGAVRATP GAPVPGPGGA
AVAPGDPFPQ AGSFHGTPEE PFSAPGTASG PPGEPVPQVE GVHGAPEAPA SGTGNTPGTD
SNVPPRAGDA RGTPEAPAPG PGTPPGTRAA PSAGATPAHA LTGRAFPPLP LTALVGRGEE
QRRVARLLAE VRLVTLTGIG GVGKTRLALA IAHELAPGFG GGAHMVEFAA QRAAPGSPAS
EPDPVTALAR ALGVRDGGGT DPLRHVLGAL EGRHALLVLD NCEHLAGEVA DLVSALLGRL
PDLRVLTTSR EPLGVPGEVL FGVEPLPVPS ADTPADRVGD SGAVRLFAER AAASAPGFAL
TPDNAADVAL LCRRLDGIPL ALELAATRVR ALGVTGVLSR LDDRFRLLAT ARRHLPPRQR
TLRAMVDWSW ELLDERERVL LRRLAVFTGG CAPEDAEAVC SGGGIDAADV VDLLTSLVDR
SLVAAVDDPL TGRRHRLLES VADYACQRLS EAGEAHVLRG RHLDHYTALA RRYAGRILGP
SQGEALRRLD AEASNLRAAL DEAVVRGAGG HAARLVNSLG WYWYMRGRYR EGRSLTRRVR
DAVRGSASTE AAMAGATAAV FEILAGDGGD HTAPARTALG AFDGLPGASG DAVLERARAA
WMLGFVLYSR GDRAVSEDLV TRALAVFRER DDRWGLAAAL TARASHALGR GDLDAAGVRG
REALELFRGL GDRWGQLNAL TVLATPAEVT GRLADAARFR REALGMAEEL ELWSEAAGAT
AGLGRIALLE GDLDRADELH RRALDLVRGQ GDVPGEQYAR LGLGLSARRR GRLEEAERYV
RPIAEWSARV GWLPGAALAL AELGFSAELR GDAAEALRLH REGLAAARLS GDPRALALAL
EGVAAAHTLT GRHGEAAGLL GAAEALREGA GAPAPAAERG DVDRATARAR AALGEAEFAR
AFAWGRTRPP ADLADLLDGG EQRPDPA
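
The protein sequence can be reproduced from the gene sequence with translation table 11. Below is a minimal sketch, assuming the compact NCBI-style codon table with bases ordered T, C, A, G (for non-initiator codons, table 11 assigns the same amino acids as the standard code). The function name `translate` is illustrative, and special handling of alternative start codons is omitted because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a cleaned DNA string, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 24 bases of the gene sequence above.
print(translate("ATGCGCTTCGGTGTCCTGGGTCCCGTCCGC"))  # prints: MRFGVLGPVR
print(translate("GAGCAGCGCCCCGACCCGGCCTGA"))        # prints: EQRPDPA
```

The two outputs match the first and last residues of the protein sequence listed above.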