Gene Ndas_1806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1806 
Symbol 
ID9245656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2208817 
End bp2211999 
Gene Length3183 bp 
Protein Length1060 aa 
Translation table11 
GC content72% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003679740 
Protein GI297560766 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.287202 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.440265 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGACTCT GTGTTCTGGG CTCCATCCAT CTGAGCAGCG ACGGACGCCG CGTCACCCCG 
ACCACGGGCA AGGAACGCGC GCTCCTCGCC GCCCTGGCAC TGACGGGCCC GGGACGGCTA
CCCCTTCGGG AGCTGGCCGA CCGGCTCTGG AACGGCTCTC CCCCGGAGGG CTACCGCAGC
ACACTGCACA CCTACGTCAC GCGCCTGCGG GGCAAGATCG AGGCGGCCGG GCTGGACCGC
GGCATCCTGA CCCACACCGC CGAGGGCTAC CTGCTGCGCC TCCCCGCCCA GGCGCTGGAC
TGGGAGCGCT TCCGGAGCCT GCGCTCACGC GCCCGCGCCC TGCACCAGGC GGGCCGCTCC
GCCGAGGCGC GCGCGGCCCT GGAGGAGGCG CTGGCGCTGT GGAAGGGGCC CGCGCTGCCC
GGTGTGGGCG GACGCTGGGT CGAACAGCTG CGCGTGTCCA TGGAACGCTC CCACCAGGAC
GCGCTGGCGG TGTGGGCCTC GCTGGCCATC GCCGAGGGCG ACCACGCGGA GGCGGTGGAG
GTCCTGGGTT CGGCCCTGGT CGCCTACCCC GTCAACGAGT CCCTGCACGG CCACCTCCTG
CACGGGCTGC ACGCCCTGGG CCGGACGGCC GACGCCCTCG CGGCCTACCA GCGGTTGCGG
GAGCGGCTGT CGGAGGAGCT GGGCGTGGAC CCCTCCCCGG CGCTGCGGCG GCGGTTCGAA
CTGCTGCTGC GGGCCACCGC CGGTCCGGTC GGGGACGGTG GGTCGGACGG GGACGGCGGG
TCGGACGGGG TCGGCGGGTC GCAGGCCCCG CCTCCCGCGG CCGGGAGGCC GCGCACGATC
GTGGACAACC TGGCGCGCGA GCCCCGGCAC TTCCGGGGGC GCACCGCGCA GACCCGCCTG
CTCTGCTCGC GGCTGCGGGA GACGGAGTCC GGTTTCGCGC ACCTGTGGGT CGTCTGCGGG
ATGGCGGGGG TGGGCAAGAG CCAACTGAGC CTGCACGTGG CCCATCGCGT CAAGGACCTC
TACCCCGACG CGCGGCTGTC GGTGGACCTG CGCGGGCACG ACGAGCGGGC CGCTCCGCTG
AGCGCGGAGG AGGCCCTCAC CGAACTCATG CGCCTGCTGC GCGTGCCGGT GCCGCCCACC
GTGCTGAGCC TGGCCGAACG CGTCGCTCTG TGGCGCGAGC ACACGCGGTC GATGCGCCTG
CTGCTCCTCC TGGAGGACGC CGCCAGCGCC GAACAGGTCC TTCCGCTGCT CCCCAGCGGG
ACCCGGTGCG CGGTCGTGAT CACCAGCCGC TTCTTCCTCC CGGAGGTGGA GGGCGCCGAC
CACCTGCCGC TCGGGCTTCC CTCCGACGAG GAGTGCACCG AGATGTTCAC CGCCGCCCTC
GACCGGCCGT GGGGCGAGGA GGAGGCCGAC ACCCTCGCCG AGATCATCGA CCGGTGCGAC
AGGCTGCCGA TAGCCGTCGG GCTCGTGGCG AACCTGGCCC AGTTCCATCC GTCGTGGTCG
CCGCGCGACC TGCTGCACCG GCTGCCCTCA CCGTCCTTCC CGGGACTGAG CGCCTTCCGG
GTGGGGGGTC GGGACCTCTC ACGCATCTTC GACGTGTCCG TGGACGCCAT GGACCCGAGG
GCCCGTGACG CCTTCCTCCT GCTCGGCCTG CACCCCACCC GCGGTATGGA CGGGCGGGTG
GCGGCGGCGC TGGTCGGCCC CGACGAGCAC GCCTCACGGC AGGCCCTCAT GGACCTGGTG
AGCGCCCACC TCCTGAACGA GCCCGAACCC GATCGCTTCG AGATGCACGC CCTGCTCCAG
GTTCACGCCC GCCACCGCGC GCGCGACGGT CTGTCCCCGG AGCGGGCCCA TGCGGCCAAG
CAGAGGATGC ACGCGGCCTA CCTGTCGTTG ACGAGCCTGG CCGACCGGGC CCTGCACCCG
CACCGGCCCG GGCGCGACGA CCACGAGCCA CTTCCCATCG GGGAGCGATG GCCGGAGGAG
AGGGCCATGG CGTGGTTCCG GCGCGAACTG GCGACCGTGC TCACCATGCT GTCGGAGGCG
GACCGGCACG GCGCGGGCCG CACCGCCCTG CGACTGGCGC GGGTGGTGTG CGACCACATG
GACGCGCACG GTCCCTGGTC CGAGGCGGTC ACCCTGCACA CCGCCATGGT GGAGTGGGCA
CGCGATCAGG GGGAGGACCG TGAGCGGGCC AGGGCCCACT TCGACCTGGC CCGCGCCCTT
CTGCGCGTGG GAGAGGTGGA CCGCGCCGGA GAACACAGCC GTTCGGCCCA TGAGCGGTGG
TCCGAGGCGG GGGACGTCCT GGGACAGGCG TGGGCCGTGG CCCAAGGGGC GATGATCTCC
TACGTCGCCA ACGACTACGC GGGCAGCCGG GCCCTCGTCG AACGGGCTCT GGAGACGTTC
GAGGCGCGCG GACACCGCCC CGGAATAGTG TTCTGCCTGC GTGTGCGCGG GCTGTGCCAC
TTCGCCGTGA GCGCGTCCCA CGAGGCGATC GCGGACTTCA GTGACGCGTC GACCCTGCTG
GAACGCTCAC AGGACCAGCA GTTGCTCATG GAGATGCAAC TGAACCTGGC GGGCGCGTTC
CAACAACTGG GGTACCACCA CCAGTCCTGG TCGCTGTGCG AAAAGGTCCT GGTGACCGCC
CGTCGGCGCG GTGACAGGCG CAGGGCGGCC GTCGCTCTGA CGAATCTGGG CAAGCTGTCC
CTGCACCGGG AGCGGCCGGA GCAGGCCGTA TCGCGTCTTC AGGAATCGTT GAAGATCCTC
GAATCCTTCG GAGACCCCTG GGCCAAGAGC ACGATCCTGA CGAACCTGGG CACAGCACAC
GCCGCAGCCA ACCAGCCGGA CCAGGCCCGC CTCTGCTTCT GTCGCGCTCT GGCAATGCGC
GGGTTCATCA CTCCTCCGGC GCGGACCGAG TCTCTACTGG GCCTGGCCCG ACTGGAGGGG
ACGGCCGGAC GGCACCATGA CGTGGTAACC CACCTGCGCC AGGCCGCCTC GACGGCGCGG
CAGCACGGTC TGCGCAAGGA GCACGCCACA GCCCTGCTCG CTCTGGGCCA ACACCTCAGT
GTCCACGGTG AGCGGAGCGA AGCGAATGCG TGCCTGCGCC AGGCCGCGGA AATCTTCGAG
GTACTTCAGG CTCCCGAGGC TCAGATCACC CGGTCGCTGC TCGAAACTCT GGGCAACACA
TAA
 
Protein sequence
MRLCVLGSIH LSSDGRRVTP TTGKERALLA ALALTGPGRL PLRELADRLW NGSPPEGYRS 
TLHTYVTRLR GKIEAAGLDR GILTHTAEGY LLRLPAQALD WERFRSLRSR ARALHQAGRS
AEARAALEEA LALWKGPALP GVGGRWVEQL RVSMERSHQD ALAVWASLAI AEGDHAEAVE
VLGSALVAYP VNESLHGHLL HGLHALGRTA DALAAYQRLR ERLSEELGVD PSPALRRRFE
LLLRATAGPV GDGGSDGDGG SDGVGGSQAP PPAAGRPRTI VDNLAREPRH FRGRTAQTRL
LCSRLRETES GFAHLWVVCG MAGVGKSQLS LHVAHRVKDL YPDARLSVDL RGHDERAAPL
SAEEALTELM RLLRVPVPPT VLSLAERVAL WREHTRSMRL LLLLEDAASA EQVLPLLPSG
TRCAVVITSR FFLPEVEGAD HLPLGLPSDE ECTEMFTAAL DRPWGEEEAD TLAEIIDRCD
RLPIAVGLVA NLAQFHPSWS PRDLLHRLPS PSFPGLSAFR VGGRDLSRIF DVSVDAMDPR
ARDAFLLLGL HPTRGMDGRV AAALVGPDEH ASRQALMDLV SAHLLNEPEP DRFEMHALLQ
VHARHRARDG LSPERAHAAK QRMHAAYLSL TSLADRALHP HRPGRDDHEP LPIGERWPEE
RAMAWFRREL ATVLTMLSEA DRHGAGRTAL RLARVVCDHM DAHGPWSEAV TLHTAMVEWA
RDQGEDRERA RAHFDLARAL LRVGEVDRAG EHSRSAHERW SEAGDVLGQA WAVAQGAMIS
YVANDYAGSR ALVERALETF EARGHRPGIV FCLRVRGLCH FAVSASHEAI ADFSDASTLL
ERSQDQQLLM EMQLNLAGAF QQLGYHHQSW SLCEKVLVTA RRRGDRRRAA VALTNLGKLS
LHRERPEQAV SRLQESLKIL ESFGDPWAKS TILTNLGTAH AAANQPDQAR LCFCRALAMR
GFITPPARTE SLLGLARLEG TAGRHHDVVT HLRQAASTAR QHGLRKEHAT ALLALGQHLS
VHGERSEANA CLRQAAEIFE VLQAPEAQIT RSLLETLGNT