Gene Information |
Locus tag | Ndas_1806 |
Symbol | |
ID | 9245656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2208817 |
End bp | 2211999 |
Gene Length | 3183 bp |
Protein Length | 1060 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003679740 |
Protein GI | 297560766 |
COG category | |
COG ID | |
TIGRFAM ID | |
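
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the stated gene length) and that the CDS ends in a single stop codon, as the TAA at the end of the gene sequence suggests. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
# Values copied from the record; the helper names are illustrative only.
START_BP = 2208817
END_BP = 2211999


def gene_length(start: int, end: int) -> int:
    """Length in bp of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 3183 1060
```

Both results agree with the Gene Length (3183 bp) and Protein Length (1060 aa) fields of the record.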
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.287202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.440265 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGACTCT GTGTTCTGGG CTCCATCCAT CTGAGCAGCG ACGGACGCCG CGTCACCCCG ACCACGGGCA AGGAACGCGC GCTCCTCGCC GCCCTGGCAC TGACGGGCCC GGGACGGCTA CCCCTTCGGG AGCTGGCCGA CCGGCTCTGG AACGGCTCTC CCCCGGAGGG CTACCGCAGC ACACTGCACA CCTACGTCAC GCGCCTGCGG GGCAAGATCG AGGCGGCCGG GCTGGACCGC GGCATCCTGA CCCACACCGC CGAGGGCTAC CTGCTGCGCC TCCCCGCCCA GGCGCTGGAC TGGGAGCGCT TCCGGAGCCT GCGCTCACGC GCCCGCGCCC TGCACCAGGC GGGCCGCTCC GCCGAGGCGC GCGCGGCCCT GGAGGAGGCG CTGGCGCTGT GGAAGGGGCC CGCGCTGCCC GGTGTGGGCG GACGCTGGGT CGAACAGCTG CGCGTGTCCA TGGAACGCTC CCACCAGGAC GCGCTGGCGG TGTGGGCCTC GCTGGCCATC GCCGAGGGCG ACCACGCGGA GGCGGTGGAG GTCCTGGGTT CGGCCCTGGT CGCCTACCCC GTCAACGAGT CCCTGCACGG CCACCTCCTG CACGGGCTGC ACGCCCTGGG CCGGACGGCC GACGCCCTCG CGGCCTACCA GCGGTTGCGG GAGCGGCTGT CGGAGGAGCT GGGCGTGGAC CCCTCCCCGG CGCTGCGGCG GCGGTTCGAA CTGCTGCTGC GGGCCACCGC CGGTCCGGTC GGGGACGGTG GGTCGGACGG GGACGGCGGG TCGGACGGGG TCGGCGGGTC GCAGGCCCCG CCTCCCGCGG CCGGGAGGCC GCGCACGATC GTGGACAACC TGGCGCGCGA GCCCCGGCAC TTCCGGGGGC GCACCGCGCA GACCCGCCTG CTCTGCTCGC GGCTGCGGGA GACGGAGTCC GGTTTCGCGC ACCTGTGGGT CGTCTGCGGG ATGGCGGGGG TGGGCAAGAG CCAACTGAGC CTGCACGTGG CCCATCGCGT CAAGGACCTC TACCCCGACG CGCGGCTGTC GGTGGACCTG CGCGGGCACG ACGAGCGGGC CGCTCCGCTG AGCGCGGAGG AGGCCCTCAC CGAACTCATG CGCCTGCTGC GCGTGCCGGT GCCGCCCACC GTGCTGAGCC TGGCCGAACG CGTCGCTCTG TGGCGCGAGC ACACGCGGTC GATGCGCCTG CTGCTCCTCC TGGAGGACGC CGCCAGCGCC GAACAGGTCC TTCCGCTGCT CCCCAGCGGG ACCCGGTGCG CGGTCGTGAT CACCAGCCGC TTCTTCCTCC CGGAGGTGGA GGGCGCCGAC CACCTGCCGC TCGGGCTTCC CTCCGACGAG GAGTGCACCG AGATGTTCAC CGCCGCCCTC GACCGGCCGT GGGGCGAGGA GGAGGCCGAC ACCCTCGCCG AGATCATCGA CCGGTGCGAC AGGCTGCCGA TAGCCGTCGG GCTCGTGGCG AACCTGGCCC AGTTCCATCC GTCGTGGTCG CCGCGCGACC TGCTGCACCG GCTGCCCTCA CCGTCCTTCC CGGGACTGAG CGCCTTCCGG GTGGGGGGTC GGGACCTCTC ACGCATCTTC GACGTGTCCG TGGACGCCAT GGACCCGAGG GCCCGTGACG CCTTCCTCCT GCTCGGCCTG CACCCCACCC GCGGTATGGA CGGGCGGGTG GCGGCGGCGC TGGTCGGCCC CGACGAGCAC GCCTCACGGC AGGCCCTCAT GGACCTGGTG AGCGCCCACC TCCTGAACGA GCCCGAACCC GATCGCTTCG AGATGCACGC CCTGCTCCAG 
GTTCACGCCC GCCACCGCGC GCGCGACGGT CTGTCCCCGG AGCGGGCCCA TGCGGCCAAG CAGAGGATGC ACGCGGCCTA CCTGTCGTTG ACGAGCCTGG CCGACCGGGC CCTGCACCCG CACCGGCCCG GGCGCGACGA CCACGAGCCA CTTCCCATCG GGGAGCGATG GCCGGAGGAG AGGGCCATGG CGTGGTTCCG GCGCGAACTG GCGACCGTGC TCACCATGCT GTCGGAGGCG GACCGGCACG GCGCGGGCCG CACCGCCCTG CGACTGGCGC GGGTGGTGTG CGACCACATG GACGCGCACG GTCCCTGGTC CGAGGCGGTC ACCCTGCACA CCGCCATGGT GGAGTGGGCA CGCGATCAGG GGGAGGACCG TGAGCGGGCC AGGGCCCACT TCGACCTGGC CCGCGCCCTT CTGCGCGTGG GAGAGGTGGA CCGCGCCGGA GAACACAGCC GTTCGGCCCA TGAGCGGTGG TCCGAGGCGG GGGACGTCCT GGGACAGGCG TGGGCCGTGG CCCAAGGGGC GATGATCTCC TACGTCGCCA ACGACTACGC GGGCAGCCGG GCCCTCGTCG AACGGGCTCT GGAGACGTTC GAGGCGCGCG GACACCGCCC CGGAATAGTG TTCTGCCTGC GTGTGCGCGG GCTGTGCCAC TTCGCCGTGA GCGCGTCCCA CGAGGCGATC GCGGACTTCA GTGACGCGTC GACCCTGCTG GAACGCTCAC AGGACCAGCA GTTGCTCATG GAGATGCAAC TGAACCTGGC GGGCGCGTTC CAACAACTGG GGTACCACCA CCAGTCCTGG TCGCTGTGCG AAAAGGTCCT GGTGACCGCC CGTCGGCGCG GTGACAGGCG CAGGGCGGCC GTCGCTCTGA CGAATCTGGG CAAGCTGTCC CTGCACCGGG AGCGGCCGGA GCAGGCCGTA TCGCGTCTTC AGGAATCGTT GAAGATCCTC GAATCCTTCG GAGACCCCTG GGCCAAGAGC ACGATCCTGA CGAACCTGGG CACAGCACAC GCCGCAGCCA ACCAGCCGGA CCAGGCCCGC CTCTGCTTCT GTCGCGCTCT GGCAATGCGC GGGTTCATCA CTCCTCCGGC GCGGACCGAG TCTCTACTGG GCCTGGCCCG ACTGGAGGGG ACGGCCGGAC GGCACCATGA CGTGGTAACC CACCTGCGCC AGGCCGCCTC GACGGCGCGG CAGCACGGTC TGCGCAAGGA GCACGCCACA GCCCTGCTCG CTCTGGGCCA ACACCTCAGT GTCCACGGTG AGCGGAGCGA AGCGAATGCG TGCCTGCGCC AGGCCGCGGA AATCTTCGAG GTACTTCAGG CTCCCGAGGC TCAGATCACC CGGTCGCTGC TCGAAACTCT GGGCAACACA TAA
|
Protein sequence | MRLCVLGSIH LSSDGRRVTP TTGKERALLA ALALTGPGRL PLRELADRLW NGSPPEGYRS TLHTYVTRLR GKIEAAGLDR GILTHTAEGY LLRLPAQALD WERFRSLRSR ARALHQAGRS AEARAALEEA LALWKGPALP GVGGRWVEQL RVSMERSHQD ALAVWASLAI AEGDHAEAVE VLGSALVAYP VNESLHGHLL HGLHALGRTA DALAAYQRLR ERLSEELGVD PSPALRRRFE LLLRATAGPV GDGGSDGDGG SDGVGGSQAP PPAAGRPRTI VDNLAREPRH FRGRTAQTRL LCSRLRETES GFAHLWVVCG MAGVGKSQLS LHVAHRVKDL YPDARLSVDL RGHDERAAPL SAEEALTELM RLLRVPVPPT VLSLAERVAL WREHTRSMRL LLLLEDAASA EQVLPLLPSG TRCAVVITSR FFLPEVEGAD HLPLGLPSDE ECTEMFTAAL DRPWGEEEAD TLAEIIDRCD RLPIAVGLVA NLAQFHPSWS PRDLLHRLPS PSFPGLSAFR VGGRDLSRIF DVSVDAMDPR ARDAFLLLGL HPTRGMDGRV AAALVGPDEH ASRQALMDLV SAHLLNEPEP DRFEMHALLQ VHARHRARDG LSPERAHAAK QRMHAAYLSL TSLADRALHP HRPGRDDHEP LPIGERWPEE RAMAWFRREL ATVLTMLSEA DRHGAGRTAL RLARVVCDHM DAHGPWSEAV TLHTAMVEWA RDQGEDRERA RAHFDLARAL LRVGEVDRAG EHSRSAHERW SEAGDVLGQA WAVAQGAMIS YVANDYAGSR ALVERALETF EARGHRPGIV FCLRVRGLCH FAVSASHEAI ADFSDASTLL ERSQDQQLLM EMQLNLAGAF QQLGYHHQSW SLCEKVLVTA RRRGDRRRAA VALTNLGKLS LHRERPEQAV SRLQESLKIL ESFGDPWAKS TILTNLGTAH AAANQPDQAR LCFCRALAMR GFITPPARTE SLLGLARLEG TAGRHHDVVT HLRQAASTAR QHGLRKEHAT ALLALGQHLS VHGERSEANA CLRQAAEIFE VLQAPEAQIT RSLLETLGNT
|