Gene Information |
Locus tag | Ndas_5300 |
Symbol | |
ID | 9249199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 462716 |
End bp | 465769 |
Gene Length | 3054 bp |
Protein Length | 1017 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003683186 |
Protein GI | 297564213 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.706654 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTAT TCCCTATTTC CGGCCAATGG CCCTTGCTAG GGTTGCTGAT TGTGGAATTC GGCGTGCTTG GCCCCATTAC CGTCTGGTCC GACGGCCGAC CTGTTCCGGT CGGCGGTCCG CGCCAGCGCT GTGTGCTGGG CGCGCTGCTG GTGCACCTGG GGCGCGAAGT CACCATCGAC CAACTCATCG GTTACCTCTG GAGCGACGAT CCGCCCCGTA CCGCCCGGTC GGTGATCCAG GTCCAGATAT CCCACCTGCG CCGCAGCCTC CCCGGTACCA TCGCCACCAC CCCCGGCGGG TACACCCTCG ACGTCGACGC TGACTCCGTC GACCTGCACC GCTTCCGCAG GCTCCGGGAC CGGGCCGCCG CGGCCGAGCC CAAGACCGCG GTCGACATGC TGGAACAGGC GCTGGAGTGC TGGCGCGGAG TCCCCTTCTC CGGTGTCGGC TCCGAGTACC TGGACTACAC CGTCGTCGCC CCCCTCCGGG AGGAGCGCTG GTCCTGCGTC GTGGCCTGGG CCACCCACGC GCTGGAACTG GGCAGGCACG CCGACGTGGT CTCCCGGCTG ACGTCCCTGG TCAGCGAGGA GCCCTTCAGG GAGCGGCTGC ACCACCTGCT CATCACGGCC CTGTGGCGCG ACAACGAACG GGCCAGGGCG CTCTCCGTCT ACGAGGAGTT CCGGGCCAGG CTGGCCGACG AGCTGGGTGT CGACCCCGGT CCGGAACTGG TCGCCCTGCA CACCCGGATC CTCCAGGAGG ACTTCTCGGA GGAGGGGCCG CAGGACCTGT CGTCCGGGGA GCCGGGAACG CGTTTCGTGG TCCGCAACGA CCTCCCGCGG GACCTGCCGG ACTTCACCGG ACGCCAGGAG TCCCTGCGGC GGTTGGACGA GGTGGCCCGT ACCGGAGACG ACCGCGCCCA GGTCTGCGTC ATCACGGGCA GCGGCGGCGA GGGCAAGACG ACCACAGCGG TCCGCTTCGG CTATGAGGCG GCCGGGCGCT ACCCCGACGG ACAGCTGTTC ATCGACCTGT ACGGGTACAC GACCGACAGG GAGCCTCTCG ACGCCATGTC CGCGCTGGGC GCCCTGCTGC GCGCGGTCGG CGTCGAGCCC GAGGCCGTGC CCGAGTCCCT CGAAGAGCGC GCGGCGCTGT GGCGGGCCAC CCTCATGGGG CGCAGGGTCC TGGTCATCCT CGACAACGCG TTCAGCTATG CCCAGGTCAG CCCGCTCCTC TCCTCCTCGC CGGGGTCGAT GACCCTCATC ACGACCCGCA ACGAACTCTC CGGGCTCAGC GGCGCCCGCT TCCTCTCCCT GGGGGTGTTC GACGAGAGCT CCTCCCTGGA GCTGCTCGGA CGCGTCCTGG GAGAGGACCG CGTACAGCGC GAACCGGACC AGGCCCGGGA GATCATCCGG ATCTGCGGTG GCCTCCCCCT CGCCCTGCGC GTGGTGGCGG GACGGATGCT CAGCCGTCCC AGGTGGTCGT TCGCGCACGT CGCCCGCCGA CTCGGTGAGC AGAACCGGAA GTTCCGCGAA CTCCAGGTCG AGGGGCAGAG CGTCGAGGCC GCCATCGACC TGTCCTTCCA GAGCCTCAAC CGGGACCAGA GCAGGACCTT CCTCCTGCTG GGTCTGATGA TCGGCAGCAC GATCGACCTC GGCGGCGCGG CCGCCCTCCT GGACATGACG GTGGAGGACG CGGACGACAT ACTCCAGGAG CTGGTCGGGG TGTGCCTGCT GGAGGAGCCC CAGGGGGACG TGTACCGCCT GCACGACCTC ATCGGGGCCT TCTCCCGGGA TCGTGCGGCC 
ATGCTGCTGG ACGCCGGGGA GATCGAGGCC GCGAAGCTCC GTCTGGCGGA GCAGTACATG GCCACGGCAC AGCACGCCGC CGACCTCCTG GGGCCGCGCG CGCACGACGA CGAGATCGAC GTGAGCCGGG GTTACCGCAC CGAACTGTCG GGGAGGGAGG ACGCCGAGAA CTGGTTCACC CTGCACCAGG AGAACCTCGC GGAGACGATC GAGTACTTCG CCTCGCACGG CAACGGCGAG TACGCCTGGC GTATGGCGGA CGCGGTGTGG CGTTTCTACG CCCTCCACGG CCAGATGGGC CTGCTGATCA GTTCCCACCA GCGGGCACTC CAGATCAGCG ACAAGCAGGG GAACCGGCGC GGGCGCGCGG TGACCCTCAT CGGGCTGGGC ATCGCCCACT GCCTCTCGGG GCGCTTCGAC GAGTCGCTCG CCTTCCTCAC CGAGGCCCGG GAACTGCTGA CCGCGATCCA CGACAGCAGG GGGATCATCC GGGCCCTGGC CAACCTGGGG ATGGTCTACG AGCGCGTCGG CCGTCTCGCT GACGCGGCGG AGTCCATCCA GGGTGTGCTG GACTACGCGG TCCAGCTGGG CGACACCCGC CTGGAGGCGT TGCAGTGGGG CAACCTCGCC GTCCTCAAAC AGACGCTCGG CGCGTACACG GAGGCTCTCC ACTGTGCCCA GCAGTCCATG GAGAAGGCCG TGGGCGAGGG CCAGAAGGTG ACCCGGTCCC ACGCCAAACG GGTCATGGGG GAGGCCCGCA CCGGGCTCGG AGAGCTGGAC GCGGCCTTCG CCGACCTGAA CGAGGCCCTG GAGCTGTCAC ACGAGCTGCG CCTGGTGGGC AACCAGGTCT ACATCCACAA CTCCCTGGGG CTGGCCCACC GGGCCGCGGA GCAGTGGGAG CGGGCGATCG AGTCCCACAC CACGGCACTG GACCTGGCCG AGCAGCACGG GCGCCGCAGT GGTGACGCCG AGATCCGCGT CGACCTGGGG ATGACCTACG CGGCCGCCGG ACGCCACCGC GAGGCGCTGT CCGAGCTGGA GGGGGCCCAC GCCATCGCGG TGGAGCGCGG CGAGCGCCAC ATGGTCGCCC GCGCCGCGCT CGCCCTCGGA CGCCTGCCCG CACCGGTCAT GGCCGCGGAC CGGGCCCGGG GGTTCCTCGG CGAGGCCGAG GAGATCTTCA CCGAGTTGGG GCTGGCCGAG GCGGAACAGG CCAGGAAGGC CCTGAAGGAC CACCCGCCCG CGTCCCTCGG CTGA
|
Protein sequence | MSVFPISGQW PLLGLLIVEF GVLGPITVWS DGRPVPVGGP RQRCVLGALL VHLGREVTID QLIGYLWSDD PPRTARSVIQ VQISHLRRSL PGTIATTPGG YTLDVDADSV DLHRFRRLRD RAAAAEPKTA VDMLEQALEC WRGVPFSGVG SEYLDYTVVA PLREERWSCV VAWATHALEL GRHADVVSRL TSLVSEEPFR ERLHHLLITA LWRDNERARA LSVYEEFRAR LADELGVDPG PELVALHTRI LQEDFSEEGP QDLSSGEPGT RFVVRNDLPR DLPDFTGRQE SLRRLDEVAR TGDDRAQVCV ITGSGGEGKT TTAVRFGYEA AGRYPDGQLF IDLYGYTTDR EPLDAMSALG ALLRAVGVEP EAVPESLEER AALWRATLMG RRVLVILDNA FSYAQVSPLL SSSPGSMTLI TTRNELSGLS GARFLSLGVF DESSSLELLG RVLGEDRVQR EPDQAREIIR ICGGLPLALR VVAGRMLSRP RWSFAHVARR LGEQNRKFRE LQVEGQSVEA AIDLSFQSLN RDQSRTFLLL GLMIGSTIDL GGAAALLDMT VEDADDILQE LVGVCLLEEP QGDVYRLHDL IGAFSRDRAA MLLDAGEIEA AKLRLAEQYM ATAQHAADLL GPRAHDDEID VSRGYRTELS GREDAENWFT LHQENLAETI EYFASHGNGE YAWRMADAVW RFYALHGQMG LLISSHQRAL QISDKQGNRR GRAVTLIGLG IAHCLSGRFD ESLAFLTEAR ELLTAIHDSR GIIRALANLG MVYERVGRLA DAAESIQGVL DYAVQLGDTR LEALQWGNLA VLKQTLGAYT EALHCAQQSM EKAVGEGQKV TRSHAKRVMG EARTGLGELD AAFADLNEAL ELSHELRLVG NQVYIHNSLG LAHRAAEQWE RAIESHTTAL DLAEQHGRRS GDAEIRVDLG MTYAAAGRHR EALSELEGAH AIAVERGERH MVARAALALG RLPAPVMAAD RARGFLGEAE EIFTELGLAE AEQARKALKD HPPASLG
|
| |
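The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only an illustration: the function name `cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon (the gene sequence above ends in TGA) which is not counted in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record for Ndas_5300
gene_len, protein_len = cds_lengths(462716, 465769)
print(gene_len, protein_len)  # 3054 1017
```

Under these assumptions the result agrees with the Gene Length (3054 bp) and Protein Length (1017 aa) fields listed in the record.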