Gene Ndas_5300 details

Gene Information

Locus tag: Ndas_5300
Symbol:
ID: 9249199
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 462716
End bp: 465769
Gene Length: 3054 bp
Protein Length: 1017 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: transcriptional regulator, SARP family
Protein accession: YP_003683186
Protein GI: 297564213
COG category:
COG ID:
TIGRFAM ID:
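
The Start bp, End bp, Gene Length and Protein Length values in this record are related by simple arithmetic, and a quick consistency check can be sketched as follows. This is only an illustration: it assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon, neither of which is stated in the record itself. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 462716
END_BP = 465769
GENE_LENGTH_BP = 3054
PROTEIN_LENGTH_AA = 1017


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 465769 - 462716 + 1 gives 3054 bp, and 3054 / 3 = 1018 codons, one of which is the stop, leaving 1017 residues.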


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.706654
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGTAT TCCCTATTTC CGGCCAATGG CCCTTGCTAG GGTTGCTGAT TGTGGAATTC 
GGCGTGCTTG GCCCCATTAC CGTCTGGTCC GACGGCCGAC CTGTTCCGGT CGGCGGTCCG
CGCCAGCGCT GTGTGCTGGG CGCGCTGCTG GTGCACCTGG GGCGCGAAGT CACCATCGAC
CAACTCATCG GTTACCTCTG GAGCGACGAT CCGCCCCGTA CCGCCCGGTC GGTGATCCAG
GTCCAGATAT CCCACCTGCG CCGCAGCCTC CCCGGTACCA TCGCCACCAC CCCCGGCGGG
TACACCCTCG ACGTCGACGC TGACTCCGTC GACCTGCACC GCTTCCGCAG GCTCCGGGAC
CGGGCCGCCG CGGCCGAGCC CAAGACCGCG GTCGACATGC TGGAACAGGC GCTGGAGTGC
TGGCGCGGAG TCCCCTTCTC CGGTGTCGGC TCCGAGTACC TGGACTACAC CGTCGTCGCC
CCCCTCCGGG AGGAGCGCTG GTCCTGCGTC GTGGCCTGGG CCACCCACGC GCTGGAACTG
GGCAGGCACG CCGACGTGGT CTCCCGGCTG ACGTCCCTGG TCAGCGAGGA GCCCTTCAGG
GAGCGGCTGC ACCACCTGCT CATCACGGCC CTGTGGCGCG ACAACGAACG GGCCAGGGCG
CTCTCCGTCT ACGAGGAGTT CCGGGCCAGG CTGGCCGACG AGCTGGGTGT CGACCCCGGT
CCGGAACTGG TCGCCCTGCA CACCCGGATC CTCCAGGAGG ACTTCTCGGA GGAGGGGCCG
CAGGACCTGT CGTCCGGGGA GCCGGGAACG CGTTTCGTGG TCCGCAACGA CCTCCCGCGG
GACCTGCCGG ACTTCACCGG ACGCCAGGAG TCCCTGCGGC GGTTGGACGA GGTGGCCCGT
ACCGGAGACG ACCGCGCCCA GGTCTGCGTC ATCACGGGCA GCGGCGGCGA GGGCAAGACG
ACCACAGCGG TCCGCTTCGG CTATGAGGCG GCCGGGCGCT ACCCCGACGG ACAGCTGTTC
ATCGACCTGT ACGGGTACAC GACCGACAGG GAGCCTCTCG ACGCCATGTC CGCGCTGGGC
GCCCTGCTGC GCGCGGTCGG CGTCGAGCCC GAGGCCGTGC CCGAGTCCCT CGAAGAGCGC
GCGGCGCTGT GGCGGGCCAC CCTCATGGGG CGCAGGGTCC TGGTCATCCT CGACAACGCG
TTCAGCTATG CCCAGGTCAG CCCGCTCCTC TCCTCCTCGC CGGGGTCGAT GACCCTCATC
ACGACCCGCA ACGAACTCTC CGGGCTCAGC GGCGCCCGCT TCCTCTCCCT GGGGGTGTTC
GACGAGAGCT CCTCCCTGGA GCTGCTCGGA CGCGTCCTGG GAGAGGACCG CGTACAGCGC
GAACCGGACC AGGCCCGGGA GATCATCCGG ATCTGCGGTG GCCTCCCCCT CGCCCTGCGC
GTGGTGGCGG GACGGATGCT CAGCCGTCCC AGGTGGTCGT TCGCGCACGT CGCCCGCCGA
CTCGGTGAGC AGAACCGGAA GTTCCGCGAA CTCCAGGTCG AGGGGCAGAG CGTCGAGGCC
GCCATCGACC TGTCCTTCCA GAGCCTCAAC CGGGACCAGA GCAGGACCTT CCTCCTGCTG
GGTCTGATGA TCGGCAGCAC GATCGACCTC GGCGGCGCGG CCGCCCTCCT GGACATGACG
GTGGAGGACG CGGACGACAT ACTCCAGGAG CTGGTCGGGG TGTGCCTGCT GGAGGAGCCC
CAGGGGGACG TGTACCGCCT GCACGACCTC ATCGGGGCCT TCTCCCGGGA TCGTGCGGCC
ATGCTGCTGG ACGCCGGGGA GATCGAGGCC GCGAAGCTCC GTCTGGCGGA GCAGTACATG
GCCACGGCAC AGCACGCCGC CGACCTCCTG GGGCCGCGCG CGCACGACGA CGAGATCGAC
GTGAGCCGGG GTTACCGCAC CGAACTGTCG GGGAGGGAGG ACGCCGAGAA CTGGTTCACC
CTGCACCAGG AGAACCTCGC GGAGACGATC GAGTACTTCG CCTCGCACGG CAACGGCGAG
TACGCCTGGC GTATGGCGGA CGCGGTGTGG CGTTTCTACG CCCTCCACGG CCAGATGGGC
CTGCTGATCA GTTCCCACCA GCGGGCACTC CAGATCAGCG ACAAGCAGGG GAACCGGCGC
GGGCGCGCGG TGACCCTCAT CGGGCTGGGC ATCGCCCACT GCCTCTCGGG GCGCTTCGAC
GAGTCGCTCG CCTTCCTCAC CGAGGCCCGG GAACTGCTGA CCGCGATCCA CGACAGCAGG
GGGATCATCC GGGCCCTGGC CAACCTGGGG ATGGTCTACG AGCGCGTCGG CCGTCTCGCT
GACGCGGCGG AGTCCATCCA GGGTGTGCTG GACTACGCGG TCCAGCTGGG CGACACCCGC
CTGGAGGCGT TGCAGTGGGG CAACCTCGCC GTCCTCAAAC AGACGCTCGG CGCGTACACG
GAGGCTCTCC ACTGTGCCCA GCAGTCCATG GAGAAGGCCG TGGGCGAGGG CCAGAAGGTG
ACCCGGTCCC ACGCCAAACG GGTCATGGGG GAGGCCCGCA CCGGGCTCGG AGAGCTGGAC
GCGGCCTTCG CCGACCTGAA CGAGGCCCTG GAGCTGTCAC ACGAGCTGCG CCTGGTGGGC
AACCAGGTCT ACATCCACAA CTCCCTGGGG CTGGCCCACC GGGCCGCGGA GCAGTGGGAG
CGGGCGATCG AGTCCCACAC CACGGCACTG GACCTGGCCG AGCAGCACGG GCGCCGCAGT
GGTGACGCCG AGATCCGCGT CGACCTGGGG ATGACCTACG CGGCCGCCGG ACGCCACCGC
GAGGCGCTGT CCGAGCTGGA GGGGGCCCAC GCCATCGCGG TGGAGCGCGG CGAGCGCCAC
ATGGTCGCCC GCGCCGCGCT CGCCCTCGGA CGCCTGCCCG CACCGGTCAT GGCCGCGGAC
CGGGCCCGGG GGTTCCTCGG CGAGGCCGAG GAGATCTTCA CCGAGTTGGG GCTGGCCGAG
GCGGAACAGG CCAGGAAGGC CCTGAAGGAC CACCCGCCCG CGTCCCTCGG CTGA
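
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation such as the GC content reported in the Gene Information fields. The following is a minimal sketch of that cleaning step and of a GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first line of the sequence, so its result is not the gene-wide figure.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as printed in the record.
first_line = "ATGTCCGTAT TCCCTATTTC CGGCCAATGG CCCTTGCTAG GGTTGCTGAT TGTGGAATTC"

if __name__ == "__main__":
    print(len(clean_sequence(first_line)))
    print(gc_fraction(first_line))
```

Passing the full 3054 bp sequence to `gc_fraction` would give the value to compare against the rounded GC content listed in the record.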
 
Protein sequence
MSVFPISGQW PLLGLLIVEF GVLGPITVWS DGRPVPVGGP RQRCVLGALL VHLGREVTID 
QLIGYLWSDD PPRTARSVIQ VQISHLRRSL PGTIATTPGG YTLDVDADSV DLHRFRRLRD
RAAAAEPKTA VDMLEQALEC WRGVPFSGVG SEYLDYTVVA PLREERWSCV VAWATHALEL
GRHADVVSRL TSLVSEEPFR ERLHHLLITA LWRDNERARA LSVYEEFRAR LADELGVDPG
PELVALHTRI LQEDFSEEGP QDLSSGEPGT RFVVRNDLPR DLPDFTGRQE SLRRLDEVAR
TGDDRAQVCV ITGSGGEGKT TTAVRFGYEA AGRYPDGQLF IDLYGYTTDR EPLDAMSALG
ALLRAVGVEP EAVPESLEER AALWRATLMG RRVLVILDNA FSYAQVSPLL SSSPGSMTLI
TTRNELSGLS GARFLSLGVF DESSSLELLG RVLGEDRVQR EPDQAREIIR ICGGLPLALR
VVAGRMLSRP RWSFAHVARR LGEQNRKFRE LQVEGQSVEA AIDLSFQSLN RDQSRTFLLL
GLMIGSTIDL GGAAALLDMT VEDADDILQE LVGVCLLEEP QGDVYRLHDL IGAFSRDRAA
MLLDAGEIEA AKLRLAEQYM ATAQHAADLL GPRAHDDEID VSRGYRTELS GREDAENWFT
LHQENLAETI EYFASHGNGE YAWRMADAVW RFYALHGQMG LLISSHQRAL QISDKQGNRR
GRAVTLIGLG IAHCLSGRFD ESLAFLTEAR ELLTAIHDSR GIIRALANLG MVYERVGRLA
DAAESIQGVL DYAVQLGDTR LEALQWGNLA VLKQTLGAYT EALHCAQQSM EKAVGEGQKV
TRSHAKRVMG EARTGLGELD AAFADLNEAL ELSHELRLVG NQVYIHNSLG LAHRAAEQWE
RAIESHTTAL DLAEQHGRRS GDAEIRVDLG MTYAAAGRHR EALSELEGAH AIAVERGERH
MVARAALALG RLPAPVMAAD RARGFLGEAE EIFTELGLAE AEQARKALKD HPPASLG
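
The protein sequence can be cross-checked against the gene sequence by translating the DNA codon by codon. The sketch below builds the codon table from the usual compact amino-acid string. NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. This sketch does not handle alternative start codons, which is harmless here because the gene begins with ATG. The function name `translate` and the two sample inputs (the first 30 bases and the last line of the gene sequence) are chosen for illustration.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# Start and end of the gene sequence, copied as printed in the record.
gene_start = "ATGTCCGTAT TCCCTATTTC CGGCCAATGG"
gene_end = "GCGGAACAGG CCAGGAAGGC CCTGAAGGAC CACCCGCCCG CGTCCCTCGG CTGA"

if __name__ == "__main__":
    print(translate(gene_start))
    print(translate(gene_end))
```

The two fragments translate to the first ten and last seventeen residues of the protein sequence listed above. The final line of the gene sequence starts at base 3001, which falls on a codon boundary, so it can be translated on its own.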