Gene Ndas_5201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5201 
Symbol 
ID9249094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp346615 
End bp348801 
Gene Length2187 bp 
Protein Length728 aa 
Translation table11 
GC content73% 
IMG OID 
Productprotein serine phosphatase with GAF(s) sensor(s) 
Protein accessionYP_003683087 
Protein GI297564114 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.171311 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.162808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCAGCAC ACGACGCGTC GAAGGTCGCT CGACGAGAGT TCCCTCCCGC CCCGGAGACC 
GCGGCGGCGG CCCGCGAGTT CGTTCACGAC ACCCTCCTGT CCTGGGGTGT GACCGACCCC
TCCGACGACG TGATCCTGCT GGTGAGCGAG CTGGTCACCA ACGCCGTCAT CCACGCGCGC
TCGTCCCTGG ACGTGACCGT GCGCCGGTCG GAGGGCGCGA CCGAGGTGAT GGTCACCGAC
TCCGTGCCCG AACGCGCGGT CCCGCAGGCC GGTCCCCTGT CGGTGGACAC CTCCCCGTCC
CGGGAGAACG GGCGCACCGG CGGTCTGGGG CTGGCCCTGG CGTCGGCGAT CGCGTCGAGC
TGGGGCGTGA GCTACGGGCG CAACGACAAG GCCGTGTGGT TCCGGATCGA GGACGACTCG
GGGGAGCGCG CCTCCCTGGA GTCCCCGCCG GTGCGCAGGA GCCCCTCCCG TACCGGGCGC
CCCGCCCCGT GGTCGGCGCT GGACTCCGCC CTGGGCTCGC GGCTGTCCCT GCCGCAGCTG
CTGGAGCGCA CCGTGGAGCA CTCGGCCACG GCGCTGGGCG CGGACGCGGC CTACATCACG
CTGGCCACGT CCGACGAGAC CATGTGGGAG GTGCGCGCCG CGGTCGGGCT GACCCCGGGC
GGCGCCCCGT GGCGGCCGCT GCGCGTGCGC ACGGAGGAGA TCTTCCCGTC GTCGGCGCCG
GAGCCGGGCG CGGTGATCAA CGACGACCTG ATGATCGCCC GCGCCAGCCG GGGGCGCCTG
TCGCGGGCGG GGATGCGTTC GCTGGTCACG GCGCCGCTCA TCGTGGACGG GCGGGTGGTC
GGCCTGCTGG GGGTGGCCTC CCGGCGGGCC CGGCACTTCG GGGCGTCGGC GGCCAAGCGG
CTCCAGGAGG GCGCGAACCT GATCGCGCTG CCGGTGGAGA GGGCCCGGCT GGCCGAGGTG
GAGCTGAGCA GGCGGGCCTC GCTGAGCTTC CTGGCCGAGG CCAGCGACCT GCTCGCGGGC
ACGCTGGACG AGCGGATGAC GGGCGCGCTG GCGGCGCAGC TGATCACCTC GCGGCTGGGC
CGGTGGTGCG CCATCGAGAC GATCAACGAG ATGGGCGTCT CGCAGCTGAC GCACGTGGTG
CACAGCGACG AGAACTACAA CGACATCCTG CGCGCGCTGC TGACCGGGCT GCCGCCGCAC
GAGCAGAGGG AGCCGCAGCC GCTGTGGTCC CCGGCCGAGC TGGCCAAGGA GGGGCTGGAC
GAACAGCTCG TCCGGGAGCT GGCGAGCGGT CCGGCGATCT GCGTCCCGCT GGTGGCGCAC
GGGCGCGCGC TGGGGCGGAT GACGATCGGC AAGAACGAGT CGGACGACTT CACCCGCGAG
GAGGTGGACG TCGCCGACGA CCTGAGCAGC CGGGTGGCGT CGGCGATGGA GAACGCGCGG
CTGCACGAGA AGCAGGCCGC GATGAGCGAC GCCCTCCAGC GCAGCCTGCT GCCGGCCAAG
GAGAAGGAGC CGGTGATCCC GAACGTGGAC CACGCGGTGT TCTACCGTCC GGCCGACGAG
CGCAACGTGG TGGGCGGGGA CTTCTACGAC GTGTTCGCGG CGAGCGGCCG GTGGTGCTTC
GCCATCGGCG ACGTGTGCGG GACGGGCCCG GAGGCGGCGG CCGTGACCGG TCTGGCGCGG
CACACGCTGC GGGCGCTGGC CAAGGAGGGG TTCACGCCCT CGCACATCAT GCAGCGGCTG
AACATGGCGA TCCTGGACGA GAACACCTCG ACCCGGTTCC TGACGATGCT GTACGGGGAG
ATGACCCCGG CCACGGACGG CGGGGGCGGG ATGCGGCTGC GGATGGTCTC GGCCGGGCAC
CCGCTGCCGC TGCGGCTGAA CCAGAAGGGC GAGGTACTGC CGTTCGGCGC CTCGCAGCCG
CTGCTGGGCG CGTTCGAGGA CGTGGGGTTC ACGACCGAGA ACGTGGACAT CCGCCCCGGC
GAGGTGGTGC TGGCGGTGAC CGACGGCGTG ACCGAGCGGC GCAGCAACAC GGACATGCTC
GGCGACGAGG GGCTGATGGA GATCTTCTCC GGCTGCGCGG GGCTGACCGC CCAGGCGGTG
ATCAGCCGCA TCGACCGGGA GCTGGAGGAG TACGCCCCGG GCGGGCGCAG CGACGACACC
GCGATGCTGG TGCTGCGCTT CCTCTAG
 
Protein sequence
MPAHDASKVA RREFPPAPET AAAAREFVHD TLLSWGVTDP SDDVILLVSE LVTNAVIHAR 
SSLDVTVRRS EGATEVMVTD SVPERAVPQA GPLSVDTSPS RENGRTGGLG LALASAIASS
WGVSYGRNDK AVWFRIEDDS GERASLESPP VRRSPSRTGR PAPWSALDSA LGSRLSLPQL
LERTVEHSAT ALGADAAYIT LATSDETMWE VRAAVGLTPG GAPWRPLRVR TEEIFPSSAP
EPGAVINDDL MIARASRGRL SRAGMRSLVT APLIVDGRVV GLLGVASRRA RHFGASAAKR
LQEGANLIAL PVERARLAEV ELSRRASLSF LAEASDLLAG TLDERMTGAL AAQLITSRLG
RWCAIETINE MGVSQLTHVV HSDENYNDIL RALLTGLPPH EQREPQPLWS PAELAKEGLD
EQLVRELASG PAICVPLVAH GRALGRMTIG KNESDDFTRE EVDVADDLSS RVASAMENAR
LHEKQAAMSD ALQRSLLPAK EKEPVIPNVD HAVFYRPADE RNVVGGDFYD VFAASGRWCF
AIGDVCGTGP EAAAVTGLAR HTLRALAKEG FTPSHIMQRL NMAILDENTS TRFLTMLYGE
MTPATDGGGG MRLRMVSAGH PLPLRLNQKG EVLPFGASQP LLGAFEDVGF TTENVDIRPG
EVVLAVTDGV TERRSNTDML GDEGLMEIFS GCAGLTAQAV ISRIDRELEE YAPGGRSDDT
AMLVLRFL