Gene Ndas_3889 details


Gene Information

Locus tag: Ndas_3889
Symbol:
ID: 9247760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4661866
End bp: 4662948
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: putative sigma E regulatory protein, MucB/RseB
Protein accession: YP_003681792
Protein GI: 297562818
COG category:
COG ID:
TIGRFAM ID:
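
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `cds_span` and `expected_protein_length` are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record.
start_bp, end_bp = 4661866, 4662948

span = cds_span(start_bp, end_bp)
print(span)                           # 1083, matching "Gene Length"
print(expected_protein_length(span))  # 360, matching "Protein Length"
```

Under those assumptions the reported gene length and protein length are consistent with the reported coordinates.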


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGCGG AGGAACCCTC CTCCGCGCGC TCCGCCCCGG TCACGATCCT CGTCGTGGCA 
CTGCTGTGCG CGCTGCTGCT CACGGCCGCC GCGCACCCCC ATCCCACCGC CGTCCCGGCC
GAGGGGGAGG ACGACGGGAT GTCGGTGCTG CACCGCGCCG CCGCAGCGGA GGACGAGGTC
GCCTACTCGG CCGTGCGCGA GGTCACCGGA CCGGAGGACA CGGAGGGCGG GCAGTCCGCC
GAGGGCCGGA CGCTGCGCGT GCGCGTGGTG AACAGGCCCG GCGAAGGCAT CGCCCTGGCG
CCCGTCGGAG ACGAGGAGTC CGCCCTGGTG GTGGACGCGT CCTCGGCCCT GGAGTCGTTG
GACGACCGCC TGCTCAGCAT GCTCGGCGAC ATCTACGCCG TCTCGGACGC GGGCCCCGCC
CGCCTGGACG GCCGGGAGGC CCGCCTGGTG GAGGCCAGGC ACGCCGACGG CACCGTCGCC
GGGCGGTTCT GGGTGGACAC CGCCACCGGC CTGCTCCTGG GCCGGACCGT CTACGGCACC
GGCGGCGAGC ACGCGATCGG CTTCCGCCTC ACGGGGCTCG AACTGGGGGA GGAGGACTGG
CCGGAGGAGG CGCTCGGAGA CTCCCCCTGG AGCGACACCC TCACCCGCAC CGAGCGCGCG
GACCTGCGCG CGGAGGAGTG GCCCCTCCCG GAGTACCTGG CCTGGAACCT GCGGCTGGTC
GACGCCCGGT CCACCGAGCA CGGCGGGCAC CGCGTGGTGC ACGCCGTCTA CTCGGACGGT
CTGTCCCAGG TGTCGGTTTT CACCCAACGT GGGAAGCTGG GCAGCAAGCA TTCCCCCACA
GAACCGAACG GATACGCCGG AACCGGGACG GGGGGAAGCG GCGTCACACC ACAACACGGC
ACGATCTTCG GCGGTGACGC GGGCCAGTAC CAGAGCATGT GGCAGGCGAA CGGCTTCGTC
TACACGGTGC TCGCGGACGC CCCCGCGGGG CTGGCCTCGT CCGCCGTGTC CGCGCTGCCC
GGGCCGGGTT CGGGTTTCTG GGCCCGCGTG CACCGCGGTC TGTCCCGGCT GGGGTTCCTC
TAG
 
Protein sequence
MSAEEPSSAR SAPVTILVVA LLCALLLTAA AHPHPTAVPA EGEDDGMSVL HRAAAAEDEV 
AYSAVREVTG PEDTEGGQSA EGRTLRVRVV NRPGEGIALA PVGDEESALV VDASSALESL
DDRLLSMLGD IYAVSDAGPA RLDGREARLV EARHADGTVA GRFWVDTATG LLLGRTVYGT
GGEHAIGFRL TGLELGEEDW PEEALGDSPW SDTLTRTERA DLRAEEWPLP EYLAWNLRLV
DARSTEHGGH RVVHAVYSDG LSQVSVFTQR GKLGSKHSPT EPNGYAGTGT GGSGVTPQHG
TIFGGDAGQY QSMWQANGFV YTVLADAPAG LASSAVSALP GPGSGFWARV HRGLSRLGFL
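
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11 (the bacterial code), where GTG can act as an initiation codon. The following is a minimal sketch of that translation, not the tool used to produce this record. The function name `translate_cds` is hypothetical, and the start-codon set is the classic list for table 11 and is stated here as an assumption. The amino acid string follows the NCBI ordering of codons over the bases T, C, A, G.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for translation table 11 (same as the standard code),
# listed in NCBI codon order: TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: classic initiation codons for table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace is ignored and translation stops at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiation codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "GTGAGCGCGG AGGAACCCTC CTCCGCGCGC TCCGCCCCGG TCACGATCCT CGTCGTGGCA"
print(translate_cds(first_line))  # MSAEEPSSARSAPVTILVVA
```

The output matches the first 20 residues of the protein sequence listed in this record.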