Gene Ndas_5463 details


Gene Information

Locus tag: Ndas_5463
Symbol:
ID: 9249366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 651604
End bp: 652866
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: putative RNA polymerase, sigma-24 subunit, ECF subfamily
Protein accession: YP_003683348
Protein GI: 297564375
COG category:
COG ID:
TIGRFAM ID:
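
The length fields in this record follow from the coordinates: with 1-based inclusive positions, the gene length is end minus start plus one, and the protein length is the codon count minus the stop codon. A minimal Python sketch of that arithmetic is shown here; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 651604
END_BP = 652866


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1263 420
```

Both values agree with the Gene Length and Protein Length fields listed in the record.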


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.37967
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCACA CCGACGCCAC GCGCGCGGTC GAGGCGGTCT GGCGTATCGA GTCGGCCCGT 
CTCGTCGCAG CGCTCACCCG CATGGTCGGC GACGTCGGCC TCGCCGAGGA GTTCGCCCAG
GACGCGCTGG TCAGCGCACT GGAGAAGTGG CCCGGCGACG GGGTTCCCGA CAACCCCGGC
GCCTGGCTGA CCACCGTCGC CAAGCGCCGC GTGCTCGACC GCTGGCGCCG CGACGAGCGC
TTCCAGCGGC GCATGGCCGA CCTCGGCCGC GAGATCGCCG AGCACGACGG GCAGGCCGAG
TTCGACGCCG TCCTGGAGGA GGACTTCGGA GACGACCTCC TGCGGCTGAT GTTCGTGTGC
TGCCACCCGG TGCTGTCCAC CGAGGCCCGC GTCGCGCTCA CCCTGCGCCT GCTCGGCGGG
CTCACCACCG ACGAGATCGC CCGGGCCTTC CTGGTGCCCG AGTCCACCGT CGCCCAGCGC
ATCGTGCGCG CCAAGCGCAC CCTGGCCAAG AGGAAAGTGC CCTTCGAGGT GCCCGTCGGC
GAGGACCGCG ACGCCCGGGT GGCCTCCGTG CTGGAGGTCG TCTACCTGGT GTTCAACGAG
GGCTACACCG CCACCTCGGG TACGGAGTGG ACGCGCCCCA CGCTGTGCGA GGAGGCCATG
CGCCTGGGCC GCGTGCTCGC CCAACTGCTG CCCGGGGAGT GCGAGGCGCA CGGGCTGGTG
GCGCTGATGG AGCTGCACGC CTCGCGGCTG CGCGCCCGCG TCGGGCCGGG CGGGGAACCC
GTCCCCCTGG CCGAGCAGAA CCGCGCGCTC TGGGACCGCC TGCTCATCAC GCGCGGGATG
GAGGCCCTGT TCAGGGCACT GCCGCCGGAC CGGAGCCAGC CCGGCGGTCC CTACGTGCTC
CAAGCGGCCA TCGCGGCCGA ACACGCCAGG GCGGTCACCG CCGCGGACAC CGACTGGACG
GTCATCGCCG GGCTGTACCT GGCCCTGGTC AGGGTCACCG GCTCGCCGGT CGTGGAGCTG
AACCGGGCGG TGGCGGTGTC GATGGCCTCG GGCCCGGAGA CCGCGCTGGA GATCGTGGAC
GCGCTGCGGG ACCAGCCGGG GATGGGCGAC TACCACCTGC TGCCCTCCGT GCGCGGCGAC
CTGCTCGTGC GGCTGGACCG CCGGGCCGAG GCCCGCGCCG AGTTCGAGCG CGCGGCCTCC
CTGACCCGCA ACGAGCGCGA GCGCTCGCTG CTCCTGGACC GCGCCCGCGG CTGCGAGGAC
TGA
 
Protein sequence
MSHTDATRAV EAVWRIESAR LVAALTRMVG DVGLAEEFAQ DALVSALEKW PGDGVPDNPG 
AWLTTVAKRR VLDRWRRDER FQRRMADLGR EIAEHDGQAE FDAVLEEDFG DDLLRLMFVC
CHPVLSTEAR VALTLRLLGG LTTDEIARAF LVPESTVAQR IVRAKRTLAK RKVPFEVPVG
EDRDARVASV LEVVYLVFNE GYTATSGTEW TRPTLCEEAM RLGRVLAQLL PGECEAHGLV
ALMELHASRL RARVGPGGEP VPLAEQNRAL WDRLLITRGM EALFRALPPD RSQPGGPYVL
QAAIAAEHAR AVTAADTDWT VIAGLYLALV RVTGSPVVEL NRAVAVSMAS GPETALEIVD
ALRDQPGMGD YHLLPSVRGD LLVRLDRRAE ARAEFERAAS LTRNERERSL LLDRARGCED
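
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following Python sketch strips the whitespace, computes a GC fraction, and checks that a sequence looks like a complete CDS. The helper names are hypothetical, and the start-codon set is an assumed subset (translation table 11 permits further alternative starts). Note that this gene begins with GTG, which is read as methionine when used as the initiation codon, consistent with the protein starting with M.

```python
# Assumption: a common subset of bacterial start codons; table 11 allows others.
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons of translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if seq is a whole number of codons with a start and a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First line of the gene sequence as printed in this record.
first_block = "GTGAGCCACA CCGACGCCAC GCGCGCGGTC GAGGCGGTCT GGCGTATCGA GTCGGCCCGT"
cleaned = clean_sequence(first_block)
print(len(cleaned))
print(round(gc_fraction(cleaned), 2))
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value to compare against the GC content field, and `looks_like_complete_cds` can confirm the GTG start and TGA stop.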