Gene Ndas_0869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0869 
Symbol 
ID9244714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1064354 
End bp1066309 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content76% 
IMG OID 
Productputative RNA polymerase, sigma-24 subunit, ECF subfamily 
Protein accessionYP_003678819 
Protein GI297559845 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.456252 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.192635 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGATG TCCACCGCAC CGTCGACGCG GTCTGGAAGA TGGAGTCGGC GCGGATCGTC 
GCCTCGCTCA CCCGGATCGC GCACGACGTC GGCGCCGCCG AGGAGTGCGC CCAGGACGCC
CTCGTCGCCG CCCTGGAGCA GTGGCCGCGT GAGGGGATCC CCGACAACCC CGGCGCCTGG
CTCATGACCG CCGCCAAACG CCGCGTCCTG GACCGCCTGC GCCGGGAGCA CCGCCTGGAG
AGCAAGCACA AGGAGATCGC CCACGAGCTG GAGCGCGCTC CCGGGGTCGC GCCCGCCCCG
GACGACGGCG TCCTGCGCCT GCTGTTCGCG ACCTGCCATC CGGTGCTGTC CACTCCGGAG
AGGGTCGCGC TGACCCTGCG CCTGGTGGCC GGGCTGACCA ACGGGGAGAT CGCCCGCGCC
TTCCTCACCG GGGAGGGGCG CATCGCCCAG CGCGTGGCGC GGGCCAAACG GCTGCTCGCC
GAGGAGGGGG TGGCCTTCGG GCTGCCCGAC GGACGGGAGC TGGCCGAACG CCTCTCCTCC
GTCCTCGGCG TCATCTACCT CGTCTTCAAC GAGGGCTACG CGGCGACCTC GGGCGAGGAC
CTGATGCGCC CCGGCCTGTG CCTGGAGGCG CTGCGGCTGG GCCGCACCCT GGCCGAACTG
GTGCCGCACG AGGCCGAGGC GCACGGCCTG GTGGCGCTGA TGGAGCTCCA GCAGTCCCGG
GCCGGGGCCC GGACCGGCCC CTCCGGCGAG ATCGTCCGAC TGCACGAGCA GAACCGGGGC
CGCTGGGACC CCCTGCTGGT GCGCCGGGGC TTCGCCGCCA TGCTGCGCGC CCGTGACGCG
GGCGGCCCGC CCGGCCCCTA CGTGCTCCAG GCGGCCGTCG CCGTGTGCCA CGCCCGGGCC
ACCAGCGAGC AGGACACCGA CTGGGCCCGT ATCGCCGCCC TCTACGACCA GCTGGTCGTC
CTACTCCCCA CCCCGGTCGT GCGCCTGAAC CGGGCCGTGG CGGTCGGCAG GGCCCGCGGG
CCGGGGGAGG GACTGGCCCT GGCCGACGAG CTGGCGGAGG ACCCGGTCCT GCGCGACTAC
CACCTGTTGC CCGGCGTGCG CGGCGACCTG CTCCTGCGGC TGGGCCGGGC CGCCGAGGCG
AAGCGGGAGT TCGAGCGCGC CGCCTTGCTG GCCGAGAACA CCGCCGAACG CGCCTTCCTG
TCGCGCCGGG CAGAGGAGAC CGCGGTCCCC GAGCCCGCCG GGCCCGACCT GGGCGCGACG
GCCCGGGAGT TCCTGGGCCG CGACGACCTG GACCCCCAGA CGCTGCGCTC CTACGGCCAG
ACCCTGGACC GGCTGTGCCG TTCGCTGGGG GAGGGTCTGC CCCTGGCCGA CCTGACCCCC
GAACGGGTGG CGGGCGTGTT CGCCACCGCC TGGGGCGGTG CGGCCCCGCG CACGTGGAAC
CGGCACCGGT CGACCGTGCG CTCCTTCGGC GCCTGGGCGG GGCTGGAGGA TCTCGCGGCG
GACCTGGAAC GGCGCGGCGA GACCCGCTCA CCGCACGTGC CGCTCGACCC CGAGACCGTG
GCGCGCCTGT GCGACGGGGA GGGTTTCGCG CTTCGCGAGC GCGTGCTGTG GCGGCTGCTG
CACGAGTCCG GCGCCCGGGT GAATTCCGTG CTGGCGCTCA ACGTGGAGGA CCTGGACCTG
GAGGACCGCC GTGCGCGGGC GGGCGACGGC TGGGTGGGCT GGAGGTCGGG GACCGCCCGG
CTGCTGCCCG AGCTCGTGGC GGGGCGCGAG CGGGGGCCGC TCCTTCTGGC AGACCGGCGT
CCGGGACCGG CGAGGCGCCC CGCCGCGGCC GACCTGTGCC CGCTGACGGG GCGAGGGCGC
CTGTCCTACC CGCGTGCGGA GTACCTGTTC AAGCGGGCCA CGCGCTCCCT GGATCCGGCG
GGGCGGGGCT ACACCCTGAG CAGGCTCCGG CCCTGA
 
Protein sequence
MTDVHRTVDA VWKMESARIV ASLTRIAHDV GAAEECAQDA LVAALEQWPR EGIPDNPGAW 
LMTAAKRRVL DRLRREHRLE SKHKEIAHEL ERAPGVAPAP DDGVLRLLFA TCHPVLSTPE
RVALTLRLVA GLTNGEIARA FLTGEGRIAQ RVARAKRLLA EEGVAFGLPD GRELAERLSS
VLGVIYLVFN EGYAATSGED LMRPGLCLEA LRLGRTLAEL VPHEAEAHGL VALMELQQSR
AGARTGPSGE IVRLHEQNRG RWDPLLVRRG FAAMLRARDA GGPPGPYVLQ AAVAVCHARA
TSEQDTDWAR IAALYDQLVV LLPTPVVRLN RAVAVGRARG PGEGLALADE LAEDPVLRDY
HLLPGVRGDL LLRLGRAAEA KREFERAALL AENTAERAFL SRRAEETAVP EPAGPDLGAT
AREFLGRDDL DPQTLRSYGQ TLDRLCRSLG EGLPLADLTP ERVAGVFATA WGGAAPRTWN
RHRSTVRSFG AWAGLEDLAA DLERRGETRS PHVPLDPETV ARLCDGEGFA LRERVLWRLL
HESGARVNSV LALNVEDLDL EDRRARAGDG WVGWRSGTAR LLPELVAGRE RGPLLLADRR
PGPARRPAAA DLCPLTGRGR LSYPRAEYLF KRATRSLDPA GRGYTLSRLR P