Gene Ndas_3312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3312 
Symbol 
ID9247174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3954517 
End bp3956232 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content69% 
IMG OID 
ProductRNA polymerase, sigma 70 subunit, RpoD subfamily 
Protein accessionYP_003681224 
Protein GI297562250 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCATCTG CCAGTTCGAC TCGCTCGAAG CAGTCCGAGT CACTGCAAGA GCCCGTCATT 
CAGCAGCTGA TCGAGCGTGG GCGGTCCCAG GGCTTCCTTG AGCCCGAGGA CGTGCGCCGT
GCCTTCGAAG AGGCCGAAAT CCCGATGTCG CAGGCGCAGT CCGTGCTCCG CAGCCTCGCG
AAGGAGGGCG TGACCGTACT CGTTCCCGAG TCGGCCTCCT CCCGGCGCAA GGCGCCGCGG
CGCAAGGCGG CGACCACCAA GACGGCGGCC ACGCGGTCCA CCTCGACCAG GTCCACCAAG
ACCACCGCGA CGAGCAGGGC CGCCAAGCCC GTGCGCGCGG CGGCGGCCCA GGAGCAGACC
GAGACGGTCA CCGCCGTGGT CGGTTCCGCC GAGGACGCCG GGAACGAGGC CGGGGCGGAG
AAGAAGCCCG CGGCCAAGAA GCCCGCCAAG AAGACGGCGA CCAAGAAGCC CGCGACCAAG
GCGGCCAAGA CCACCGCGGC CAAGGGCCCG ACCGCCAAGA CCACCGGCCT CAAGGCCGCC
GCCGCCAAGA AGACGGCGAC CAAGGAACTC GGACTCGCGG CGGACGGCGA GTTCGACGAG
GACGAGGACG GCCTGGACGA CCTCGAGCAC ACCGGCGCCG AGCTGGAGCT GGTCGAGGAC
ACCCCGGACC CGGCGGACAA GGACCTCAAG CCCGCCAAGC CCGAGGCCGT GGGCGCCCCG
GCCAAGCCCG CCAACGAGGA CGAGTCCTTC GTCCTCTACG ACGACGATGA CGACGCCCCC
GCGGCGCAGG TCGTGGCCGC GGGCGCGACG GCGGACCCGG TCAAGGACTA CCTCAAGCAG
ATCGGCAAGG TCCCGCTGCT CAACGCCGAG CAGGAGGTCG AACTCGCCAA GCGGATCGAG
GCCGGCCTGT TCGCCGAGGA GAAGCTGGCC GAGGAGGCCG AGCTCCTCAC CGTCGAGCTG
CGCGACGAGC TGGAGTGGAT CGCCGAGGAC GGCGGCCGCG CCAAGAAGCA CCTGCTGGAG
GCCAACCTCC GGCTCGTGGT CTCGCTCGCC AAGCGCTACA CCGGCCGCGG CATGCTCTTC
CTGGACCTGA TCCAGGAGGG CAACCTCGGT CTGATCCGCG CGGTGGAGAA GTTCGACTAC
ACCAAGGGCT TCAAGTTCTC GACCTACGCC ACGTGGTGGA TCCGCCAGGC GATCACCCGG
GCCATGGCCG ACCAGGCGCG CACCATCCGC ATCCCGGTGC ACATGGTCGA GGTCATCAAC
AAGCTGGCCC GCGTCCAGCG CCAGATGCTC CAGGACCTGG GCCGCGAGCC CACCCCGGAG
GAGCTGGCCA GGGAACTCGA CATGACCCCG GAGAAGGTCG TCGAGGTGCA GAAGTACGGC
CGCGAGCCGA TCTCCCTGCA CACCCCGCTG GGCGAGGACG GCGACAGCGA GTTCGGCGAC
CTCATCGAGG ACTCCGAGGC GATCCAGCCG GGCGAGGCGG TCAGCTTCAC CCTGCTCCAG
GAGCAGCTGC ACTCGGTGCT GGACACGCTG TCCGAGCGCG AGGCGGGCGT GGTGTCCATG
CGCTTCGGTC TCACCGACGG CCAGCCGAAG ACTTTAGACG AGATCGGCAA GGTCTACGGG
GTCACCCGTG AGCGCATCCG GCAGATCGAG AGCAAGACGA TGTCGAAGCT CCGCCACCCG
TCGCGTTCGC AGGTGCTCCG CGACTACCTG GACTAG
 
Protein sequence
MSSASSTRSK QSESLQEPVI QQLIERGRSQ GFLEPEDVRR AFEEAEIPMS QAQSVLRSLA 
KEGVTVLVPE SASSRRKAPR RKAATTKTAA TRSTSTRSTK TTATSRAAKP VRAAAAQEQT
ETVTAVVGSA EDAGNEAGAE KKPAAKKPAK KTATKKPATK AAKTTAAKGP TAKTTGLKAA
AAKKTATKEL GLAADGEFDE DEDGLDDLEH TGAELELVED TPDPADKDLK PAKPEAVGAP
AKPANEDESF VLYDDDDDAP AAQVVAAGAT ADPVKDYLKQ IGKVPLLNAE QEVELAKRIE
AGLFAEEKLA EEAELLTVEL RDELEWIAED GGRAKKHLLE ANLRLVVSLA KRYTGRGMLF
LDLIQEGNLG LIRAVEKFDY TKGFKFSTYA TWWIRQAITR AMADQARTIR IPVHMVEVIN
KLARVQRQML QDLGREPTPE ELARELDMTP EKVVEVQKYG REPISLHTPL GEDGDSEFGD
LIEDSEAIQP GEAVSFTLLQ EQLHSVLDTL SEREAGVVSM RFGLTDGQPK TLDEIGKVYG
VTRERIRQIE SKTMSKLRHP SRSQVLRDYL D