Gene Ndas_2569 details


Gene Information

Locus tag: Ndas_2569
Symbol:
ID: 9246420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3064116
End bp: 3065522
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: RNA polymerase, sigma 70 subunit, RpoD subfamily
Protein accession: YP_003680494
Protein GI: 297561520
COG category:
COG ID:
TIGRFAM ID:
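
The length fields can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state its convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 3064116
end_bp = 3065522

gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # prints: 1407 468
```

Under these assumptions the computed values agree with the Gene Length (1407 bp) and Protein Length (468 aa) fields.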


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.152283
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAGCA CTGCCCCAGC CGCTCCGGCA CCCACCGCCG GAGCCCCCGC TGACACCGCT 
CGCAGCGACG ACGCGTCCGC CGACGCGGTC CTGACCGACC TGCTCGACCG CGGACGCGCC
CAGGGGCACC TGTCCCTCTC CGAACTGCGT GCCGCTTTCA CCGCGGCCGC CGTCTCCACC
GGCGACGGCC GTTCCATCCT GCGTGAACTC ACCGAGAACG GGGTCAGGCT CGCCAACGGG
GGCGACGACT CCGCACCGGT GGCCGATCCC GACGCCGAGG ACGACGTGCT GGAAGACACT
CTCATCGCCA CGGTGCCCGA CACCGACGAG GCACCCAGGG CCGAAGGCAA GACCGCGCCG
CGCAAGGGCC CCTCCCCCCG GAGGAAGACC CGCTCCACAC GGCGCGTCAA GAAGGCCGAG
CCCGCCCAGA CCACCACCGA CGCGGAGACC GGCGAGGCCG ACCTCGACGA CCAGTCCCCC
GCCATGGGCG ACTCGGTCCA CACCTACCTC AAGGCCATCG GCCGACGCCA GCTCCTCACC
GCGGAGCAGG AGGTCGACCT GGCCAAGCGG GTCGAGGCCG GGCTCTACGC CGAGTACCGG
CTCGGCCTGC ACGGCGAGGT GGACGGCTCC GCCCCGCTGG CCGAGGCCGA GGTCGAGGAG
CTGGAGTGGG TGGCCGAGGA CGGCCGCAAG GCCAAGTCCC ACATGCTGGA GGCCAACCTG
CGCCTGGTGG TGTCGGTGGC CAAGAAGTAC AGCGACCGGG GGATGTCGCT GCTCGACGTG
GTCCAGGAGG GCAACCTCGG CCTGATCCGC GCCGTGGAGA AGTTCGACTA CACCAAGGGC
TTCAAGTTCT CCACCTACGC CATGTGGTGG ATCCGCCAGG CCATCCAGCG CGGTTTCGCC
GACTCCGCGC GCACCATCCG CCTGCCCGTG CACGTCCTGG AGCTGCTGAG CAAGGTCAGC
CGCCTGGAGC GCGACATGCA CCAGGCGCTG GGCCGCGAGC CCACGCCGGA GGAGCTGGGC
CTGGAGCTGG ACAAGACCCC GGCCCAGATC GAGGAGCTGC TGCGGGTCAC CCGCCAGCCC
ATCAGCCTGG ACTCCACGAT CGGCGAGGAC GGCGAGACCC GCATCGGCGA CCTGATCGAG
GACGTGGACG CCTCGGAGGC CTCCGAGGTG GTGGACCGCC AGCTCATGGC CGACCAGCTG
CGCAACGCGC TGTCGGACCT GGAGCCGCGC GAGGCCACCA TCATGTCCCT GCGCTTCGGC
CTCATGGACG GCCGTCCGCG CACCCTGGAC GAGATCGGCA AGCACCTGGG GCTGACCCGC
GAGCGCATCC GCCAGCTGGA GAAGCAGTCG CTGTCCAAGC TGCGCCACCC CAGCCGCGCC
CAGCAGCTGC TGGACTTCGC CAGCTAG
 
Protein sequence
MTSTAPAAPA PTAGAPADTA RSDDASADAV LTDLLDRGRA QGHLSLSELR AAFTAAAVST 
GDGRSILREL TENGVRLANG GDDSAPVADP DAEDDVLEDT LIATVPDTDE APRAEGKTAP
RKGPSPRRKT RSTRRVKKAE PAQTTTDAET GEADLDDQSP AMGDSVHTYL KAIGRRQLLT
AEQEVDLAKR VEAGLYAEYR LGLHGEVDGS APLAEAEVEE LEWVAEDGRK AKSHMLEANL
RLVVSVAKKY SDRGMSLLDV VQEGNLGLIR AVEKFDYTKG FKFSTYAMWW IRQAIQRGFA
DSARTIRLPV HVLELLSKVS RLERDMHQAL GREPTPEELG LELDKTPAQI EELLRVTRQP
ISLDSTIGED GETRIGDLIE DVDASEASEV VDRQLMADQL RNALSDLEPR EATIMSLRFG
LMDGRPRTLD EIGKHLGLTR ERIRQLEKQS LSKLRHPSRA QQLLDFAS
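
As a rough consistency check between the two sequences, the first and last lines of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the pipeline used to produce the record. It assumes the amino-acid assignments of the standard codon table, which translation table 11 shares (table 11 differs in its permitted start codons), and the `translate` helper is a hypothetical name introduced here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard amino-acid
# assignments, which translation table 11 also uses).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and the final 27 nt of the gene sequence, copied from the record.
first_block = "ATGACGAGCA CTGCCCCAGC CGCTCCGGCA"
last_block = "CAGCAGCTGC TGGACTTCGC CAGCTAG"

print(translate(first_block))  # prints: MTSTAPAAPA
print(translate(last_block))   # prints: QQLLDFAS
```

Both results match the corresponding ends of the listed protein sequence. The final 27 nt are in frame because the preceding 23 lines hold 60 nt each (1380 nt, a multiple of 3).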