Gene Ndas_1885 details


Gene Information

Locus tag: Ndas_1885
Symbol:
ID: 9245735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2296055
End bp: 2297374
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: putative RNA polymerase, sigma-24 subunit, ECF subfamily
Protein accession: YP_003679819
Protein GI: 297560845
COG category:
COG ID:
TIGRFAM ID:
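The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; both are assumptions about this page's conventions, not something the page states.

```python
# Values copied from the Gene Information fields.
start_bp = 2296055
end_bp = 2297374

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1320
print(protein_length)  # 439
```

Under these assumptions the results match the reported Gene Length (1320 bp) and Protein Length (439 aa).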


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.248059
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.283313
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGACC GCGACCGTGC GCAGCGGCGC CCGCCGGTGG CGGACGGCCA GCGCGACGGC 
GGGCGGCGGC CCCCCGGCAC GGACGCGGAC ATCGAGCACC TGCTGCGCAC CGAGGCGCCG
CAGGTGCTCG GCGCCCTGGT GCGGCGCTTC GGCCGGTTCG ACGCCGCTGA GGACGCGGTG
CAGGAGGCGC TGCTCGCCGC GAGCCGGGCC TGGCCCGCCG ACGGCGTGCC GGAGAACCCG
CGCAGCTGGC TGATCCGCGT CGGCTACCGG CGCCTGGTCG ACCTCCTCCG CGCCGAACAG
GCCAGGCACC GGCGCGAACA GGAGATCGGC GCCGCCGAAC TGGCCATGCG GGAGCCGGAC
CGGAGGGCGG GCCCCGCCCG GGAGAGCGAC GACAGCCTGG CCCTGCTGTT GCTGTGCTGC
CACCCCGCGC TGAGCGCCGC CTCCCAGGTG GCGCTCACGT TGCGCGCGGT GGGCGGCCTG
ACCACCGCCG AGATCGCGCA CGCCCACGGG ACCTCGGAGA ACACCATGGG CACGCGGATC
AGCCGCGCCA AGCAGCAGCT GGCCCGGGTC GGAGCCCGTG TCACCCCGCC GACCGACGCC
GACCGCGACA GCCGGATCAC GGCGGTGGCG AAGGTGCTCT ACCTGGTCTT CAACGAGGGT
TACACGACCT CCGAGGGCGA CCAGCTCGCC CGCGTGGACC TGACCGGCGA GGCCATCCGG
CTGACACGCA TGCTCCACGA CTCTCTGCCC GACGACGCCG AGGTCACCGG CCTGCTCGCG
CTCATGCTGC TCACCGAGGC GCGCCGTCCC GCACGCACCG GCGACCACGA CGAGCTGGTG
CCCCTGGACG AACAGGACCG GTCGTTGTGG AACGCCGACC TCGTCCGCGA GGGCACCGCG
CTGATCGACG GCGTGTGGAA CCGCGGTGAG GTCGGCCCCT ACCAGTTGCA GGCGGCGATC
GCGGCCGTGC ACGCGGCGGC CCCGGCGCCG GAGCGGACCG ACTGGGTGCA GATCGCGGTG
CTCTACCTGT GGCTCGAACG GCTCAGCCCC ACCGCTCCCG TGCGGCTGAG CCGGGTGGTG
GCGGTGGCCA AGGCGTACGG CCCGGCGCGG GGACTGGCCC TGCTGGACGA CCTCGACCGA
CGCTTCGGGC TCGGCCGGGA CCCCCTGACC CGGCAGCGCG AACGCGCGGT GCGCGCTCAC
CTGCTGGAGA GGACCGGGGA GGGGGAGGGC GCGGCGGCGC TGTACCGGGA GGCGGCCTCC
CTGACCGGCA ACCGGGTCGA GCGGCGGTTC CTGCTGGACC GCGCCGACCG CCTCGGTTGA
 
Protein sequence
MNDRDRAQRR PPVADGQRDG GRRPPGTDAD IEHLLRTEAP QVLGALVRRF GRFDAAEDAV 
QEALLAASRA WPADGVPENP RSWLIRVGYR RLVDLLRAEQ ARHRREQEIG AAELAMREPD
RRAGPARESD DSLALLLLCC HPALSAASQV ALTLRAVGGL TTAEIAHAHG TSENTMGTRI
SRAKQQLARV GARVTPPTDA DRDSRITAVA KVLYLVFNEG YTTSEGDQLA RVDLTGEAIR
LTRMLHDSLP DDAEVTGLLA LMLLTEARRP ARTGDHDELV PLDEQDRSLW NADLVREGTA
LIDGVWNRGE VGPYQLQAAI AAVHAAAPAP ERTDWVQIAV LYLWLERLSP TAPVRLSRVV
AVAKAYGPAR GLALLDDLDR RFGLGRDPLT RQRERAVRAH LLERTGEGEG AAALYREAAS
LTGNRVERRF LLDRADRLG
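
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares), and treating an initial GTG or TTG as methionine is a simplifying assumption that covers only some of the alternative start codons table 11 allows.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a simplified subset of the alternative start codons of table 11.
ALT_STARTS = {"GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in ALT_STARTS:
            amino_acid = "M"  # an alternative start codon is read as methionine
        else:
            amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above (60 bases, 20 codons).
fragment = "GTGAACGACC GCGACCGTGC GCAGCGGCGC CCGCCGGTGG CGGACGGCCA GCGCGACGGC"
print(translate(fragment))  # MNDRDRAQRRPPVADGQRDG
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the listed protein sequence and the reported GC content.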