Gene Ndas_3856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3856 
Symbol 
ID9247727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4629360 
End bp4630409 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content76% 
IMG OID 
Productputative anti-sigma regulatory factor, serine/threonine protein kinase 
Protein accessionYP_003681759 
Protein GI297562785 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.964763 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.561641 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGGG TCTGGGACCT GGCGGTCACC GACGCCACGC ACGTGTCCGA CGCGGCCCGG 
ACCGCCCGCA AGGCCGCGCT CGGCATCGGT CTGGACGGGG CCGCGGCGGA CGCCGCGGCC
CTCGCCGCGT CCGAACTGGC GACCAACATG TGCAGGTACG CCGGCGGTGG GCGCTTCGTT
GCGGAGGAGG CCAGGGAGCC GTGGGGAACG GCCCGGGCCC TGCAGATCCT CGCCCTGGAC
CACGGCCCCG GGATCCCCGA CGTCCCGCGC GCCATGGAGG ACGGCTTCTC CACCTCCGAG
GACTCCCTGG GGGCGGGGCT CGGCGCCTGC GCGCGCGCCG CCGACTTCTT CGAGGTCCAG
AGCGCTCCCG GGAGGGGGAC GGTCGCGGTG GCCCGGTTCG CCCGGCCCGC CGACCGCGCG
CTGTTCGGCG CCCGCGCGGC CACGGGCGGG ATACACGTCC CCCTGGGCGG CTACAAGCTC
TCCGGGGACG CCCTGGCCTT CCGCAGCGGC GGAGCCCTGC GCACGGTCAT GCTCGCGGAC
GGACTCGGGC ACGGTGAGGC CGCCTCGGAG GCCGCCGACG TGGCCTCGCG CTTCGTGACC
GGGAGCCGGG GAGGGACCCC CGACACGCTG TTGCGCGGGC TGCACGAGGC GCTGCGGCGT
ACGCGCGGGG CGGCCGTGGC GATCGTCGAG ATCGACGAGT CCCGGGGCCG CCTCACCTTC
TGCGGGGTCG GCAACATCGG CGCGCGCCTG TACCGGGGCG GGCGCTGGGA GACACTGCTC
TCCCAACCCG GGATCGTGGG GGCCTTCTCC CTGCGGACGC CGGTCCCGGC CCGGCGGGAG
TGGTCCCCGG GCGACATGCT CATCCTGCAC AGCGACGGGG TGCCGGGACG CTGGAAACCC
GAGGGCGTCG AGGCACTGCG CGGCCACGAC GCCGCGGTCG TGGCCGGAGC GGTGTTCCGG
GACTCGGGCA GCGCGGCCCG TCCGCTGCGC GACGACACCA GCGTCGCCGT GGTCACGCAC
ACCGCACGCC CTGACGGGGG AGGACGATGA
 
Protein sequence
MTRVWDLAVT DATHVSDAAR TARKAALGIG LDGAAADAAA LAASELATNM CRYAGGGRFV 
AEEAREPWGT ARALQILALD HGPGIPDVPR AMEDGFSTSE DSLGAGLGAC ARAADFFEVQ
SAPGRGTVAV ARFARPADRA LFGARAATGG IHVPLGGYKL SGDALAFRSG GALRTVMLAD
GLGHGEAASE AADVASRFVT GSRGGTPDTL LRGLHEALRR TRGAAVAIVE IDESRGRLTF
CGVGNIGARL YRGGRWETLL SQPGIVGAFS LRTPVPARRE WSPGDMLILH SDGVPGRWKP
EGVEALRGHD AAVVAGAVFR DSGSAARPLR DDTSVAVVTH TARPDGGGR