Gene Ndas_4416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4416 
Symbol 
ID9248291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5254623 
End bp5255783 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content76% 
IMG OID 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_003682311 
Protein GI297563337 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.495846 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTA TGGCCGGGCG GTGCGGCGCG CGCGACTGGA TCGTGGACTC CTCCATCTTC 
CTGATCGCCC TCCTCTCCGT ACTGATGAAC GCCGCCGAGT TCCCCCACCT GCCCGTCCGG
GAACCGGTCC TGCCCGTGTG GCTCAAGGCC GCCGACGCGG CCCTCTCCCT GCTGGCCTGC
CTGGCGCTGT GGTGGCGGCG GCGCTGGCCG GTCCAGATCG CCGTCGTGCT CGTCCTGTAC
TCCGCCGTCT CCGGTCTGGC CTCGGGCGCG ATGCTCATCG CCCTGTTCTC CCTGGCCGTG
CGCCGTCCGC CCCGCACCAG CCTGGCGGTG TACGGGCTGA GCGTGGGCGC CTCGCTCGTG
CACGCCGCGC TGTGGCCCGA CCCGCACGCC CCGTTCCTGG TGATCCTCCT GCTGGGGGCC
GCCCTCCAGG GCGCCGTGAC CGGCTGGGGG CTCACCGTCC AGCACCGGCG CGAGCTGGTG
GAGTCGCTGC GCGACCGGGC CCTGCACGCC GAGACGGAGG CGCAGCTGCG CGCCGAGCAC
GCCCAGCACC AGGTCCGCGA GGCCATGGCC CGCGAGATCC ACGACGTGCT CGGGCACCGG
CTGTCGCTGC TGAGCGTGCA CGCGGGCGCC CTGGAGTACC GGCCCGACGC CCCCGCCGAG
GAGGTGGCCC GGTCGGCGAA GGTGATCCGC GAGAGCGCCC ACCAGGCCCT CCAGGACCTG
CGGGAGGTGA TCGGCGTGCT GCGCGCGCCC GTCGGGGAGC TGCCGCAGCC GACCATGGCC
GACCTGCGGC AGCTGGTGGA GGAGGCCGAC GAGGCCGGGA CCCGGGTGGA GTTCGTGCAG
GAGTGCGCCG GGACGGTCCC CGAGCGCACC GGGCGCACCG CCTACCGGAT CGTCCAGGAG
GGGCTGACGA ACGTGCGCAA GCACGCCCCG GGCGCCACCA CGCGCGTACT GGTCCGGGGA
GCCCCGGGCG ACGGCCTGCT GGTGGAGGTG GGCAACGACC CCTCCCCCGG CGCCCCTCCC
GCGGCGTCGG GCGGGGACGG CGACGGCCAG GGCCTGGTCG GGCTCGCCGA GCGGGTGTCC
CTGGCCTCGG GGCGGCTGGA GCACGGCCCG GACGGTCGGG GTGGCTGGCG GCTGGCGGCA
TGGCTACCGT GGCCGACATG A
 
Protein sequence
MNAMAGRCGA RDWIVDSSIF LIALLSVLMN AAEFPHLPVR EPVLPVWLKA ADAALSLLAC 
LALWWRRRWP VQIAVVLVLY SAVSGLASGA MLIALFSLAV RRPPRTSLAV YGLSVGASLV
HAALWPDPHA PFLVILLLGA ALQGAVTGWG LTVQHRRELV ESLRDRALHA ETEAQLRAEH
AQHQVREAMA REIHDVLGHR LSLLSVHAGA LEYRPDAPAE EVARSAKVIR ESAHQALQDL
REVIGVLRAP VGELPQPTMA DLRQLVEEAD EAGTRVEFVQ ECAGTVPERT GRTAYRIVQE
GLTNVRKHAP GATTRVLVRG APGDGLLVEV GNDPSPGAPP AASGGDGDGQ GLVGLAERVS
LASGRLEHGP DGRGGWRLAA WLPWPT