Gene Ndas_0406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0406 
Symbol 
ID9244244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp498135 
End bp499391 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content73% 
IMG OID 
Productserine/threonine protein kinase 
Protein accessionYP_003678360 
Protein GI297559386 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCCGT GGATAACCGT GTCCTCCCTA GAGAACCCCC ACACGCAGCG CCCCGAGGCG 
TCGCGCTCCT CCGCACGGGC GGAGGCGGCC GACGCCGCCG AACCCCGTCC CGAGGGACGC
CTGCTCAAGG GGCGCTACCA GTTGGTGTCG GAGATCGCCC GCGGCGGTGT CGGCACGGTC
TGGCGCGCCA CCGACCTGGT CATCGACCGC GAGGTGGCCG TCAAGGAGCT GCGCCTGCCC
GCGGACCTGA GCCGCTCCGA GCGCGAGTCG CTCCTGCAGC GCACCACCCG TGAGGCCCGC
GTCGCCGGAC GCCTCACCCA CCCCAGCCTG GTCACCGTCC TGGACGTCGT GGACGAGGAC
GACCGCCCCT GGATCGTCAT GGAGCTGGTG GAGGCCTCCA CCCTGGAGGA ACTCATCCAG
GTGGGCGGCC CGCTGCCCTA CCAGCGCGTG GCCGAGATCG GCCTCCAGCT CATCGACGCC
CTCAAGGTCG CGCACGCCGA GGGCATCGTG CACCGCGACG TCAAGCCCGA CAACGTGATG
ATCAGCCAGG CCGGTCGGGT CGTGCTCACC GACTTCGGGC TGGCCGCCTG GAACGGCGAG
TCGGCGCTGA GCGCGTCCGG GCGCATCATC GGCTCCCCGG CCTACATCCC GCCGGAGCGG
GCCAAGGCCG GCCCGGTGGG ACCCGAGTCC GACCTGTGGT CCCTGGGCGC CACCCTGTAC
GCGGCGGTGG AGGGGCACCC TCCCTACGAC CGCAAGGGCT ACATCAAGAT CCTGCGCCAG
GAGCAGCTGG ACGAGCCCGC CGAGGCCGCC AGCGCGGGCC CGCTGGCGCC GGTGCTGGCC
GGTCTGCTCC GGGTGGAGCC CTCGGAGCGG TTGACCGCCG AGAACGCCAC CAAGATGCTG
CGGATCGCCG CTCTGGCGCC GTGGGCGCCC GAGACGAGCC CGGAGACGGC CGCGCAGGGC
ACCGCCACGC CGGCCGAGCC CCGTCCCGAG GGCGAGGAGC GCGAGTCGGC CCGCGAGCAC
TTCAAGGCGG GCGCGCACGT GCTGGCGGAG TCCATCAACG AGCGGCTGAG CAACAGCCCC
GAGGCGATGA ACTCGATCAT GACCTCGATC CGCGAGTCCA CGGACACGCT GGGGCTGACC
AGTTCGGGCA AGCACGCCGA GCGCAGCCGC CTGCCCGCGG TGGCCACCAT CGCCGCGGTC
TCCCTCGGCG TCCTGGTGCT GCTCGCGGTC ATGCTCTGGG CGCTGCTCGG CCGCTAG
 
Protein sequence
MDPWITVSSL ENPHTQRPEA SRSSARAEAA DAAEPRPEGR LLKGRYQLVS EIARGGVGTV 
WRATDLVIDR EVAVKELRLP ADLSRSERES LLQRTTREAR VAGRLTHPSL VTVLDVVDED
DRPWIVMELV EASTLEELIQ VGGPLPYQRV AEIGLQLIDA LKVAHAEGIV HRDVKPDNVM
ISQAGRVVLT DFGLAAWNGE SALSASGRII GSPAYIPPER AKAGPVGPES DLWSLGATLY
AAVEGHPPYD RKGYIKILRQ EQLDEPAEAA SAGPLAPVLA GLLRVEPSER LTAENATKML
RIAALAPWAP ETSPETAAQG TATPAEPRPE GEERESAREH FKAGAHVLAE SINERLSNSP
EAMNSIMTSI RESTDTLGLT SSGKHAERSR LPAVATIAAV SLGVLVLLAV MLWALLGR