Gene Ndas_4005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4005 
Symbol 
ID9247877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4788900 
End bp4790858 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content73% 
IMG OID 
Productserine/threonine protein kinase 
Protein accessionYP_003681908 
Protein GI297562934 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.599637 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.402473 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCGTCGG ATGGGCTCCC GAAGAACCTG GAACCGCTGG CCGCCGGAGA TCCGGCCACC 
ATCGGTCCCT ACGTGCTGTC CGGGAAGCTG GGTAGCGGCG GCATGGGAAC CGTCTACCTG
GGCAGCACTC CCGAGCGCAA CAACCAGGTA GCCATCAAGG TCATCCGCCC GGAGCTCGCC
TTCGACGAGG CGACGCGGGC GCGCTTTCGC GACGAGATGG AGAACGCCCG CAAGGTCGCC
TCCTTCTGCA CCGCCAAGGT GCTCGACCAC GGGACGTTCG AGAACCGTCC CTACATGGTC
ACCGAGTACA TCGCGGGCAC CGCCCTGGCC GAGCACATCG CCGAGAACGG TCCGCTGGAC
TCCTCGACGC TGCACGGCTT CGCACTGGGC GTGGCGGCCG CCCTGGCCGC CATCCACCGC
ACCGGGCTGG TCCACCGCGA CCTCAAGCCC GCCAACGTGC TGCTGTCGCT CTCGGGGCCC
CGGGTGATCG ACTTCGGCAT CGCCCGCGCG ATGAACACCG CCACCAACCA CACCCAGACC
GGCATCGTCA TGGGCAGCCC CGGCTGGATG GCTCCCGAGC AGCTGCTGGA GGAGAAGGTC
ACCACCTCGG CGGACATCTT CGCCTGGGGC TGCCTGGTGG CCTTCGCCGG GAACGGCACC
CACCCCTTCG GCAACGGCGA CGCCATGACA TTGGGCAAGC GGGTGCTGTT CGCCGAACCC
CAGATCGGCA ACCTGATCAG CCCCCTGGAC CGCCTGGTGA CGCGCGCGCT GGCCAAGGAG
CCGGGCCGCC GCCCCACCGC CCAGGACCTG CTGTTGGAGC TGGCGGGCGG CGAGGACAAC
AGCAACCCCA ACGACATGGT GTCCCACGCG CTGCACCAGT CGTGGCGGCC CAACCTGCCC
CCGATGCCCC CGCACGGCAT GCCCCACCCC CAGCAGGGGC CGCACCAGAC CATGGCGGGC
ATGCGGCACC CGGCGCCCGG CCAGTACCAG GGCGCACCGC CCGCGCCCAT GCCCCCGCCC
GCGCACCAGA CGGGCAACTA CCCGCGCCCG CAGGGGCCGC CGCCGGGCCA TCCGGCCGGT
CCCCCGCCGG GGGGCCAGGC GCAGCAGGTG CACCAGGCCC CCGGGGTCCA GGCCCCGGGC
GGGCAGCCCC AGGGCGGCCC GGTGCAGCAG GCCCGGCCCG TGCCCGGCCC CGCCCAGGCG
CCGCACCCCG CCTCGACCGG GCCGATCCCG ATGGTCCCCC CGGCCGACCA GCAGACGGGC
CGCCGGGGTC CGCAGCCCTA CGTTCCGCCG GTGCCGCCGC CACCGCACCA GCCCGCCGCG
CCCAACCGGC GCAAGGGGAC GGTCGTGGCC CTGGTGCTGG GCGCCGTGGC GCTGCTGGCG
GCCCTGATCG TCATGGCGAC CGTGCTGACC AACCTCAGCG ACTGGTCGCT CTTCGGGGAC
GGCGACCCCC AGGGGGACTC CTCCGAGCAG ACGGGCGCCG CCCAGGAACC GCAGGAGCCC
GCGCCCGGCG ACGAGTCCCC CGGCGGCGCC GACGAGGAGG AGGCGCCCGG CGAGGTGCCC
AGCGGGATGT CGGGCGCCTC GGCCGACCGC ATGGTCGAGT ACCGGATCCG CGGCGTCAGC
TGCGGCCTGA CCGAGCACAA CATCCGCTCG GAGCTGCCCT CGACCGGCCA GTACTGCGTG
GTGGACCTGG AGCTGTTCAA CGTCAGCGAC GAACTGGTGA GGTTCGAGCA CACAGAGCAG
CAGATGACGA CCAGCGGGGA GCCCGTCAAC GCCCAGGCAC CGTCCGTGCG CGAGGTCGAG
GCCCCGCTGT GGGACCCGGC CGGGATCAGT CCCGGGGTCG CCGCCGGGGG CGAGCTGGTG
TTCATACTCC GCGACGACAT GGACCCGCGC ACGCTGGTGC TCAACCACCG GACGGGCGCG
GAGGCCACGG AGATCAACAT CGAGCACATC GTGGACTGA
 
Protein sequence
MPSDGLPKNL EPLAAGDPAT IGPYVLSGKL GSGGMGTVYL GSTPERNNQV AIKVIRPELA 
FDEATRARFR DEMENARKVA SFCTAKVLDH GTFENRPYMV TEYIAGTALA EHIAENGPLD
SSTLHGFALG VAAALAAIHR TGLVHRDLKP ANVLLSLSGP RVIDFGIARA MNTATNHTQT
GIVMGSPGWM APEQLLEEKV TTSADIFAWG CLVAFAGNGT HPFGNGDAMT LGKRVLFAEP
QIGNLISPLD RLVTRALAKE PGRRPTAQDL LLELAGGEDN SNPNDMVSHA LHQSWRPNLP
PMPPHGMPHP QQGPHQTMAG MRHPAPGQYQ GAPPAPMPPP AHQTGNYPRP QGPPPGHPAG
PPPGGQAQQV HQAPGVQAPG GQPQGGPVQQ ARPVPGPAQA PHPASTGPIP MVPPADQQTG
RRGPQPYVPP VPPPPHQPAA PNRRKGTVVA LVLGAVALLA ALIVMATVLT NLSDWSLFGD
GDPQGDSSEQ TGAAQEPQEP APGDESPGGA DEEEAPGEVP SGMSGASADR MVEYRIRGVS
CGLTEHNIRS ELPSTGQYCV VDLELFNVSD ELVRFEHTEQ QMTTSGEPVN AQAPSVREVE
APLWDPAGIS PGVAAGGELV FILRDDMDPR TLVLNHRTGA EATEINIEHI VD