Gene Ndas_5246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5246 
Symbol 
ID9249143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp405513 
End bp407354 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content71% 
IMG OID 
Productserine/threonine protein kinase with PASTA sensor(s) 
Protein accessionYP_003683132 
Protein GI297564159 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.438769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0079464 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCCAGC CCCGGCTCCT CGGCGGACGC TACGAGCTCG ACACCATCGT CGGGCGCGGC 
GGCATGGCCG AGGTCTACCG CGCACGCGAC CTGCGTCTCG ACCGGCTCGT CGCCATCAAG
ACCCTCCGAC ACGACATGGC TCGCGACCAC GTCTTCCAGG CCAGGTTCAG ACGGGAGGCG
CAGTCCGCCG CCTCCCTGAA CCACCCGGCC ATCATCGCGG TGTACGACAC CGGCGAGGAC
ATGATCGACG GGGTGTCCAT CCCCTACATC GTCATGGAGT ACGTGGACGG GCGGACCCTG
AAGGAGCTGC TGGACGACGA CCGCCGCCTC CTGCCGGAGC GCTCGGCGGA GCTGGTCGAC
GGCATCCTCA AGGCGCTGGA GTACAGCCAC GACAACGGGA TCGTCCACCG CGACATCAAG
CCCGCGAACG TGATGCTGAC CCGCAACGCC GACGTCAAGG TGATGGACTT CGGCATCGCC
CGGTCCATGG ACGACAACCA GGCGACGATG ACGCAGGCCT CCCAGGTGAT CGGTACCGCC
CAGTACCTGT CCCCGGAGCA GGCGCGCGGC GAGCGGGTGG ACCCGCGCAG CGACATCTAC
TCCACCGGCT GCGTGCTCTA CGAGCTGCTC ACCGGCCGTC CGCCGTTCAC GGGCGACTCG
CCCGTCTCGA TCGCCTACCA GCACGTGCGG GAGGAGCCGG TCCCGCCGAG CGAGATCGAC
CCCCAGATCC CCCATTGGCT GGAGGACGTC ACCCTCCGGG CGATGACCAA GAACCGCGAG
GAGCGGTACC AGAACGCGGC CGAGATGCGC GCCGACATCC AGCGCGGCCT GGCCGGGATG
CCCACCCAGG CGGGCACCAT GGCCATGGCC GCCGCCGGCG CCACGACGGC GATGCCGCCC
GCGCCCGCCG AGCGGTACGA CGACTACGAC GATTACGACG ACGACTACGA CGACCGCTAC
GACGACCGCG GGAAGGACGG CCGCGGCAAG ACCGCGCTGT GGGTCCTTCT GGGCGTCGGC
GTGGTCGCCT CCCTGATCCT GGTGTTCGTG CTGATGAACC TGGGGGGAGG GGATCCCGAG
ACGCAGACCG CCGCGGTACC GGACGTGGCG GGGTCCACCG TCGCGGAGGC CCAGTCCTCC
CTGAGCGAGG CGGGCTTCGA GAACGTCACC CCGGAGCAGC AGGCGAGCGA GGACGTCGAG
GAGGGCCAGG TCATCGAGAC CGACCCGCCC GCCGGCGACG AGGTCCCGGT GGACGAGGAG
ATCGTCCTGT ACGTCTCCAG CGGCCCGGAC GCCCTGGAGA TCCCGTCCGT GCAGGGCCAG
TCCGAGTCCG ACGCGACCGG CACCCTGAAC GACGCGGGCT TCGAGAACGT CACCTCCGAA
CAGAGGGCGG ACGACAGCGT GCCCGAGGGC CAGGCGATCG GCACCGACCC GGCCGCCGGG
GAGGCGGTCG CGCCGGACAC CGACATCACG CTGCTGATCT CCTCCGGACC CAACCAGGTA
CAGGTGCCCG ACCTGGTCGG CATGACCCGC GACGGCGCGG AGTCGGCGCT GGCGCAGAGG
GACCTGAGCG CCTCCTTCTC CGAGGAGCCG AGCACGGAGG GCCCGGTCGG CACGGTCATC
CGGCAGGACC CGGGGTCGGG GCAGAACGTG GCGCCCGGGA GCACGGTGAA CGTCGTGCTC
GCCACCGAAC CGGCCACTCA GGGGCCGTCG GACGGCGACG ACGGCGGCGA GTCCCCTCCG
GGTGAGGGCG GGGAGACCCC TCCCGGCGGT GAGGACGGGG GCCAGACCCC GCCCGGCGGC
GACGACGGCG GCTTCGAGTT CCCGTCCATG CGCAGGGACT GA
 
Protein sequence
MSQPRLLGGR YELDTIVGRG GMAEVYRARD LRLDRLVAIK TLRHDMARDH VFQARFRREA 
QSAASLNHPA IIAVYDTGED MIDGVSIPYI VMEYVDGRTL KELLDDDRRL LPERSAELVD
GILKALEYSH DNGIVHRDIK PANVMLTRNA DVKVMDFGIA RSMDDNQATM TQASQVIGTA
QYLSPEQARG ERVDPRSDIY STGCVLYELL TGRPPFTGDS PVSIAYQHVR EEPVPPSEID
PQIPHWLEDV TLRAMTKNRE ERYQNAAEMR ADIQRGLAGM PTQAGTMAMA AAGATTAMPP
APAERYDDYD DYDDDYDDRY DDRGKDGRGK TALWVLLGVG VVASLILVFV LMNLGGGDPE
TQTAAVPDVA GSTVAEAQSS LSEAGFENVT PEQQASEDVE EGQVIETDPP AGDEVPVDEE
IVLYVSSGPD ALEIPSVQGQ SESDATGTLN DAGFENVTSE QRADDSVPEG QAIGTDPAAG
EAVAPDTDIT LLISSGPNQV QVPDLVGMTR DGAESALAQR DLSASFSEEP STEGPVGTVI
RQDPGSGQNV APGSTVNVVL ATEPATQGPS DGDDGGESPP GEGGETPPGG EDGGQTPPGG
DDGGFEFPSM RRD