Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5246 |
Symbol | |
ID | 9249143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 405513 |
End bp | 407354 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | serine/threonine protein kinase with PASTA sensor(s) |
Protein accession | YP_003683132 |
Protein GI | 297564159 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.438769 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0079464 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCCAGC CCCGGCTCCT CGGCGGACGC TACGAGCTCG ACACCATCGT CGGGCGCGGC GGCATGGCCG AGGTCTACCG CGCACGCGAC CTGCGTCTCG ACCGGCTCGT CGCCATCAAG ACCCTCCGAC ACGACATGGC TCGCGACCAC GTCTTCCAGG CCAGGTTCAG ACGGGAGGCG CAGTCCGCCG CCTCCCTGAA CCACCCGGCC ATCATCGCGG TGTACGACAC CGGCGAGGAC ATGATCGACG GGGTGTCCAT CCCCTACATC GTCATGGAGT ACGTGGACGG GCGGACCCTG AAGGAGCTGC TGGACGACGA CCGCCGCCTC CTGCCGGAGC GCTCGGCGGA GCTGGTCGAC GGCATCCTCA AGGCGCTGGA GTACAGCCAC GACAACGGGA TCGTCCACCG CGACATCAAG CCCGCGAACG TGATGCTGAC CCGCAACGCC GACGTCAAGG TGATGGACTT CGGCATCGCC CGGTCCATGG ACGACAACCA GGCGACGATG ACGCAGGCCT CCCAGGTGAT CGGTACCGCC CAGTACCTGT CCCCGGAGCA GGCGCGCGGC GAGCGGGTGG ACCCGCGCAG CGACATCTAC TCCACCGGCT GCGTGCTCTA CGAGCTGCTC ACCGGCCGTC CGCCGTTCAC GGGCGACTCG CCCGTCTCGA TCGCCTACCA GCACGTGCGG GAGGAGCCGG TCCCGCCGAG CGAGATCGAC CCCCAGATCC CCCATTGGCT GGAGGACGTC ACCCTCCGGG CGATGACCAA GAACCGCGAG GAGCGGTACC AGAACGCGGC CGAGATGCGC GCCGACATCC AGCGCGGCCT GGCCGGGATG CCCACCCAGG CGGGCACCAT GGCCATGGCC GCCGCCGGCG CCACGACGGC GATGCCGCCC GCGCCCGCCG AGCGGTACGA CGACTACGAC GATTACGACG ACGACTACGA CGACCGCTAC GACGACCGCG GGAAGGACGG CCGCGGCAAG ACCGCGCTGT GGGTCCTTCT GGGCGTCGGC GTGGTCGCCT CCCTGATCCT GGTGTTCGTG CTGATGAACC TGGGGGGAGG GGATCCCGAG ACGCAGACCG CCGCGGTACC GGACGTGGCG GGGTCCACCG TCGCGGAGGC CCAGTCCTCC CTGAGCGAGG CGGGCTTCGA GAACGTCACC CCGGAGCAGC AGGCGAGCGA GGACGTCGAG GAGGGCCAGG TCATCGAGAC CGACCCGCCC GCCGGCGACG AGGTCCCGGT GGACGAGGAG ATCGTCCTGT ACGTCTCCAG CGGCCCGGAC GCCCTGGAGA TCCCGTCCGT GCAGGGCCAG TCCGAGTCCG ACGCGACCGG CACCCTGAAC GACGCGGGCT TCGAGAACGT CACCTCCGAA CAGAGGGCGG ACGACAGCGT GCCCGAGGGC CAGGCGATCG GCACCGACCC GGCCGCCGGG GAGGCGGTCG CGCCGGACAC CGACATCACG CTGCTGATCT CCTCCGGACC CAACCAGGTA CAGGTGCCCG ACCTGGTCGG CATGACCCGC GACGGCGCGG AGTCGGCGCT GGCGCAGAGG GACCTGAGCG CCTCCTTCTC CGAGGAGCCG AGCACGGAGG GCCCGGTCGG CACGGTCATC CGGCAGGACC CGGGGTCGGG GCAGAACGTG GCGCCCGGGA GCACGGTGAA CGTCGTGCTC GCCACCGAAC CGGCCACTCA GGGGCCGTCG GACGGCGACG ACGGCGGCGA GTCCCCTCCG GGTGAGGGCG GGGAGACCCC TCCCGGCGGT GAGGACGGGG GCCAGACCCC GCCCGGCGGC GACGACGGCG GCTTCGAGTT CCCGTCCATG CGCAGGGACT GA
|
Protein sequence | MSQPRLLGGR YELDTIVGRG GMAEVYRARD LRLDRLVAIK TLRHDMARDH VFQARFRREA QSAASLNHPA IIAVYDTGED MIDGVSIPYI VMEYVDGRTL KELLDDDRRL LPERSAELVD GILKALEYSH DNGIVHRDIK PANVMLTRNA DVKVMDFGIA RSMDDNQATM TQASQVIGTA QYLSPEQARG ERVDPRSDIY STGCVLYELL TGRPPFTGDS PVSIAYQHVR EEPVPPSEID PQIPHWLEDV TLRAMTKNRE ERYQNAAEMR ADIQRGLAGM PTQAGTMAMA AAGATTAMPP APAERYDDYD DYDDDYDDRY DDRGKDGRGK TALWVLLGVG VVASLILVFV LMNLGGGDPE TQTAAVPDVA GSTVAEAQSS LSEAGFENVT PEQQASEDVE EGQVIETDPP AGDEVPVDEE IVLYVSSGPD ALEIPSVQGQ SESDATGTLN DAGFENVTSE QRADDSVPEG QAIGTDPAAG EAVAPDTDIT LLISSGPNQV QVPDLVGMTR DGAESALAQR DLSASFSEEP STEGPVGTVI RQDPGSGQNV APGSTVNVVL ATEPATQGPS DGDDGGESPP GEGGETPPGG EDGGQTPPGG DDGGFEFPSM RRD
|
| |