Gene Information |
Locus tag | Ndas_4405 |
Symbol | |
ID | 9248280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5239869 |
End bp | 5242214 |
Gene Length | 2346 bp |
Protein Length | 781 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | serine/threonine protein kinase |
Protein accession | YP_003682300 |
Protein GI | 297563326 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
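The coordinate and length fields in the gene table can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is an untranslated stop. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 5239869
end_bp = 5242214
gene_length_bp = 2346
protein_length_aa = 781

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be the stop
# codon, which does not contribute an amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 2346 bp is 782 codons, which is 781 residues plus one stop codon.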
Plasmid Coverage Information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.42637 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAAGT GCACGGTCCC CGGCTGCCCC GGCCGGGTCG AGGACGGTTT CTGCGGCGTG TGCGGCATGG CCCCCGCGGA CGTACGGCCT CCGGACGGCC CCGCCCCCCG GCAGCGGCCC GTGCCGCCGA CCTCCGGCCC GCAGCCCTCC CTCGGCGCGA ACTCCTCCGC CAACCCCCTC AGCGGCGACC CCGTCCACTC CCACCCCCTC TCCGGCAACG TCAGGCTCTC GGGGCGCTCC GGTCCCGGCA GTGTCTCCGC GACCTCGGGC GCCAGCGTCA GCGGCGCCGG ACGCGGCAGC CGCCGATCCT CCCGCGGCAT GCTCGGCATG GGCATGGTCC AGGTGCCCCT CGTCCCCTAC CGGGACCCCT CCGAAGCCGT CATGGACAAC CCCGTCGTCG CGGAGAAGAA CCGCTTCTGC GGCAACTGCG GGGAGCGGGT CGGCCGCACC CAGGGCGACC AGCCCGGGCG CACCGAGGGC TTCTGCCGCA AGTGCGGGAC CCAGTTCTCC TTCACGCCCA AGCTCTCCCC CGGGGACCTG GTCGGCGGCC AGTACGAGGT GCTCGGCTGT CTGGCCCACG GCGGCCTCGG CTGGATCTAC CTGGCCCGGG ACCGCAACGT CAACGACCGC TGGGTGGTCC TCAAGGGCCT GCTCAACGCC GGTGACGCCG AGGCGCACAA GGCGGCCGCC GCCGAGCGCG CGTTCCTCTC CGAGGTCGAG CACCCCAACA TCGTCAAGAT CATCAACTTC TCCCAGCACC CCGACCCGCG CACCGGCATC CCCGGCGGCC ACATCGTCAT GGAGTACGTG GGCGGCAAGT CCCTGCGCGA GCTGCTCATC GAGCGCCGGG AGGACGACCC CGACGCCGTC CTGCCCCCGG ACCAGGTCAT CGCCTACGGC CTGGAGGTCC TGCGGGCGCT GGGCTACCTG CACTCCAGGG GCCTGCTCTA CTGCGACTTC AAACCCGACA ACGTCATCCA GAGCGAGGAG CAGATCAAGC TGATCGACCT GGGCGGCGTG CGGCGCATGG ACGACACCGT CAGCCCCGTC TACACCACGC CCGGGTACCG GGTGCCCGAG GAGGAGCTGC GCGGCCCCGG CCCCACGGTC AGCGCCGACC TGTACTCGGT CGGCCGCGCC CTGGCCGTGC TCAGCTTCCG CTTCAGCTTC ATGCGCGACC ACCCGCACAG CATCCCGCCG AGGGAGACCG TGCCGCTGCT CCAGCGCCAC CCGTCCTTCG ACCGGCTGCT GCGCCGCGCC ACGCACAGCG AACCCGAACT GCGCTTCCAC GACGCCGAGG ACATGGCCGA CCAGCTCACC GGCGTCCTGC GCGAGGTGCT CTCCGACCTG GAGGGAACGC CGCACCCGGC CCCCTCGACC CTGTTCGGCG CGGAGAACCC CGCCGCCCGA CCCGACCCCG GCGCCCACAA CGCCGACCGC ATGCTCCTCG CGCCCCCGCC CACCGACGCG GCGGCGCTGC TGCCCGCGCC CCTCATCGAC CAGTCCGACC CGGCCGCCCA GCACCTGCGC GGTTTCCAGA CCCTGCCGCC CGAGGAACTC GTCCCCGCGC TGCGCGCCAT GCCCTCGCCC ACCCCCGAGA CCCTGCTGAT GCTGGTCCGC GCGCTCGTCA CCGTCGGACG CCAGGGCGAG GCCATGGACG TCCTCCAGCG GTTCAGCGAA GTGGTCCCGG GCGACTGGCG CACCATGTGG TACCTGGCCG TCGCCGAGCT GAGCACCGGC CGGTTCCGCG ACGCCCGCGA CCACTTCGAC GAGCTGTACG ACCACCTGCC CGGGGAACTC 
GCGCCCAAGC TCGCACTGGC CCTGGCCTGC GAACGGACCG GCGAGCACGA GACCGCCGCC CGCCTGTTCC GGGCGGTCTG GAACACCGAC CGCTCCTTCG TCAGCGCCGC GTTCGGCCTG GCCCGCATCC GCCTGGTCCA GGGGGACCGT GCCGCTGCCA CCGCCGTCCT GGACACGGTC CCGGAGCTGT CGCGGCTGCA CGTCCACGCC CAGACCGCGC TGATCGCCGT GCTCGCGGCG CACCACCAGG GCACCGGGGC CGCCGACTTC GTGCAGGCCG GACGGCGCCT GGAGCGGATC GGCCTGGACG GCGAGTCCGC CGACCGCCTG GCGGTACGGG TCCTGGAGTC GGCCCTGGGC TGGTTCGGCG CGGGTGCGGG GGCGTCGGCG GGGGAGGAGC TCCTCGGCGC CCCGTTCACC GAGAACGGTG TGCGGGCCAA CCTGGAGCGC CGCTACCGCG CGCTTGCCCA GCGCGCCGTC GCGCCGTCGG AACGCTATGA GCTCGTCGAC AGGGCCAACT CCCTACGCCC CGTGACGCTG CTGTGA
|
Protein sequence | MTKCTVPGCP GRVEDGFCGV CGMAPADVRP PDGPAPRQRP VPPTSGPQPS LGANSSANPL SGDPVHSHPL SGNVRLSGRS GPGSVSATSG ASVSGAGRGS RRSSRGMLGM GMVQVPLVPY RDPSEAVMDN PVVAEKNRFC GNCGERVGRT QGDQPGRTEG FCRKCGTQFS FTPKLSPGDL VGGQYEVLGC LAHGGLGWIY LARDRNVNDR WVVLKGLLNA GDAEAHKAAA AERAFLSEVE HPNIVKIINF SQHPDPRTGI PGGHIVMEYV GGKSLRELLI ERREDDPDAV LPPDQVIAYG LEVLRALGYL HSRGLLYCDF KPDNVIQSEE QIKLIDLGGV RRMDDTVSPV YTTPGYRVPE EELRGPGPTV SADLYSVGRA LAVLSFRFSF MRDHPHSIPP RETVPLLQRH PSFDRLLRRA THSEPELRFH DAEDMADQLT GVLREVLSDL EGTPHPAPST LFGAENPAAR PDPGAHNADR MLLAPPPTDA AALLPAPLID QSDPAAQHLR GFQTLPPEEL VPALRAMPSP TPETLLMLVR ALVTVGRQGE AMDVLQRFSE VVPGDWRTMW YLAVAELSTG RFRDARDHFD ELYDHLPGEL APKLALALAC ERTGEHETAA RLFRAVWNTD RSFVSAAFGL ARIRLVQGDR AAATAVLDTV PELSRLHVHA QTALIAVLAA HHQGTGAADF VQAGRRLERI GLDGESADRL AVRVLESALG WFGAGAGASA GEELLGAPFT ENGVRANLER RYRALAQRAV APSERYELVD RANSLRPVTL L
|
| |
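The gene sequence and the protein sequence can be related through translation table 11. The sketch below is a minimal illustration rather than the database's own pipeline. It assumes the listed gene sequence is already in coding orientation (it begins with ATG even though the gene is on the minus strand), and it uses the table 11 amino acid assignments in the usual TCAG codon order. The `translate` helper is a hypothetical name, and only the first 30 and last 15 bases of the record are used to keep the example short.

```python
# Amino acid assignments for translation table 11, codons ordered T, C, A, G
# at each position ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
first_bases = "ATGACGAAGT GCACGGTCCC CGGCTGCCCC"
last_bases = "GTGACGCTG CTGTGA"

print(translate(first_bases))  # MTKCTVPGCP
print(translate(last_bases))   # VTLL
```

Both fragments match the start (MTKCTVPGCP) and end (VTLL) of the listed protein sequence, with TGA acting as the stop codon.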