Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3758 |
Symbol | |
ID | 9247627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4513545 |
End bp | 4514912 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | serine/threonine protein kinase |
Protein accession | YP_003681662 |
Protein GI | 297562688 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.143716 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGGTC CCGAACCGCT ACTGCCGAGC GACCCCTCCT CGTTCGGCGA GTACAGCGTG ACAGGGCGGC TGGGCAAGGG CGGCCAGGGG ATCGTCTACC TCGCCGAGGA CTCCGGCGGC GAGCAGTACG CGGTCAAGGT CCTCAACGAC CAGTGGTCGC GCGACAGCGA CCTGCGCAAG CGGTTCGAGA AGGAGGTCCG GGCGGCGCAG AAGGTCGCCT CCTTCTGCAC CGCCGCGATC ATCGACGCCC GGCTCGACGG CGACCCGCCC TACGTGGTCA GCGAGTTCGT GCCGGGCAAG GACCTCCAGG AGACCATCGC CAAGGGCGCC ACCCAGAAGG GCTCCACCCT CCAGCGGCTC GCCATCACCA CCGCCACCGC CCTGGTCGCC ATCCACAAGG CGGGCATCGT CCACCGCGAC TTCAAGCCCG GCAACGTGCT GATCGGCCCG GACGGCCCCC GCGTCATCGA CTTCGGCATC GCCCGGATCG ACGACGGCAC CGCCACCATG ACCAACTCGA TCGTCGGTAC GCCCTCCTAC ATGGCGCCGG AGCAGATCGA GGGCCGCGAC ATCACCGACA AGTGCGACAT CTTCGCCTGG GGCTGCGTCA TCGCCTTCGC CTCCACCGGG AGGGCGCCGT TCGGCTCCGA CACCGTCCCG GCGGTCGTCC ACCGCGTGGT CAGCGCGCCC CCGGACCTGA CCGGCATGGA CGAGGCCCTG CGCCCCATCG TCGAGTCCTG CCTGGACAAG AACCCGGACA ACCGGCCCAA CGCCCAGACG CTCCTCATGC GCCTGCTGGG CCACGAGGGC CCCGAGGCCT CCGGCCCCTC CACCGACGAC GCCGTCAAGC AGGGCGAGCA GATCGCCCAG ACCGGCCTGT TCACCGGCGC CCAGATGCCC TACCGGCCGC ACACCGGGCC GCAGCAGCCG CACACGGGCC CCCAGCAGCC GCACAGCGGC CCGCAGGCGG GCGCCGCGGG GTACGCGCAG CCCGGCGCCG CCAACCCCCG TCCGGGCATG CCGGGGCACC CCATGGGCTC CGACCCCCGC CACGGCTTCG TCCACCCCGG CACCTCCAAC CCCAGGCCCG GAATCCCCGG CCACCCGGGC ACGTCCAACC CGCGCCCGGG CGCCCCCGCC TACCCGCCCC CGCCGCCCGG CATGCCCCAC GGCCTGCACG CGGGCGCCAA CCCCATGGCC ACGGGCATGA ACCCCAACCA GACCTACGCC CGCCCCGGCG CGCAGCAGCC CATGCCGCGC CCCGTCGAGC AGAACTCGGT GCTCCAGCAG AGCTGGGTGA TCCCGGCGGT GCTCATCACC ATCGTGATCC TGCTGCTGGT GCTGCTCCTG CTGGCCATCA GCGGCTGA
|
Protein sequence | MPGPEPLLPS DPSSFGEYSV TGRLGKGGQG IVYLAEDSGG EQYAVKVLND QWSRDSDLRK RFEKEVRAAQ KVASFCTAAI IDARLDGDPP YVVSEFVPGK DLQETIAKGA TQKGSTLQRL AITTATALVA IHKAGIVHRD FKPGNVLIGP DGPRVIDFGI ARIDDGTATM TNSIVGTPSY MAPEQIEGRD ITDKCDIFAW GCVIAFASTG RAPFGSDTVP AVVHRVVSAP PDLTGMDEAL RPIVESCLDK NPDNRPNAQT LLMRLLGHEG PEASGPSTDD AVKQGEQIAQ TGLFTGAQMP YRPHTGPQQP HTGPQQPHSG PQAGAAGYAQ PGAANPRPGM PGHPMGSDPR HGFVHPGTSN PRPGIPGHPG TSNPRPGAPA YPPPPPGMPH GLHAGANPMA TGMNPNQTYA RPGAQQPMPR PVEQNSVLQQ SWVIPAVLIT IVILLLVLLL LAISG
|
| |