Gene Information |
Locus tag | Ndas_5207 |
Symbol | |
ID | 9249100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 356984 |
End bp | 357955 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | putative anti-sigma regulatory factor, serine/threonine protein kinase |
Protein accession | YP_003683093 |
Protein GI | 297564120 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.441203 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTCG AACACCAGGG ACTGCTCTTC CGCCGCGAGC AGCGGTTCCA CGAGGTGGCC CGGCAGCGGT TGCGCTCAGC CGTACGCGAA AACGCCCACA CGGTGCTCGC GGTCTCCGGC GAACGCGCGG CAGCGCTCAC CGGGGCGCTG TCGGCCTCCG AGCGCGAGCG CGTGCACGTC CTGGAGCGGA ACCGGTTCTA CGACGCCCCG GGGCGCACCC TGGCCGCCCT GCACCGCCTG GCGCTGGTCC ACGCCCCCGT GCCGGTGGTC GTGGTGGCCG AGCCGCCGCT GCCGGTCTCG GGGATGGAGC TGCGCGAGTG GCACCGGCTG GAGTCGGTGC TGTCCACGGC GCTGGCCCCG CAGCGGCTGA GCCTGCTGTG CGTCCACGAC GACCGCGACC TGACCACGCG GACGCGGTCC GCGGTCGTGG CCACCCACCC GGTGCTGGTG GAGTCCGACG GGCCCCGCCC CAACCCCGCC TACCTGGGCA CGGCCGCGTT CGGCACCCGG CCGGTCGCCC CGGAGCCGCT GCCGGTCAGC GGACCCGCGC ACCGGCTGGA GATCGGGCTG TCCCTGCCGC GGCTGCGCGC CGACCTGGCG GCGCTCGGCG AGTCCATGGG GTTCCCGCCC GAGCGGATCG ACAGCCTGGT GGTGGCGGTC AACGAGCTGG CCGCCAACGT GCTGGAGCAC GGGGCGGGCA AGGGCACCGT GCAGGTCTGG CGCGCCCCCG GGCGGTGGGT GTGCGACGTG TTCGACGAGC GCGGCGGGCT CTTCGACCCG CTCACCGGCT ACCGCCCGGC CGACAGCATG CGCCCGCGCG GGTACGGGCT GTGGATCACC CGGCAGACCT GCGACTTCCT GGAGATCAGC GGCAGCGGCG AGGGCTCCCT GGTACGGCTG CACTTCGTGG ACGGGGCCGG GGAGTCCGGG ACCGTCCGGG AACCAGGGAC GCGGGCCCCG CTCAGCCCCT GA
|
Protein sequence | MTFEHQGLLF RREQRFHEVA RQRLRSAVRE NAHTVLAVSG ERAAALTGAL SASERERVHV LERNRFYDAP GRTLAALHRL ALVHAPVPVV VVAEPPLPVS GMELREWHRL ESVLSTALAP QRLSLLCVHD DRDLTTRTRS AVVATHPVLV ESDGPRPNPA YLGTAAFGTR PVAPEPLPVS GPAHRLEIGL SLPRLRADLA ALGESMGFPP ERIDSLVVAV NELAANVLEH GAGKGTVQVW RAPGRWVCDV FDERGGLFDP LTGYRPADSM RPRGYGLWIT RQTCDFLEIS GSGEGSLVRL HFVDGAGESG TVREPGTRAP LSP
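
The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is illustrative only and is not part of the original record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the Protein Length excludes the terminal stop codon (the gene sequence above ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for this check.

```python
# Values copied from the Gene Information table in this record.
START_BP = 356984
END_BP = 357955


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop.

    Assumption: the CDS length is a whole number of codons and the
    last codon is a stop codon that is not counted as a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 972, matching "Gene Length"
    print(protein_length(length))  # 323, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (972 bp) and Protein Length (323 aa) listed in the table.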