Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4240 |
Symbol | |
ID | 9248114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5056229 |
End bp | 5057392 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | DNA-cytosine methyltransferase |
Protein accession | YP_003682137 |
Protein GI | 297563163 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.481849 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.893444 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCACGC CTTCACCGCC TCCCAACCAA CAGCCCCAGG TCCTCGAGTT CTTCGCGGGG ATCGGCTTGG CCAGAATCGG ACTCGAAGCA GCGGGTCTCA GGGTCTCGTG GTCCAACGAT TACGAGACGA GCAAGAAGAA CATGTACGAA GGTCACTTTG GGACATCCAG CGACCACACA TATGTGCTCC GCGATATTCG GAAAGTCTAC GCTGACCAGC TCCCGGCTGG CGCATCCGTG GCATGGGCTT CATCACCTTG CACAGACCTG TCACTCGCAG GAGCCAGGGC CGGCCTGGCA GGGGCAGAGT CAGGAACCTT CTGGGAATTC ATTAGAATCC TTAAAGATTT CAATGAATCA CGCCCTCCTA TCGCCGTCCT TGAAAATGTT GTTGGACTAG CGACTTCACA TGCAGGCGAA GATCTAGCTG CCGCAGTAAA GGCATTCAAT GAGCTCGGTT ACTCAGTCGA CGTCCTAGTG ATCGACGCGC GCCGCTTCAT TCCACAGTCA CGACCGCGTC TTTTTCTTGT AGCCGCCCAA AACCCTCCAA ATGGCCGCCC TCAGACAGAT TCCACACTAC GCCCCGACTT TCTTCAGCCA GTCTTTGGGG ATCCAACACT CACAACGCAT AGGGCACATC TTCCTGAACC TCCGGCACTC CTAACGTCCG GTTTTGGGAT GTGCGTCGAA GAGATGCCCT TGAATGACGA GCGATGGTGG GATGAAGAAC GAACAGAGGC ATTCATGTCC TCGCTATCAC CCACACAATA CCAGCGCGTG ATGCAGATGC ATTCCTCACC GGGCGTTAAG TACCGAACAG CATATAGGCG AACTCGTAAG GGAATCCCCG TATGGGAGGT TCGCCCTGAT GACGTATCGG GGTGCTTGCG AACTGCACGC GGTGGCTCTT CCAAGCAAGC TGTTCTAAGG GTCGATAACT CATCGCTTCA TGTCAGGTGG ATGACCCCTC GAGAATATGC CCGCTTGATG GGAGCAGGCG AGTATAAGCT TGACGGGATC CGAGCCAATA AGGCATTGTT CGGCTTCGGC GACGCTGTCG CCGCACCTGT CGTGCAGTGG CTAAGCGAGA AATATCTTTT GCCTCTCCTT CGAGAAGAAA ATTTTACCGA GCCCGAGATG ATGGAGATCC CTCTTGGCCA GTAG
|
Protein sequence | MFTPSPPPNQ QPQVLEFFAG IGLARIGLEA AGLRVSWSND YETSKKNMYE GHFGTSSDHT YVLRDIRKVY ADQLPAGASV AWASSPCTDL SLAGARAGLA GAESGTFWEF IRILKDFNES RPPIAVLENV VGLATSHAGE DLAAAVKAFN ELGYSVDVLV IDARRFIPQS RPRLFLVAAQ NPPNGRPQTD STLRPDFLQP VFGDPTLTTH RAHLPEPPAL LTSGFGMCVE EMPLNDERWW DEERTEAFMS SLSPTQYQRV MQMHSSPGVK YRTAYRRTRK GIPVWEVRPD DVSGCLRTAR GGSSKQAVLR VDNSSLHVRW MTPREYARLM GAGEYKLDGI RANKALFGFG DAVAAPVVQW LSEKYLLPLL REENFTEPEM MEIPLGQ
|
| |