Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3081 |
Symbol | |
ID | 9246937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3685439 |
End bp | 3686686 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | restriction modification system DNA specificity domain protein |
Protein accession | YP_003680996 |
Protein GI | 297562022 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.758933 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAAT CGCTGGTACC AGGAACCCCT CAAACATGGA AAGTCACCAC TCTTGGCGAA CTCTGTGCTT CAGGTGGCGG AAATATTCAG ACTGGCCCCT TTGGAAGCCA ACTCCACGCC GCCGACTATG TTACTCAAGG AATTCCTAGC GTTATGCCCC AAAATATTGG GGACAACGTC ATCAAAGAGG AGGGAATAGC ACGGATTGCG CCGGAAGACG CATTTCGCCT CGAAAAATAC CTCCTCGCAC CAGGAGACAT TGTCTACTCG CGGCGAGGAG ATATTGAAAA ACGGGCGCTA GTACGCGAGA CTCAGCGCGG ATGGCTGTGT GGAACTGGAT GTCTCCGCGT TAGGCCCGGA GTTGGAGCAA ACTCCGAATT CATCTCTTAC TATCTAGGCC ACCCCAGCGT GAGAGAGTGG ATTGTGAAAC ACGCCGTGGG CGCCACCATG CCGAATCTCA ATACGAAGAT TCTGAGCTCC CTCCCCGTAA GTGTCCCTCC ACTCAACGAA CAGGTTTCCA TAGCTTCAAC TCTTGGGGCG CTAGATAATA AGATCACAGT CAACAAGCAG ATCGTTAGCA CCTACGAATC CCTTCTTGCA ACTGAGTTCG AACAACTCAT CAGAATAGAA GCAGGGGCAG AACAGGACAT CGCCCTGGCA AACGAATTCG TAGAGTTCAA TCCAAAGTAC CAGAAACCAT CTGATCCCAC ATCCCGCCAT GTGAACATGG CAGCACTACC CACAAGCTCT GCCAGGGTTC ACACATGGGA TTTCCGAAAG CCTACACCTG GGACTCGATT CCAGAACGGT GACACCTTGC TCGCAAGGAT CACTCCCTGC CTGGAGAACG GAAAGACGGC ATTCGTTGAC TTTATGGATG ACAACGAGAC CGGCATCGGA TCCACTGAAT TTATCGTCAT GCGCTCACTC CCAGGCGTGC CGCAGCATTT CTCCTATCTT CTGGCTCGGA ACAAGCGCTT CCGCGAACAT GCGATCTCGA ACATGATCGG AACTTCTGGG CGCCAGCGCT GCCCAGCCGA CCGCCTTCCG GGTTTCTCGA TGAAGCGTCC TGACCCCACA GAGCTGGAAC GAATCGGAAA AGATTCCGAT GTGGCATTCG CTCACATGCG ATCTCTTGAC TCAGAGGCCT ACATTCTCGC GGAACTGCGG GACACACTCC TGCCAAAGCT GATCTCCGGG GAGCTTCGCG TCAAGGACGC CGAGAAGCGG GTCTCCGACG CGGTCTGA
|
Protein sequence | MPESLVPGTP QTWKVTTLGE LCASGGGNIQ TGPFGSQLHA ADYVTQGIPS VMPQNIGDNV IKEEGIARIA PEDAFRLEKY LLAPGDIVYS RRGDIEKRAL VRETQRGWLC GTGCLRVRPG VGANSEFISY YLGHPSVREW IVKHAVGATM PNLNTKILSS LPVSVPPLNE QVSIASTLGA LDNKITVNKQ IVSTYESLLA TEFEQLIRIE AGAEQDIALA NEFVEFNPKY QKPSDPTSRH VNMAALPTSS ARVHTWDFRK PTPGTRFQNG DTLLARITPC LENGKTAFVD FMDDNETGIG STEFIVMRSL PGVPQHFSYL LARNKRFREH AISNMIGTSG RQRCPADRLP GFSMKRPDPT ELERIGKDSD VAFAHMRSLD SEAYILAELR DTLLPKLISG ELRVKDAEKR VSDAV
|
| |