Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1428 |
Symbol | |
ID | 9245278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1748150 |
End bp | 1749307 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | transposase, IS605 OrfB family |
Protein accession | YP_003679366 |
Protein GI | 297560392 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.996106 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTGTGA AGATCGTTGT GCAGGTGAAG CTGCTCCCCG ACTCCGCTCA GGAGGCGACG CTGTGGGAGA CGCTGAAGTT GTGCAACCAG GCGGCCAACC GGGCCTCGCG GAGTGCGCAC GAGTCCGGGG TGAAGGCGAA GACACCGTTG CAGCGGCTGG TCTACGACGA CCTCAAAGCG ATGGGGTTGT CCGCGCAACC GGCGATCCAC TGCGCACGCA AGGTCGCCGG GGCCTATGCC ACGCTGAAGG CCAACCTCAA GGCCGGAAAC CACGGCTGTG AGGGGTCCAA GCGGCGTACC CGGGTGGAGA ACACGCCGGT TCGGTTCCGA AAAGACGCCG CGCAGCCCTT CGACGACCGG TGCCTGTCCT GGCAGATAGA CGCGCGCACG GTGTCGATCT GGACCGTGCA CGGCCGGGTG AAGGGCATGG CGTTCGCGTG TTCACCGGAC CAGGCCAAGG TGCTCACCGA GTTCCGCAAG GGCGAATCCG ACCTGGTCAA GCGCGGCGAC AACTGGTACC TGTACGCCAC CTGCGAGGTT CCCGAGGCCG ACGAGTACAC CCCCGAAGGT TTCGTGGGTG TGGACCTCGG CATCGCGAAC ATCGCCACCA CCAGCGACGG CACCGTTCAC TCGGGCAAAC AGGTCAACCA GGTCCGCCAC CGCAACCGCC ACCTCAGGCG CCGTCTCCAG AAGAAGGGAA CCAAGTCCGC CAAGCGTGTC CTGCGCCGCC TCTCGGGCCG CGAGGCGCGG TTCGCGGCCG ATACCAACCA CCGCATCGCC AAGCAGATCG TGACCGAGGC TCAACGCACC TCACGGGGTG TCGCCCTGGA AGACCTGGGC GGCATCCGCA AGAGGGTACG GCTACGCAAG CCCCAGCGGG TCACGCTGCA CTCCTGGTCC TTCGCTCAAC TGGGCGCTTA CATCGCCTAC AAGGCGAAAC GGGTCGGAGT GCCGGTGGTG TACGTGGACC CGGCCTACAC CTCCCAGGGG TGTAGCGCGT GTGGCCACGT CGACAAGAGG AACCGGCCGA GCCAGGCCAC TTTCCGATGC ACGTCGTGCG GTTTCGCTGG GCACGCCGAT GTGAACGCGG CCCGCAACAT CTCCTCCAGG GGTGTGGCGG GCTGGGCAGT GAGTCACGCT GCCGACGACG CGGCCTGA
|
Protein sequence | MGVKIVVQVK LLPDSAQEAT LWETLKLCNQ AANRASRSAH ESGVKAKTPL QRLVYDDLKA MGLSAQPAIH CARKVAGAYA TLKANLKAGN HGCEGSKRRT RVENTPVRFR KDAAQPFDDR CLSWQIDART VSIWTVHGRV KGMAFACSPD QAKVLTEFRK GESDLVKRGD NWYLYATCEV PEADEYTPEG FVGVDLGIAN IATTSDGTVH SGKQVNQVRH RNRHLRRRLQ KKGTKSAKRV LRRLSGREAR FAADTNHRIA KQIVTEAQRT SRGVALEDLG GIRKRVRLRK PQRVTLHSWS FAQLGAYIAY KAKRVGVPVV YVDPAYTSQG CSACGHVDKR NRPSQATFRC TSCGFAGHAD VNAARNISSR GVAGWAVSHA ADDAA
|
| |