Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2854 |
Symbol | |
ID | 9246705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3407278 |
End bp | 3408489 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | transposase, IS605 OrfB family |
Protein accession | YP_003680771 |
Protein GI | 297561797 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.232539 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTGCT TTGACTGCGG TCTGTCACTG GCCGGGTGCA CGATGGCGGG CGTGTCGAAG ATCGTCGTGC AGCTCGGGCT CACGCCGTCG CCCGAGCAGG CGGCGGTGCT GGCTTCGACC CTGCGCGCGC TCAACACCCA CGCCACCCGG GTGGCGAAGG TGGCCCACGA GCAGGGCGTG ATGCGCGACT ACGAGCTGCG CAAGCACACC TACCGGCAGC TGCGTGCGGC CGAGGTGGGC TCACAGGCGG CCCGACACGT GATCAAGAAG GTGTGCGACG CCTACCGCAC CCGCCGCGCC AATCTGGACA ACGGCGACTA CGGGCCGAAG GGATCAGTCC GCCGCACGCG GATCGGGTCC ACACCCATTG CCTTCCGCGC GGGCTCGGCG CACCCCTACG ACGCGCGCGA CCTGTCGTTC GCGATGGACG CGCGCACGAT CTCGCTGTGG ACCTTCCAGG GCCGGTTGCA GGACGTGCCG TTCACCGGTT CTGCCGACCA GGTCAAGGCG CTGGCCGGGC ACAAGCGCGG TGAGGCCGAC CTGCTCCGCC GCGACGGCGC CTGGTTCCTC GCGGTGACCG TGGAGGTGCC CGACGCCCCC GAGCAGGAGC CGGACGGGTT CGTGGGGGTG GACCTGAGGA TCATCAACAT CGCCACCACC AGCGACGGCC AGGTCATGGC CGGGCGCAGG ATCAACCGGT ACCGCCGCCG CCGGCTCAGG CTGCGCCAGA AGTTGCAGGC CAAGGGCACC CGCTCCGCCA AGCGCCTGCT CAGGAAGCGC CGCCGCAAAG AAGCACGGCA TGCCGACACC AACCACCAGA TCTCCAAACG CATCGTGGCC GAGGCCGAAC GCACCGGGCG CGGCATTTCC CTCGAAAACC TGACGGGGAT TCGCGCGCGG GTACGGCAAC GCAGGCCCCA ACGGGCCACG CTGCACTCCT GGTCCTTCCA CCAGCTGGGC GCCTTCATCG CCTACAAGGC CCGCCTGAAA GGTGTGCCGG TGGTCTTCGT GGACCCGGCG CACTCCTCAC GCGAGTGCGC CGCATGCTCC TACACTCACA GGGCCAACCG GGTCTCACAG GCCTTGTTCA TCTGCCGGTC GTGCGGCGTC GTTGCGCACG CGGACCGCAA TGCTTCCCGT GTCCTGGCCC GCAGGTGCCA GGAGGCGTGG AACGCGGGGC GGAAGTCACA CGTCCCACCC GGCCACCCCT AA
|
Protein sequence | MDCFDCGLSL AGCTMAGVSK IVVQLGLTPS PEQAAVLAST LRALNTHATR VAKVAHEQGV MRDYELRKHT YRQLRAAEVG SQAARHVIKK VCDAYRTRRA NLDNGDYGPK GSVRRTRIGS TPIAFRAGSA HPYDARDLSF AMDARTISLW TFQGRLQDVP FTGSADQVKA LAGHKRGEAD LLRRDGAWFL AVTVEVPDAP EQEPDGFVGV DLRIINIATT SDGQVMAGRR INRYRRRRLR LRQKLQAKGT RSAKRLLRKR RRKEARHADT NHQISKRIVA EAERTGRGIS LENLTGIRAR VRQRRPQRAT LHSWSFHQLG AFIAYKARLK GVPVVFVDPA HSSRECAACS YTHRANRVSQ ALFICRSCGV VAHADRNASR VLARRCQEAW NAGRKSHVPP GHP
|
| |