Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2622 |
Symbol | |
ID | 8448234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2873317 |
End bp | 2874357 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645041718 |
Product | transposase IS4 family protein |
Protein accession | YP_003201961 |
Protein GI | 258652805 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3039] Transposase and inactivated derivatives, IS5 family |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0000619925 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00356568 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGTCCGCA CGAACCGGGT TCGCGCCGAT ACCACGGTCG TCCCGGCGAA TGTGGCGTAC CCGACCGACT CGGGGTTGCT GGCCAAAGCG ATCCGCCGGA TCGGCACCAG CGTGAAGCGG ATCCACGCGG CCGGCGGCGC GGTCCGCACA AGGGTGCGGG ACCGGTCCCG GTCCGCCGGA GCGAAGGCGC ACGGGGTCGC TGCCAAGCTG CGGTCCCGCG CCCAACTGGG TCGAGACGAG GCCCGCGCCG GGGTGCAGAA GATCACCGGC GAGCTCGCTG ACCTGGCCGA ACAGGCGATC AAGGACACCC GCAAGCTGCT GGTCAACGCC CACCGCGCCG CGGACAGGGC CCTGGCCAGG GCCAAGGCGC TGGCCAAGAC CGGGATCCGC GACGCGGCCG TCGGGCGACG CCGCGGCCGG TTGGTCCGCG CGGTCAACGA CCTACAGAAC CTGGTCGAGG CGACCGAACG GATCATCGAG CAGACCCGGA CCCGGCTGAC CGGCCGCACC CCGGACGGCG CCACCCGGGT GGTCAGCCTG CACGACACCC AGGCCCGGCC GATCGCCAAG GGCCGCCTCG GTAAGCCGGT CGAGTTCGGC TACAAGGGCC AGGTTGTCGA CAACCAGGAT GGCATCGTGC TGGACCACAA CGTCGAACTG GGCAACCCGC CCGACGCACC ACAACTGGCG CCGGCGATCG ACCGGATCAC CGCGCGAACC GGTCGGACGC CGCGCACGGT GACCGCGGAC CGCGGCTACG GCGAGGCCAG TGTCGACCAG CAGCTGACCG ACCGCGGCGT GCGGAACGTC GTCATCCCCC GCAAGGGCAG ACCCGGCGCG GCCCGCCGAG CCGTCGAGCA TCGGCCGGCG TTCCGGCGGA CCGTGAAGTG GCGAACCGGC TGCGAAGGCC GGATCAGCAC CCTCAAACGC GGCTACGGAT GGGACCGCAC GCGCCTGGAC TCGCTCGAAG GAGCCCGGAC CTGGACCGGA CAAGGGATCC TGACCCACAA CCTGGTCAAG ATCGCCGCCC TGACCGCCTG A
|
Protein sequence | MVRTNRVRAD TTVVPANVAY PTDSGLLAKA IRRIGTSVKR IHAAGGAVRT RVRDRSRSAG AKAHGVAAKL RSRAQLGRDE ARAGVQKITG ELADLAEQAI KDTRKLLVNA HRAADRALAR AKALAKTGIR DAAVGRRRGR LVRAVNDLQN LVEATERIIE QTRTRLTGRT PDGATRVVSL HDTQARPIAK GRLGKPVEFG YKGQVVDNQD GIVLDHNVEL GNPPDAPQLA PAIDRITART GRTPRTVTAD RGYGEASVDQ QLTDRGVRNV VIPRKGRPGA ARRAVEHRPA FRRTVKWRTG CEGRISTLKR GYGWDRTRLD SLEGARTWTG QGILTHNLVK IAALTA
|
| |