Gene Information |
Locus tag | Namu_3040 |
Symbol | |
ID | 8448653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 3338601 |
End bp | 3339866 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645042124 |
Product | Integrase catalytic region |
Protein accession | YP_003202366 |
Protein GI | 258653210 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
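The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The record does not state either convention. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 3338601
END_BP = 3339866
GENE_LENGTH_BP = 1266
PROTEIN_LENGTH_AA = 421


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions the fields agree: 3339866 - 3338601 + 1 = 1266 bp, and 1266 / 3 - 1 = 421 aa.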
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000000772165 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00538259 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCTGATTG TGGATGACTG GGCGGAGATC CGTCGGTTGC ATCGGGCGGA GGGGATGCCG ATCCGGGCGA TCGCTCGGCG TCTGGGGTGT TCGAAGAACA CTGTGAAGCG GGCGTTGGCC GCGCAGGGTC CGCCGAGGTA TGAGCGGGCG ACGGTCGGGT CGGCCGTTGA TGCGTTCGAG CCGGCCATCC GGGCGTTGTT GGCGGAGTTT CCGTCGATGC CGACGTCGGT GATCATGGAG CGGGTTGGGT GGTCGCGGGG CCGCACGGTG TTCTTCGAGC GGGTCGCGGT GTTGCGGCCG TTGTTCGTGC CGCCGGATCC GGCGTCGCGG ACGGAGTATG GGCCGGGGCA GTTGGCGCAG TGCGATCTGT GGTTTCCGCC GGTGGACGTG CCGGTGGGGT TCGATCAGGT CGCCCGTCCA CCGGTGCTAG TGATGGTGTC GGGGTTCTCA CGGGTCATCA CGGCCAGGAT GCTGCCGTCG CGGCAGTCTG CGGATCTACT GGCTGGGCAT TGGGAGCTGC TGTTGGGGTG GGGTCGCTTG CCCAGAGCCC TGGTCTGGGA CAACGAGGCC GCGGTCGGCC GGTGGCGCGG CGGCCGACCG GAACTGACCG AACCGATGAA CGCCTTCCGT GGAACGTTGG GTATCAAGGT CGTGCTCTGC GCGCCGCGCG ACCCTGAGTC CAAGGGCCTG GTCGAGCGGG CGAACGGCTA TCTGGAGACC TCATTCCTGC CCGGCCGCAC GTTCACCTCC CCGGCTGACT TCAACGCCCA GCTGGCTGCG TGGCTGGTCC GGGCGAACCA GCGGCAACAC CGCCGGCTCG GGTGCCGGCC CATCGACCGG TGGGCGGCGG ACCTGGCCGC GATGATGGCG ATGCCACCGG TTGCGCCGGT GGTGGGCTGG ACCGCGTCGC CGCTGCTGCC TCGTGATCAT TACGTCCGCG TCGATTCCAA CGACTATTCG GTGCATCCCG GTGTGGTCGG TCGACGGGTG CAGGTGCTGG CCGATCTGGA TCAGGTCGTG GTGACCTGCG CCGGCACGGT CGTGGCCGCG CACGAACGGT GCTGGGCGCG GCGGCAGACC ATCACCGATG CCGACCATGC CCAGGCCGCG GCGGCGTTAC GCGCCGCCCA CCGCGAACGG GTGCGACGGC CGGTAGAGAC CGACGTCGCG GTCCGCGAAC TCGCCGATTA CGACCGCATC TTCGGCCTGC AGGACGACCT CGACGATCAT CCCAGCGTCG ACGTGGCCGA CGGTGAGGTC GCCTGA
|
Protein sequence | MLIVDDWAEI RRLHRAEGMP IRAIARRLGC SKNTVKRALA AQGPPRYERA TVGSAVDAFE PAIRALLAEF PSMPTSVIME RVGWSRGRTV FFERVAVLRP LFVPPDPASR TEYGPGQLAQ CDLWFPPVDV PVGFDQVARP PVLVMVSGFS RVITARMLPS RQSADLLAGH WELLLGWGRL PRALVWDNEA AVGRWRGGRP ELTEPMNAFR GTLGIKVVLC APRDPESKGL VERANGYLET SFLPGRTFTS PADFNAQLAA WLVRANQRQH RRLGCRPIDR WAADLAAMMA MPPVAPVVGW TASPLLPRDH YVRVDSNDYS VHPGVVGRRV QVLADLDQVV VTCAGTVVAA HERCWARRQT ITDADHAQAA AALRAAHRER VRRPVETDVA VRELADYDRI FGLQDDLDDH PSVDVADGEV A
|
| |
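The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in plain Python. The sketch makes two assumptions. First, translation table 11 is treated as sharing the standard codon assignments. Second, an initial GTG or TTG is read as methionine, which covers this gene's GTG start. NCBI table 11 also lists rarer initiator codons that are not handled here. The helper names `clean`, `gc_content`, and `translate` are hypothetical and are not part of any database tooling.

```python
BASES = "TCAG"
# Standard codon assignments, with codons ordered T, C, A, G at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown in the record.
    print(translate("GTGCTGATTG TGGATGACTG GGCGGAGATC"))
```

Applied to the first 30 bases of the gene sequence, `translate` returns `MLIVDDWAEI`, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare with the GC content field.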