Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2521 |
Symbol | |
ID | 8448132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2772772 |
End bp | 2774625 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645041631 |
Product | integrase family protein |
Protein accession | YP_003201875 |
Protein GI | 258652719 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0000524212 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00558917 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGGGA CCGCCGAATC GACCGCGGCC CGGGCCGAGG AACCCGGGAT CGGCCAGGCT CGGGCGGCCT ACCTGGCCGA CGCTGGCACC CAGCCGGCAC GACTGTCCCG GTCGCGGAGC ATCGGGCGGT TCGTCACCGA GTTCGGTGAC GTGAGCGGTT GGCGGGCCTG CTCGGTGCCC GAACGGCTCG CCGCCGGAGA CCAGGCGCGA GCCTTCGCGC ACTTCGCTGT CGTGCACGAC CGGGTCCCGG TCGAGGCCGA GTACGTCGCA GCCGTGACGT CGAGGTGGGG CCATCACGTC GCGAACCGAG ATCCGGAGCA GGCTAACCAA TTCCGGCGCC AGGCCGCGTC GTTGAGTTTC AATCGGTTGG AGATCGACAA GATGTGGTCC AAGCTCGCCC AGATCTGCGT CATCACCGGC AGCCGCCCGG ACGAACTGAG CAGCGACAAC TACGTGGCCG GGCGGGCCGC GTTCTGGGCC GCGGTCGTCG CCAAACACGG CGGCACGATC CCGACAACAC TGGCGACCCC CCTGTTCGGC CTGGACGCCG TGATGTTCCA CCGGGGCCAA GCGCCCCGAC CCACGACACG CAAGTCATGG GCTGCCCGGT CGGTGCCGGA GACCAGCTGG GACCAGATCA CCGCCGCCGC CCCGATGATG GCCGCCACCA TGCACCGCTA CCTGGACCAG CTGCGGGTCA GCCTGCGCGC TTCATCGGTC GCCAGCATCG AGATCACCCT GCGGCAGATC GCCGGGCACC TGATCTCCAC CAGCAGCGTC ACGACGGTCG CCGACATCGG CCGGCCGCAG ATCGAGGCCT ACGGGACCTG GCTGGCCGGG CGTGGCGGCT ACCGCAAGAA CAGCACAATC AGCAAAACGA CCATCGGGAT GCGGATGTGT CACCTCGGCG CGTTCTTCCG CCGGATCATC GAATGGGGCT ACCCCGACGC ACCCCTTCGG CCACCGGTCT TCTCCTCCGA CCGGCCGGCC AAAGACAAGC CGCTGCCCCG GTTCCTCGAC GACGCTGCTA CCGCCAAGTT CATGGCCGCG GCACGGGAGC TGCCCGACCC GTTCGGTCGG CTCGCCATCG AGATCTTGGC CCGCACCGGC ATGCGCAAGG GTGAGCTACT CGGTCTGACC ACCGACGCCG TTGTGCAGAT CGGCTCCGCC TACTGGCTGC GGATCCCTGT CGGCAAACTC CACAACGACC GCTACGTTCC TCTCCACCCG CAACTGAAAA CCATGATCGA CAACTGGCTC GAACAGCGAC CCGACTGGCA AAACAGCCAC CTGCTGTTTA CCGACCGGGG CCGGCCGATC CCACCCACCC GAGTCGACCA CGCCGTTCAA ACAGCCGCCA CCGCAGCCGG TATCGGTCAC GTGCACCCAC ACCAACTCCG GCACACGCTA GCCACCCAGG CAATCAACCG CGGCATGAGC CTCGAAGCGA TCGCCGCGCT GCTGGGCCAC AAGACGATGA GCATGACCCT GGTCTACGCG CGCATCGCCG ACCGCACCGT CGCCGACCAG TACTTCACCG TCACCGAGAA GGTCCAAGCT CTCTACCAAC AGCAGCAACC CGCAATGCTA CCCGCCGCGG ACGAACCAGC CCCGATGCGC AAACTCCGCG CCGAACACAG CAAACGCATG CTCGGCAACG GCTTCTGCAC CCGACCCGCC GAACTCGAAT GCCACTACGA AACGATCTGC GAGTCCTGCA CCTTCTTCGT CACCACCATC GAATTCCGAC CCACCCTGCA GGCTCAACGT GACGACGCCG CCCGCAAGGG TCAAAGCGGC CGTCAGAAGG TCTACGACGG ACTCCTTCAG CGACTCGCCG ACACCGCTAC TTGA
|
Protein sequence | MSGTAESTAA RAEEPGIGQA RAAYLADAGT QPARLSRSRS IGRFVTEFGD VSGWRACSVP ERLAAGDQAR AFAHFAVVHD RVPVEAEYVA AVTSRWGHHV ANRDPEQANQ FRRQAASLSF NRLEIDKMWS KLAQICVITG SRPDELSSDN YVAGRAAFWA AVVAKHGGTI PTTLATPLFG LDAVMFHRGQ APRPTTRKSW AARSVPETSW DQITAAAPMM AATMHRYLDQ LRVSLRASSV ASIEITLRQI AGHLISTSSV TTVADIGRPQ IEAYGTWLAG RGGYRKNSTI SKTTIGMRMC HLGAFFRRII EWGYPDAPLR PPVFSSDRPA KDKPLPRFLD DAATAKFMAA ARELPDPFGR LAIEILARTG MRKGELLGLT TDAVVQIGSA YWLRIPVGKL HNDRYVPLHP QLKTMIDNWL EQRPDWQNSH LLFTDRGRPI PPTRVDHAVQ TAATAAGIGH VHPHQLRHTL ATQAINRGMS LEAIAALLGH KTMSMTLVYA RIADRTVADQ YFTVTEKVQA LYQQQQPAML PAADEPAPMR KLRAEHSKRM LGNGFCTRPA ELECHYETIC ESCTFFVTTI EFRPTLQAQR DDAARKGQSG RQKVYDGLLQ RLADTAT
|
| |