Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1423 |
Symbol | |
ID | 9155572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 1490234 |
End bp | 1491763 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | integrase family protein |
Protein accession | YP_003646390 |
Protein GI | 296139147 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00313784 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCGG TCCACAAGTT CACCAGCAGA GTTGACGGCA AGCCCTACTA CAAGGTCAAG TGGCGCACGC CCGATGGGAA GCACCGCACG AGGGGCGGCT TCGCTCGCCG CAGGGACGCT GAGGCGTACG CGGAGACCGT CGAGTTCAGC GCGCGCCGTG GGCTCACGTT CGACCCACGC AGCGGGGACA TGCTGTTCAG GGCCGCAGGG CAGGCGTGGT TGCAGAGCCG CACCGACCTC AAGATCGCGA CCCGAACCAA CTACACGGGC CAGCTCCGCG CTGATGGGGA GATTGACCGC AGGTTCGGCG GCTACCCCCT CAACAAGATC ACCCGCGAGG ACCTACAGGC CTGGGTGAAC GAGCAGGTAG CAGGCGGTAG TAGTTCCAGC AGCGTCCGCA ACAAGTTCTT CATCGTGCGG ATGATCCTGT CGCAGGCCGT CGTGGACCGT CGCTTGGAGT TCTCGCCCGC CGACCACGTG AAGCTGCCCG CGCCGCGCCG GAAGGGCAAG CAGGCCAGCA CAGCGTCCGG CCAGGCCGCG TCGATGGTCG GCTCTGCGCC ATCAGTCCAC GGATCGACTG AGGACGCCGC ATTCCTCACT GCCGAACAGG TGGAGTACCT GACGGCCGCC ACACCCTGGC CGTACAACAT CCTTGTCCAT ATGGCGGCGT GGACCGGCCT GCGCAGCGGC GAGCTGACAG GCCTACAGAT CGGGGATATC GACCTTGGGC GCAACTCGAC AGTCAGCGTT CAGCGAACGG CCTTGGTTGT GCCTGGTACA CCAGCAGACG GAGACAGCCC GGCAACTGCC CCTCGTGCCG TCTACGACAC CCCCAAGACC CGCCGATCCA GGCGGCGCGT GCCCCTGACC GCCGCGACCG TTGCCGTCCT CCGGGACTAC TTGGCGCCCG CGCCCCGGCT CGACGGCGGC GACCGCCGCT TCAACGCGGC AGCGACCCCA CTAGCCCCCA GGATCGACGC CCCGAGCAGC GCGACCGGCG CGCCGACGTA CGTTCACCCG AGAGCGGGCG ACCCCACCGC ACCGCTGTTC CCGAAGATGC GGTTGGTCGC TGGCCGGCCA ACGGGCAAGC GCGCCCCGCG CCACGAGACC GGACCCCGCG CGGGACAACC GATGACCCCC GAAGAACAAG CCGCACGCCG CACCGTGGCC GAAGCCCAGG CTCGCCTAGA ACTGGACTGG AATGCCGTAC TGCTGCACAA GACCTATTAC AAGGCGGTGT TCCAACCCGC ACTACTCCGC GCCAGCTTAG CCCTGGTGGC CGATGGGCTG CGGCCGATAC CCGAGCACGC GACCGCGCAC AGCCTCCGGC ACACCTACGC GAGCTTCTGC GTCAGCGCGG GCCTGCACCC CAAGCAGATA TCGAGCTATT GCGGGCACGC ATCCGTGAAC ACCACGATGG GCATCTACGC ACACCTATTC GAGGACGACC ACACCGAAGC AATGGCCGCG CTCGGCAGCG TCGGAGCTAC CCGCCGCGAC AACGTCACCC CGCTGCGCGG ATGGGGTTAG
|
Protein sequence | MASVHKFTSR VDGKPYYKVK WRTPDGKHRT RGGFARRRDA EAYAETVEFS ARRGLTFDPR SGDMLFRAAG QAWLQSRTDL KIATRTNYTG QLRADGEIDR RFGGYPLNKI TREDLQAWVN EQVAGGSSSS SVRNKFFIVR MILSQAVVDR RLEFSPADHV KLPAPRRKGK QASTASGQAA SMVGSAPSVH GSTEDAAFLT AEQVEYLTAA TPWPYNILVH MAAWTGLRSG ELTGLQIGDI DLGRNSTVSV QRTALVVPGT PADGDSPATA PRAVYDTPKT RRSRRRVPLT AATVAVLRDY LAPAPRLDGG DRRFNAAATP LAPRIDAPSS ATGAPTYVHP RAGDPTAPLF PKMRLVAGRP TGKRAPRHET GPRAGQPMTP EEQAARRTVA EAQARLELDW NAVLLHKTYY KAVFQPALLR ASLALVADGL RPIPEHATAH SLRHTYASFC VSAGLHPKQI SSYCGHASVN TTMGIYAHLF EDDHTEAMAA LGSVGATRRD NVTPLRGWG
|
| |