Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shew_2224 |
Symbol | |
ID | 4923220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella loihica PV-4 |
Kingdom | Bacteria |
Replicon accession | NC_009092 |
Strand | - |
Start bp | 2594700 |
End bp | 2597396 |
Gene Length | 2697 bp |
Protein Length | 898 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640163809 |
Product | DNA topoisomerase I |
Protein accession | YP_001094349 |
Protein GI | 127513152 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.648811 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00821912 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGCCAA TTTACAGAAT TTTGAGACTT TTTGGCGCAG ATCATTCTAT GGGTAAATCG CTAGTTATTG TCGAATCACC GGCCAAAGCC AAGACTATTA ACAAATATCT CGGCAAAGAT TACATCGTAA AATCGAGTGT GGGTCACATC CGTGATCTTC CTACCTCATC CTCCAGTGAC GCCAGTTCGT CGACTAAATC GCCCGCCGAA GTGCGCAAGA TGTCCACCGA AGAGAAGGCT AAATATAAGG CTGACAAAGC GAAGAAAGCC CTGGTTGCGC GTATGGGCGT CGACCCAGAG CGAGGCTGGA AAGCCAAGTA TCAGACGCTT CCGGGCAAAG AGAAAGTTGT CAAAGAATTA CAGTCACTTG CTGAAAATGC AGATCATATC TTCCTCGCAA CCGATTTGGA TAGAGAGGGG GAGGCAATTG CATGGCATCT TCAAGAAGTG ATTGGTGGGG ATCCGGAGCG CTATCAGCGT GTGGTCTTTA ACGAGATCAC CAAGAGCGCA ATTCAAGAGG CGTTCAGCCA TCCGTCTCAG CTCAACACCA ATATGGTCAA TGCCCAGCAG GCGAGACGTT TTCTCGATCG CGTAGTGGGT TTCATGGTGT CACCTCTGCT GTGGAAAAAA GTGGCCCGTG GCCTCTCTGC AGGGCGTGTG CAATCGGTAG CAACGCGTCT TGTGGTCGAG CGCGAAGGCG AGATCAAGGC GTTCGTACCT GAAGAGTTCT GGGACATTCA TGCCGAGCTG ACTACCCTAG GTAACGAAGG GCTCAAGATG CAGGTGGCTA AGTTTAAGGG GGATAACTTT AACCCTGTAA ACGAAGCGCA GGCCATGGTT GCGGTAAACG CACTAAAAGA CAGCAAGTAC CAGGTTTCTA GCCGTGAAGT CAAAGCGACT TCTAGTAAGC CCTCGGCGCC CTTCATTACC TCGACGCTTC AGCAGGCGGC CAGTACCCGT CTCGGCTTTG GCGTGAAGAA GACCATGATG ATGGCCCAGC GTCTCTATGA GGCGGGTCAT ATCACCTATA TGCGTACCGA CTCGACCAAT CTGAGTCAGG AAGCCCTCGA CAGTGTGCGT GACATGATCG GCAAGGAGTA TGGCGATAAG TATCTGCCCG ATGCGCCGAT CCGTTATGGC AGCAAAGAGG GAGCACAGGA AGCTCACGAA GCGATTCGTC CGTCCGATGT GAACGTTAGT GCCACCAGCC TTAGCGATAT GGAAAGAGAT GCCCAGCGTC TCTATGAGTT GATCTGGCGC CAGTTTGTTG CCTGTCAGAT GACCCCAGCC AAATATGATG CGACGCGTTT GACCGTGACC GCAGGGGATT ACGAACTCAA GGCCAGTGGC CGTACCCTTA AGTTTGACGG TTGGACTCGG GTTCAGACCG CGCTTAAGAA GAAAAACGAG GAAGACAACA CGCTGCCGAT GGTTGCCGAA GGCGATGTGC TGGCATTAGA AGAGTTGTTA CCTAAGCAGC ACTTCACTAA ACCGCCCGCG CGTTATAGCG AAGCATCACT GGTGAAAGAG CTTGAGAAGC GTGGCATTGG TCGTCCATCG ACCTATGCGA CCATCATCTC GACCATTCAA GACCGTGGCT ATGTGAAGGT TGAGAATCGT CGGTTCTACG CCGAGAAGAT GGGTGAGATC GTTAGCGAGA GCCTCATCGG CAGCTTCGAA GAGCTGATGA GTTACGACTT TACCGCCGGC ATGGAGCAGA CCCTAGATAA CGTGGCTCAG GGCCAGCTCG AATGGAAGAA GGTGCTGGAT AACTTCTATA AAGGCTTGAC CGCTCAGCTT GAAAAGGCCG AGCTACCGCC AGAAGAGGGC GGCATGCGCC CTAACGAGAT GGTGCTGACC GATATCGCCT GTCCCACCTG TGGACGCCCA ATGGGCATTC GCACCGGTAC TACCGGCGTG TTCCTGGGCT GTTCAGGTTA TGCCTTGCCA CCGAAAGAGC GCTGTAAGAC GACCATGAAC CTGACGCCGG GCGAAGAGGC CGTCAGCGAG AACAGCGAAG ATGCCGAAAC CGATGCACTG CGCGCCAAGC ACAGATGTGA TGTTTGCGGC ACGGCCATGG ATAGCTATCT TATCGATGAG AAGCGCAAGC TGCATGTGTG TGGTAATAAC CCCATCTGCG GCGGCTACGA AGTCGAGCAA GGGCAGTTTA AGATCAAGGG TTACGAAGGG CCGATTATCG AATGCGACCG CTGTGGCAAC GATATGGAGC TGAAGAATGG CCGTTTCGGT AAGTATTTTG GTTGTACCAA CAGTGAGTGT AAGAATACGC GTAAACTGTT GAAAAATGGT GAAGTGGCTC CGCCGAAGGA AGATCCGATC CACCTGCCAG AGCTGAAATG CACCAAGTCT GATGGCTATT TCGTGCTCAG AGACGGCGCC GCAGGCATCT TCCTGGCCGC GAGCACCTTC CCTAAATCCC GTGAAACCCG CGCGCCTCTG GTGGAAGAGC TGGTGAAGTA CCGTGAGCTC TTATGGCCCA AGTATCAATA TTTAGCCGAT GCCCCTGTAG CGGATGAAGA CGGTAATAAG GCGGTTGTGC GCTTTAGCCG TAAGACCAAG GAGCAGTATG TGGCAACCGA AATCGATGGT AAGGCGACCG GTTGGACGGC CAAGTTTGTC GGCGGTAAGT GGGTAAGCGA AGCCAAGGCT AAGCCAAAGG CGAAGAAGAA AGCCTAA
|
Protein sequence | MEPIYRILRL FGADHSMGKS LVIVESPAKA KTINKYLGKD YIVKSSVGHI RDLPTSSSSD ASSSTKSPAE VRKMSTEEKA KYKADKAKKA LVARMGVDPE RGWKAKYQTL PGKEKVVKEL QSLAENADHI FLATDLDREG EAIAWHLQEV IGGDPERYQR VVFNEITKSA IQEAFSHPSQ LNTNMVNAQQ ARRFLDRVVG FMVSPLLWKK VARGLSAGRV QSVATRLVVE REGEIKAFVP EEFWDIHAEL TTLGNEGLKM QVAKFKGDNF NPVNEAQAMV AVNALKDSKY QVSSREVKAT SSKPSAPFIT STLQQAASTR LGFGVKKTMM MAQRLYEAGH ITYMRTDSTN LSQEALDSVR DMIGKEYGDK YLPDAPIRYG SKEGAQEAHE AIRPSDVNVS ATSLSDMERD AQRLYELIWR QFVACQMTPA KYDATRLTVT AGDYELKASG RTLKFDGWTR VQTALKKKNE EDNTLPMVAE GDVLALEELL PKQHFTKPPA RYSEASLVKE LEKRGIGRPS TYATIISTIQ DRGYVKVENR RFYAEKMGEI VSESLIGSFE ELMSYDFTAG MEQTLDNVAQ GQLEWKKVLD NFYKGLTAQL EKAELPPEEG GMRPNEMVLT DIACPTCGRP MGIRTGTTGV FLGCSGYALP PKERCKTTMN LTPGEEAVSE NSEDAETDAL RAKHRCDVCG TAMDSYLIDE KRKLHVCGNN PICGGYEVEQ GQFKIKGYEG PIIECDRCGN DMELKNGRFG KYFGCTNSEC KNTRKLLKNG EVAPPKEDPI HLPELKCTKS DGYFVLRDGA AGIFLAASTF PKSRETRAPL VEELVKYREL LWPKYQYLAD APVADEDGNK AVVRFSRKTK EQYVATEIDG KATGWTAKFV GGKWVSEAKA KPKAKKKA
|
| |