Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_06301 |
Symbol | topA |
ID | 4776336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 593802 |
End bp | 596552 |
Gene Length | 2751 bp |
Protein Length | 916 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640086137 |
Product | DNA topoisomerase I |
Protein accession | YP_001016647 |
Protein GI | 124022340 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.059761 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCTTG CTTGCCGGTG CGATAAAGAT TGGAACGGTT ACATATCGTC TGTGGCGAAC ACTCTGGTCA TTGTCGAAAG CCCTACCAAG GCGAGAACCA TTCGAGGGTT TCTGCCCAAG GACTTCCGTG TGGAGGCCTC CATGGGCCAT GTGCGCGACT TGCCCAACAA CGCCAGTGAG ATCCCTGCGG CCCAGAAGGG GCAGAAATGG GCCAACCTCG GCGTAAACAC CACCGCGGAT TTCGAACCTC TTTATGTGGT TCCGAAGGAC AAAAAGAAGG TGGTCAAGGA GCTGAAGGCA GCTTTGAAGG AGGCTGATCA GCTGTTGTTG GCAACTGACG AAGATCGAGA GGGCGAAAGC ATCAGCTGGC ATTTGCTGCA GCTGTTGGCT CCCAAAGTGC CTGTCAAGCG GATGGTGTTT CACGAGATCA CCAAAGAAGC CATTGCTAAG GCTCTTGATC AGCCCAGAGA TCTCGACATG GAGCTGGTCC ATGCCCAGGA AACGCGACGG ATTCTTGACC GTTTGGTGGG ATACACGCTT TCGCCTCTGT TGTGGAAGAA GGTTGCATGG GGACTCTCCG CCGGTCGGGT GCAGTCAGTT TCTGTGCGGT TGCTTGTGCA GCGTGAACGT GCCCGTAGGG CTTTCCGTAG CGGCAGCTAC TGGGACCTAA AGGCCAAACT TGAAAAGGGT GGTGGTCAAT TTGAGGCAAA GCTCACCAGT CTGGATGGCC AGAAGATTGC TACCGGCAGT GATTTCGATG AAGCGACAGG CGCTTTAAAG GCTGGCAGAA ATGTTCGACT GCTTGGCGAA TCAGATGCGC TCACTCTTTC CGAGGCCGTG CGCAGCAGTC AGTGGCGGGT TGAGGCGGTG GAAGAGAAGC CAACGGTACG TAAACCGGTG CCTCCTTTCA CGACAAGCAC TTTGCAGCAG GAGGCAAATC GCAAGTTGCG GTTTTCGGCC AGGGAAACGA TGCGGTGTGC CCAGGGGCTT TATGAGCGTG GCTTCATCAC CTATATGCGA ACTGACTCTG TGCATCTATC CGAGCAGGCT ATTCAGGCTG CTCGGAGCTG CGTGGGGTCA CGCTATGGCG ATGATTATCT GAGCAAAACT CCACGTCAGT TCAGCACTAA GTCACGCAAT GCCCAGGAGG CTCACGAAGC GATACGACCA GCGGGTGAAA GCTTCCGTTC CCCAAGTGAA TCTGGGCTTG AAGGGCGCGA CATGGCCCTA TATGAGTTGA TTTGGAAGCG AACAGTGGCC AGCCAGATGG CCGAGGCTCG ACTCACCATG CTGGCTGTTG ATCTTCGTGT GGCTGATGCC AAATTTCGGG CCACGGGTAA GCGCATTGAT TTCCCAGGTT TCTTTCGCGC TTACGTGGAG GGCAGTGATG ATCCAGACGC TGCCTTGGAA GGCCAGGAAG TTTTGCTGCC TGATTTAGCG GTTGACGATT CGCCCACGCT GCAGGATGTG GAGGCCCTCG GTCACCAGAC TCAGCCGCCG GCTCGCTATA GCGAGGCTTC ACTGGTGAAG ATGCTCGAGA AGGAGGGCAT TGGTAGGCCT TCCACCTACG CCAGCATCAT CGGCACCATC GTTGATCGGG GTTATGCAGC ATTGCAAAAC AACTCCCTTA TTCCCAGTTT CACTGCTTTT GCTGTAACGG CTCTTCTAGA GGAGCATTTC CCAGATCTTG TCGATACCAG CTTTACGGCT CGGATGGAAT TCACGCTTGA TGAGATTTCC ACGGGCAAGG TGCAGTGGTT GCCTTATCTC GAAGGGTTCT ACAAGGGCGA AAAGGGCCTT GAGAGTCAGG TTCAGCAGCG TGAAGGTGAC ATCGACTCCA GTGTGTCTCG AACTGTGGAT CTGGAGGGAT TGCCCTGTGT GGTGCGCATC GGTCGTTTTG GGGCCTATCT GGAAGCCAAA CGAGTGGGTG ATGACGGCGA GGAGGAATCG CTTAAGGCCA CCCTCCCTCA AGAGATCACC CCTGCTGATC TTGATGCAGA GAAAGCCGAG CTGATTCTCA AGCAGAAAGC TGATGGCCCG GAATCGATTG GGGAAGACCC GGAAACCGGT GATCAGGTTT ACCTCCTTTT TGGTCAGTAC GGGCCTTATG TGCAACGAGG CCAGGTGGGT GAGGACAACC CCAAGCCGAA GCGGGCATCC TTGCCCAAAG GCAAGAAGCC TGATGAGCTC AGCCTTGATG AGGCACTGGG CTTACTGCGT CTGCCGCGCT TACTGGGAGA GCATCCCGAT GGTGGACGAA TTCAGGCGGG TTTGGGTCGC TTCGGACCCT ATGTGGTCTG GGATAAGAGC AAGGGAGAGA AGGACTATCG CTCCCTTAAG GGGGAGGATG ACGTGCTGGC GGTGGGGCTG AGCCGTGCAC TAGAGCTTTT GGCGATGCCC AAGCGGGGCA GGGGCGGCCG GACTGCGTTG AAAGACCTTG GCATCCCGGA GGGGAGTGAG GAGACGGTGC AGGTTTTTGA CGGTCCCTAT GGCTTGTATG TCAAGCAGGG CAAGCTCAAT GCCTCGTTAC CTGAAGGGAA GGGCGTCGAC GACATTTCTC TTGATGTAGC AGTGGAGCTA TTGGCTGCCA AGGCTTTAAG TAAGAAGACA AGTCGACGCA AAAAGAGCAC TTCAACAACC AGCAAAAAAC CCGCCGCAAG CAAACCAAAA ACTCCTAAAC CACCTGCTAC TACAAAGACA GGTCGGTTGC GAGCCAGTGC TGTTCGGGTC ATCAAGCCTG GTGAGGTTTG A
|
Protein sequence | MQLACRCDKD WNGYISSVAN TLVIVESPTK ARTIRGFLPK DFRVEASMGH VRDLPNNASE IPAAQKGQKW ANLGVNTTAD FEPLYVVPKD KKKVVKELKA ALKEADQLLL ATDEDREGES ISWHLLQLLA PKVPVKRMVF HEITKEAIAK ALDQPRDLDM ELVHAQETRR ILDRLVGYTL SPLLWKKVAW GLSAGRVQSV SVRLLVQRER ARRAFRSGSY WDLKAKLEKG GGQFEAKLTS LDGQKIATGS DFDEATGALK AGRNVRLLGE SDALTLSEAV RSSQWRVEAV EEKPTVRKPV PPFTTSTLQQ EANRKLRFSA RETMRCAQGL YERGFITYMR TDSVHLSEQA IQAARSCVGS RYGDDYLSKT PRQFSTKSRN AQEAHEAIRP AGESFRSPSE SGLEGRDMAL YELIWKRTVA SQMAEARLTM LAVDLRVADA KFRATGKRID FPGFFRAYVE GSDDPDAALE GQEVLLPDLA VDDSPTLQDV EALGHQTQPP ARYSEASLVK MLEKEGIGRP STYASIIGTI VDRGYAALQN NSLIPSFTAF AVTALLEEHF PDLVDTSFTA RMEFTLDEIS TGKVQWLPYL EGFYKGEKGL ESQVQQREGD IDSSVSRTVD LEGLPCVVRI GRFGAYLEAK RVGDDGEEES LKATLPQEIT PADLDAEKAE LILKQKADGP ESIGEDPETG DQVYLLFGQY GPYVQRGQVG EDNPKPKRAS LPKGKKPDEL SLDEALGLLR LPRLLGEHPD GGRIQAGLGR FGPYVVWDKS KGEKDYRSLK GEDDVLAVGL SRALELLAMP KRGRGGRTAL KDLGIPEGSE ETVQVFDGPY GLYVKQGKLN ASLPEGKGVD DISLDVAVEL LAAKALSKKT SRRKKSTSTT SKKPAASKPK TPKPPATTKT GRLRASAVRV IKPGEV
|
| |