Gene P9303_06301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_06301 
SymboltopA 
ID4776336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp593802 
End bp596552 
Gene Length2751 bp 
Protein Length916 aa 
Translation table11 
GC content55% 
IMG OID640086137 
ProductDNA topoisomerase I 
Protein accessionYP_001016647 
Protein GI124022340 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.059761 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCTTG CTTGCCGGTG CGATAAAGAT TGGAACGGTT ACATATCGTC TGTGGCGAAC 
ACTCTGGTCA TTGTCGAAAG CCCTACCAAG GCGAGAACCA TTCGAGGGTT TCTGCCCAAG
GACTTCCGTG TGGAGGCCTC CATGGGCCAT GTGCGCGACT TGCCCAACAA CGCCAGTGAG
ATCCCTGCGG CCCAGAAGGG GCAGAAATGG GCCAACCTCG GCGTAAACAC CACCGCGGAT
TTCGAACCTC TTTATGTGGT TCCGAAGGAC AAAAAGAAGG TGGTCAAGGA GCTGAAGGCA
GCTTTGAAGG AGGCTGATCA GCTGTTGTTG GCAACTGACG AAGATCGAGA GGGCGAAAGC
ATCAGCTGGC ATTTGCTGCA GCTGTTGGCT CCCAAAGTGC CTGTCAAGCG GATGGTGTTT
CACGAGATCA CCAAAGAAGC CATTGCTAAG GCTCTTGATC AGCCCAGAGA TCTCGACATG
GAGCTGGTCC ATGCCCAGGA AACGCGACGG ATTCTTGACC GTTTGGTGGG ATACACGCTT
TCGCCTCTGT TGTGGAAGAA GGTTGCATGG GGACTCTCCG CCGGTCGGGT GCAGTCAGTT
TCTGTGCGGT TGCTTGTGCA GCGTGAACGT GCCCGTAGGG CTTTCCGTAG CGGCAGCTAC
TGGGACCTAA AGGCCAAACT TGAAAAGGGT GGTGGTCAAT TTGAGGCAAA GCTCACCAGT
CTGGATGGCC AGAAGATTGC TACCGGCAGT GATTTCGATG AAGCGACAGG CGCTTTAAAG
GCTGGCAGAA ATGTTCGACT GCTTGGCGAA TCAGATGCGC TCACTCTTTC CGAGGCCGTG
CGCAGCAGTC AGTGGCGGGT TGAGGCGGTG GAAGAGAAGC CAACGGTACG TAAACCGGTG
CCTCCTTTCA CGACAAGCAC TTTGCAGCAG GAGGCAAATC GCAAGTTGCG GTTTTCGGCC
AGGGAAACGA TGCGGTGTGC CCAGGGGCTT TATGAGCGTG GCTTCATCAC CTATATGCGA
ACTGACTCTG TGCATCTATC CGAGCAGGCT ATTCAGGCTG CTCGGAGCTG CGTGGGGTCA
CGCTATGGCG ATGATTATCT GAGCAAAACT CCACGTCAGT TCAGCACTAA GTCACGCAAT
GCCCAGGAGG CTCACGAAGC GATACGACCA GCGGGTGAAA GCTTCCGTTC CCCAAGTGAA
TCTGGGCTTG AAGGGCGCGA CATGGCCCTA TATGAGTTGA TTTGGAAGCG AACAGTGGCC
AGCCAGATGG CCGAGGCTCG ACTCACCATG CTGGCTGTTG ATCTTCGTGT GGCTGATGCC
AAATTTCGGG CCACGGGTAA GCGCATTGAT TTCCCAGGTT TCTTTCGCGC TTACGTGGAG
GGCAGTGATG ATCCAGACGC TGCCTTGGAA GGCCAGGAAG TTTTGCTGCC TGATTTAGCG
GTTGACGATT CGCCCACGCT GCAGGATGTG GAGGCCCTCG GTCACCAGAC TCAGCCGCCG
GCTCGCTATA GCGAGGCTTC ACTGGTGAAG ATGCTCGAGA AGGAGGGCAT TGGTAGGCCT
TCCACCTACG CCAGCATCAT CGGCACCATC GTTGATCGGG GTTATGCAGC ATTGCAAAAC
AACTCCCTTA TTCCCAGTTT CACTGCTTTT GCTGTAACGG CTCTTCTAGA GGAGCATTTC
CCAGATCTTG TCGATACCAG CTTTACGGCT CGGATGGAAT TCACGCTTGA TGAGATTTCC
ACGGGCAAGG TGCAGTGGTT GCCTTATCTC GAAGGGTTCT ACAAGGGCGA AAAGGGCCTT
GAGAGTCAGG TTCAGCAGCG TGAAGGTGAC ATCGACTCCA GTGTGTCTCG AACTGTGGAT
CTGGAGGGAT TGCCCTGTGT GGTGCGCATC GGTCGTTTTG GGGCCTATCT GGAAGCCAAA
CGAGTGGGTG ATGACGGCGA GGAGGAATCG CTTAAGGCCA CCCTCCCTCA AGAGATCACC
CCTGCTGATC TTGATGCAGA GAAAGCCGAG CTGATTCTCA AGCAGAAAGC TGATGGCCCG
GAATCGATTG GGGAAGACCC GGAAACCGGT GATCAGGTTT ACCTCCTTTT TGGTCAGTAC
GGGCCTTATG TGCAACGAGG CCAGGTGGGT GAGGACAACC CCAAGCCGAA GCGGGCATCC
TTGCCCAAAG GCAAGAAGCC TGATGAGCTC AGCCTTGATG AGGCACTGGG CTTACTGCGT
CTGCCGCGCT TACTGGGAGA GCATCCCGAT GGTGGACGAA TTCAGGCGGG TTTGGGTCGC
TTCGGACCCT ATGTGGTCTG GGATAAGAGC AAGGGAGAGA AGGACTATCG CTCCCTTAAG
GGGGAGGATG ACGTGCTGGC GGTGGGGCTG AGCCGTGCAC TAGAGCTTTT GGCGATGCCC
AAGCGGGGCA GGGGCGGCCG GACTGCGTTG AAAGACCTTG GCATCCCGGA GGGGAGTGAG
GAGACGGTGC AGGTTTTTGA CGGTCCCTAT GGCTTGTATG TCAAGCAGGG CAAGCTCAAT
GCCTCGTTAC CTGAAGGGAA GGGCGTCGAC GACATTTCTC TTGATGTAGC AGTGGAGCTA
TTGGCTGCCA AGGCTTTAAG TAAGAAGACA AGTCGACGCA AAAAGAGCAC TTCAACAACC
AGCAAAAAAC CCGCCGCAAG CAAACCAAAA ACTCCTAAAC CACCTGCTAC TACAAAGACA
GGTCGGTTGC GAGCCAGTGC TGTTCGGGTC ATCAAGCCTG GTGAGGTTTG A
 
Protein sequence
MQLACRCDKD WNGYISSVAN TLVIVESPTK ARTIRGFLPK DFRVEASMGH VRDLPNNASE 
IPAAQKGQKW ANLGVNTTAD FEPLYVVPKD KKKVVKELKA ALKEADQLLL ATDEDREGES
ISWHLLQLLA PKVPVKRMVF HEITKEAIAK ALDQPRDLDM ELVHAQETRR ILDRLVGYTL
SPLLWKKVAW GLSAGRVQSV SVRLLVQRER ARRAFRSGSY WDLKAKLEKG GGQFEAKLTS
LDGQKIATGS DFDEATGALK AGRNVRLLGE SDALTLSEAV RSSQWRVEAV EEKPTVRKPV
PPFTTSTLQQ EANRKLRFSA RETMRCAQGL YERGFITYMR TDSVHLSEQA IQAARSCVGS
RYGDDYLSKT PRQFSTKSRN AQEAHEAIRP AGESFRSPSE SGLEGRDMAL YELIWKRTVA
SQMAEARLTM LAVDLRVADA KFRATGKRID FPGFFRAYVE GSDDPDAALE GQEVLLPDLA
VDDSPTLQDV EALGHQTQPP ARYSEASLVK MLEKEGIGRP STYASIIGTI VDRGYAALQN
NSLIPSFTAF AVTALLEEHF PDLVDTSFTA RMEFTLDEIS TGKVQWLPYL EGFYKGEKGL
ESQVQQREGD IDSSVSRTVD LEGLPCVVRI GRFGAYLEAK RVGDDGEEES LKATLPQEIT
PADLDAEKAE LILKQKADGP ESIGEDPETG DQVYLLFGQY GPYVQRGQVG EDNPKPKRAS
LPKGKKPDEL SLDEALGLLR LPRLLGEHPD GGRIQAGLGR FGPYVVWDKS KGEKDYRSLK
GEDDVLAVGL SRALELLAMP KRGRGGRTAL KDLGIPEGSE ETVQVFDGPY GLYVKQGKLN
ASLPEGKGVD DISLDVAVEL LAAKALSKKT SRRKKSTSTT SKKPAASKPK TPKPPATTKT
GRLRASAVRV IKPGEV