Gene P9301_04601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_04601 
SymboltopA 
ID4910970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp400066 
End bp402672 
Gene Length2607 bp 
Protein Length868 aa 
Translation table11 
GC content36% 
IMG OID640160038 
ProductDNA topoisomerase I 
Protein accessionYP_001090684 
Protein GI126695798 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGATCACA CACTTGTTAT TGTTGAAAGT CCCACCAAAG CAAAAACTAT AAGAAAGTTT 
TTGCCTTCTA ACTATGAAGT TCTCGCTTCA ATGGGACACG TAAGAGATCT TCCAAAAGGA
GCTGCTGAAA TACCTGCTGC GGTTAAAAAG GAAAAATGGT CAAGGATAGG AGTTAATACA
ACAGAAGATT TTGAACCACT TTACATAGTT CCTAAAGATA AGAAAAAGGT TGTTAAAGAG
CTGAAAGATG CTTTGAAAGG TGCTACCCAA CTATTACTGG CAACTGATGA AGATAGAGAG
GGAGAGAGTA TTAGCTGGCA TCTTCTGCAA ATACTGAAGC CTAAAATACC AACTAAGAGA
ATGGTTTTTC ATGAAATTAC AAAAAAGGCA ATTAATAAAG CTTTAGATCA AACAAGAGAA
ATTGATATGG AACTTGTTCA GGCTCAAGAA ACCAGAAGAA TCTTGGACAG GCTTTTTGGA
TATGAATTAT CTCCTTTACT TTGGAAGAAG GTAGCCCCCA GATTATCTGC TGGTCGGGTT
CAATCAGTTT CTGTAAGGCT TCTTGTTAGA AGAGAGAGAG AAAGAAGATC CTTTAAAAAA
GCTAGTTACT GGGGAATTAA AGCTTCCCTA GTAAAAGATA ATATAACTTT TGAAACTAAA
TTATTCAGTT TAAACGGTCA ACGAATTTCT AACGGTTCCG ATTTCGACGA ACAAACCGGT
AAATTAAAAG AAGGGAACAA ATCTTTAATA ATTGGAGAAG AACAAGTAAA TGACTTATTG
AAGACTTTTT CCTCTGAGGA TTGGTTAGTC TCAAAAATCG AAAAAAAGCC ATCCACTCGT
AAGCCAGTTC CTCCATTTAC AACAAGCACA TTACAACAAG AAGCAAATAG GAAGCTTCGT
TTGTCTGCAA GAGAAACTAT GAGATGTGCG CAAGGGCTAT ATGAGAGAGG TTTTATAACG
TATATGAGAA CTGATTCAGT TCATCTCTCC GAACAAGCCA CAAGAGCTGC TAGAGAATGT
GTTAGTTCTA TGTATGGAAA AGAATATTTA TCTAACTCAC CAAGACAATT TAATTCAACT
GCAAGAAATG CTCAAGAAGC TCACGAAGCT ATTAGGCCTG CAGGTGAGGT ATTTAAAACA
CCAAAGGAAA CTAATCTAAC TGGTAGAGAT TTATCACTTT ACGATTTAAT TTGGAAAAGA
ACTGTAGCTA GTCAAATGGC TGAAGCTAGG CTAACAATGA TTAATGCTGA AATTAGCGTA
GGGGATGGAA TATTTAAATC GAGTGGGAAA AGTATTGATT TCGCAGGATT CTTCAGAGCT
TATGTCGAGG GAAGTGATGA TCCAAGTTCA TCCCTTGAAC AACAAGAAAT TATTCTCCCA
AACTTAACAA CTGGAACATG TCTTGATGTT ACGAAGAAGG AATCTACTTT TCATGAAACT
AAACCTCCTG CAAGATATAC AGAGGCCGCA TTAGTTAAAG TTCTTGAAAA AGAAGGGATT
GGAAGACCTT CTACCTATGC CAGTATTATT GGGACCATAG TTGATAGAGG TTATGCGAAT
ATATCTTCCA ATACTTTGGC TCCAACTTTT ACAGCTTTTG CTGTTACTGC TCTATTAGAA
GAACATTTTC CTGATCTTGT TGATACTACT TTTACTGCAA AAATGGAATC TTCATTGGAT
GAAATATCTT CAGGAAATCT TGAGTGGCTG CCATACCTAG AAACTTTCTA TAAAGGTAAA
AATGGTTTGG AGGTAAAGGT TCAGAAAACA GAGGGTGATA TTGATGGTAA AGCTTATAGA
CAAGTTGATT TCGAAGACCT TCCTTGTGTA GTAAGAATAG GCTCTAACGG ACCTTGGCTA
GAAGGTACAA AAATTGATGA ATCTGGTAAT GAAATTCAGG CGAAAGGTAA TCTTCCAATG
GATATTACTC CTGGAGATTT AGACATAAAG CAAGTTGATC AAATTTTAAG TGGCCCATCG
GATCTTGGCA CTGATCCAAA AACTGGGGAA AAAGTCTTTT TAAGATTTGG ACCTTATGGA
CCTTACGTAC AATTGGGAAA TAATGATCAA AATAAAGCTA AACCAAGAAG AGCTTCATTA
CCTAAAGAGT TGAAAACTGA TGATCTAACT CTAGATGAGG CTCTTGTACT TTTAAGCTTG
CCTAGATTGT TAGGAGTTCA TCCTGAAGGA GGAGTTGTCG AGGCTGATAG AGGAAGATTT
GGCCCCTATA TCAAATGGAT TAAAAATGAA AATGAATCTG AAAACAGATC CTTAAAGAAA
GAGGATGATG TTTTTACAGT TGATATAGAA CGAGCATTAG AAATTCTTGC GATGCCAAAA
ATGGGTAGAG GTGGTCAAGA GGTACTTAAA GACTTTGGAA AACCGAAAGA ATTTAAAGAA
AAAATTCAAA TATTAAATGG AAGATATGGG GTCTATTTAA AATGTGGTAA AACCAATGTT
TCGATTGCCA AAGATACTGA CATAGAAAAA TTTACCATAG ATGACGCAGT ATCTCTTTTA
GAAGAAAAAC TAAAAGATAA AAAAGGTTCA ATTTTAAAAA AAACAAAGAT TAGTAATAAA
AAAACTACCA GGAAAAAGAA AAGTTAG
 
Protein sequence
MDHTLVIVES PTKAKTIRKF LPSNYEVLAS MGHVRDLPKG AAEIPAAVKK EKWSRIGVNT 
TEDFEPLYIV PKDKKKVVKE LKDALKGATQ LLLATDEDRE GESISWHLLQ ILKPKIPTKR
MVFHEITKKA INKALDQTRE IDMELVQAQE TRRILDRLFG YELSPLLWKK VAPRLSAGRV
QSVSVRLLVR RERERRSFKK ASYWGIKASL VKDNITFETK LFSLNGQRIS NGSDFDEQTG
KLKEGNKSLI IGEEQVNDLL KTFSSEDWLV SKIEKKPSTR KPVPPFTTST LQQEANRKLR
LSARETMRCA QGLYERGFIT YMRTDSVHLS EQATRAAREC VSSMYGKEYL SNSPRQFNST
ARNAQEAHEA IRPAGEVFKT PKETNLTGRD LSLYDLIWKR TVASQMAEAR LTMINAEISV
GDGIFKSSGK SIDFAGFFRA YVEGSDDPSS SLEQQEIILP NLTTGTCLDV TKKESTFHET
KPPARYTEAA LVKVLEKEGI GRPSTYASII GTIVDRGYAN ISSNTLAPTF TAFAVTALLE
EHFPDLVDTT FTAKMESSLD EISSGNLEWL PYLETFYKGK NGLEVKVQKT EGDIDGKAYR
QVDFEDLPCV VRIGSNGPWL EGTKIDESGN EIQAKGNLPM DITPGDLDIK QVDQILSGPS
DLGTDPKTGE KVFLRFGPYG PYVQLGNNDQ NKAKPRRASL PKELKTDDLT LDEALVLLSL
PRLLGVHPEG GVVEADRGRF GPYIKWIKNE NESENRSLKK EDDVFTVDIE RALEILAMPK
MGRGGQEVLK DFGKPKEFKE KIQILNGRYG VYLKCGKTNV SIAKDTDIEK FTIDDAVSLL
EEKLKDKKGS ILKKTKISNK KTTRKKKS