Gene P9211_04331 details


Gene Information

Locus tag: P9211_04331
Symbol: topA
ID: 5730470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 408102
End bp: 410801
Gene Length: 2700 bp
Protein Length: 899 aa
Translation table: 11
GC content: 42%
IMG OID: 641284790
Product: DNA topoisomerase I
Protein accession: YP_001550318
Protein GI: 159902974
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01051] DNA topoisomerase I, bacterial
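
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length_bp = gene_length(408102, 410801)
print(length_bp)                  # 2700
print(protein_length(length_bp))  # 899
```

Under these assumptions the computed values match the listed Gene Length (2700 bp) and Protein Length (899 aa).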


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0369496
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGCAAG CTCTAGTAAT CGTTGAAAGT CCAACAAAGG CACGAACTAT AAAAAGCTTC 
TTGCCGAAAG ACTTTGAAGT GGTCGCTTCC ATGGGGCATG TCAGGGATCT TCCCAATAGT
GCAAGTGAGA TCCCTGCTGC TCAAAAAGGC GAAAAATGGG CAACTTTAGG GGTGAATACG
ACGGCGGACT TTGAACCTTT GTATGTTGTA CCAAAAGATA AAAAAAAGAT TGTTAGGGAG
CTTAAAACTG CTTTAAAAGG CGCTGACCAG CTTTTGCTTG CAACTGATGA AGATCGAGAA
GGCGAAAGCA TTAGCTGGCA CTTATTGCAG CTTCTGAACC CAAAGGTTCC TGCCAAGAGA
ATGGTTTTTC ATGAGATTAC AAAGGACGCA ATATCTCATG CTCTGACTCA GACAAGAGAT
CTTGATATGG AATTGGTCCA TGCTCAAGAG ACCAGGCGTA TTCTTGACCG ATTAGTGGGA
TACACCCTTT CACCTCTTCT TTGGAAAAAG GTTGCATGGG GACTTTCTGC TGGAAGAGTT
CAATCTGTTG CTGTCCGACT CCTAGTTCTC CGTGAACGTG CCCGTAGAGC TTTTAGAAGC
GCGAATTATT GGGATTTGAA AACTACTTTA CAAAATCAAG GGAACTCATT TGAGGCAAAA
CTGGCAACTC TGGGCGGCCA GAAAATAGCT AGCGGGATTG ATTTTGATGA TTCCACTGGC
TTGATGAAGG ATGGAAATAA TGCTCGCTTG GTCAGTGAAA AAGAAGCAAA AGATCTTGCA
AAAACACTCC AATTGAATGA ATGGAAAGTT AGTTCGATTG AAGAAAAGCC TACAGTTAGG
AGACCAGTAC CCCCATTTAC TACAAGTACT CTTCAACAGG AGTCAAACCG CAAGCTTAGG
CTCTCGACTA GAGAAACGAT GAGATGTGCA CAGGGTCTAT ATGAGAGAGG TTTTATAACT
TATATGCGAA CTGATTCAGT TCATCTCTCA GAGCAGGCAA TTAAAGCTTC TAGAAAATGT
ATTGAGTCTA GATATGGAGC TGATTACTTA AGTAATAAAG TCCGTCAATT TAGCAATAAG
TCTAGAAATG CTCAAGAAGC ACATGAAGCC ATTCGGCCTG CGGGAGCAAC TTTTAAGATG
CCAGATCAAA CAGGGCTTGA AGGTAGAGAT CTGGCGCTTT ATGAATTGAT ATGGAAAAGA
ACGGTTGCGA GTCAAATGCA AGAAGCTCGC TTGACAATGA TTGGAGTTGA AATAACAGTT
GGTGATGCTG TGTTTCGTTC TTCTGGTAAG AGAATTGACT TTCCTGGTTT CTTCCGAGCT
TATGTGGAAG GAAATGATGA TCCAGATGCA GCCTTAGAAG GTCAAGAAGT CCTTCTTCCA
AGTTTGGAGG TGGGAGATAC ACCTATTGTC GAAAAGATAG AGCCGTTAGC TCATCAAACA
CAACCACCTG CCCGCTATAG TGAAGCTTCC TTGGTAAAAA TGCTTGAGAA GGAAGGTATT
GGAAGACCTT CTACGTATTC AAGTATTATT GGAACCATTG TGGATAGAGG TTATTCCTCT
ATCCACAATA ATTCTTTAAT ACCTAGCTTT ACAGCATTTG CAGTAACGGC TTTATTAGAA
GAACATTTCC CAGATCTAGT TGATACTAGG TTTACTGCTA GAATGGAATT GACTTTAGAT
GAAATCTCTA CAGGTAAAGT GGAGTGGTTA CCATATTTAT CAGGCTTTTA TAAAGGAGAG
GATGGCCTTG AAAATCAAGT CGAGAAAAAA GAAGGAGACA TAGACCCAGG TTTATCTAGA
ACTATTGCTT TAGAAGGCCT AAAATGTGTT GTCAGGATTG GACGCTTTGG AGCCTATCTT
GAATCGAGAC GTCTAGGAGA TAATGGCGAA GAAGAGTTGA TTAAAGCCAC ACTCCCTCAA
GAAACGACCC CAGCAGATCT GGATGAAGAG AAAGCAGAGT TAATTCTGAA GCAAAAATCA
GATCAACCTG ATCCTTTGGG GACGGATCCT GAAACAGGTG AAGAGATTTA TTTGCTCTTT
GGTCAATATG GCCCTTATGT TCAGAGAGGG CAAGTAACTG ATGAGGTGCC GAAACCAAAA
AGAGCTTCAG TGCCAAAAGC AGTTAAACCA GAAGATCTGA CTATAGAGGA AGCCCTTGGG
TTGCTTAAAT TGCCACGTGC TCTTGGGGAG CATCCTGATG GGGGTAAGAT TGCTGCAGGT
CTTGGGCGCT TTGGTCCTTA TATTGTTTGG AATAAAGGAA AAGGAGAAAA AGATTATCGA
TCACTAAAAG GAGCTGATGA TGTTCTTGAA GTAAAGATTG AAAGAGCACT TGAGTTATTG
GCTATGCCAA AGAGAGGTAG AGGTGGTAGG ACTGCATTGA AAGACCTTGG CATACCTAAG
GGACAGAAAG ATAAGGTTGA GGTTTATAAC GGCCCCTATG GACTTTACGT AAAACAAGGG
AAAGTCAATG CTTCTTTGCC AAAAGGTAAA AGTGCTGAAG AAATTACCAT CGAAGAAGCA
GTTGAATTGC TCGAAGCTAA ACTTACAAGT AAAAAAACTA AAAAGAAAAA AGTAGCTAAA
CCCAAAAGCT CTACAAAAAG CAAAGCAAAG TCCAATAAAG CGACTCCTAA AGCTCCATCT
ACTACTAAGT CAGGACGATT AAGAGCAAGT GCGGTAAGAG TTATTAAGTC TGGTAATTAA
 
Protein sequence
MAQALVIVES PTKARTIKSF LPKDFEVVAS MGHVRDLPNS ASEIPAAQKG EKWATLGVNT 
TADFEPLYVV PKDKKKIVRE LKTALKGADQ LLLATDEDRE GESISWHLLQ LLNPKVPAKR
MVFHEITKDA ISHALTQTRD LDMELVHAQE TRRILDRLVG YTLSPLLWKK VAWGLSAGRV
QSVAVRLLVL RERARRAFRS ANYWDLKTTL QNQGNSFEAK LATLGGQKIA SGIDFDDSTG
LMKDGNNARL VSEKEAKDLA KTLQLNEWKV SSIEEKPTVR RPVPPFTTST LQQESNRKLR
LSTRETMRCA QGLYERGFIT YMRTDSVHLS EQAIKASRKC IESRYGADYL SNKVRQFSNK
SRNAQEAHEA IRPAGATFKM PDQTGLEGRD LALYELIWKR TVASQMQEAR LTMIGVEITV
GDAVFRSSGK RIDFPGFFRA YVEGNDDPDA ALEGQEVLLP SLEVGDTPIV EKIEPLAHQT
QPPARYSEAS LVKMLEKEGI GRPSTYSSII GTIVDRGYSS IHNNSLIPSF TAFAVTALLE
EHFPDLVDTR FTARMELTLD EISTGKVEWL PYLSGFYKGE DGLENQVEKK EGDIDPGLSR
TIALEGLKCV VRIGRFGAYL ESRRLGDNGE EELIKATLPQ ETTPADLDEE KAELILKQKS
DQPDPLGTDP ETGEEIYLLF GQYGPYVQRG QVTDEVPKPK RASVPKAVKP EDLTIEEALG
LLKLPRALGE HPDGGKIAAG LGRFGPYIVW NKGKGEKDYR SLKGADDVLE VKIERALELL
AMPKRGRGGR TALKDLGIPK GQKDKVEVYN GPYGLYVKQG KVNASLPKGK SAEEITIEEA
VELLEAKLTS KKTKKKKVAK PKSSTKSKAK SNKATPKAPS TTKSGRLRAS AVRVIKSGN
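
The gene sequence begins with GTG while the protein sequence begins with M. That is expected under translation table 11, where GTG is one of the alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation in plain Python, applied to the first 60 nt of the gene sequence. The function name `translate_cds` and the rule of rewriting the first residue to M are illustrative choices, not the tool that produced this record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order; the amino acid
# assignments of table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored.

    If the first codon is a table 11 start codon, it is reported as M.
    Stop codons are reported as '*'.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First line of the gene sequence above (60 nt, 20 codons).
first_line = "GTGGCGCAAG CTCTAGTAAT CGTTGAAAGT CCAACAAAGG CACGAACTAT AAAAAGCTTC"
print(translate_cds(first_line))  # MAQALVIVESPTKARTIKSF
```

The output matches the first 20 residues of the protein sequence. Because the start-codon rule is applied to whatever codon comes first, the function should only be given fragments that begin at the true start of the CDS or with a codon outside the start set.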