Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_04331 |
Symbol | topA |
ID | 5730470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 408102 |
End bp | 410801 |
Gene Length | 2700 bp |
Protein Length | 899 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641284790 |
Product | DNA topoisomerase I |
Protein accession | YP_001550318 |
Protein GI | 159902974 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0369496 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCAAG CTCTAGTAAT CGTTGAAAGT CCAACAAAGG CACGAACTAT AAAAAGCTTC TTGCCGAAAG ACTTTGAAGT GGTCGCTTCC ATGGGGCATG TCAGGGATCT TCCCAATAGT GCAAGTGAGA TCCCTGCTGC TCAAAAAGGC GAAAAATGGG CAACTTTAGG GGTGAATACG ACGGCGGACT TTGAACCTTT GTATGTTGTA CCAAAAGATA AAAAAAAGAT TGTTAGGGAG CTTAAAACTG CTTTAAAAGG CGCTGACCAG CTTTTGCTTG CAACTGATGA AGATCGAGAA GGCGAAAGCA TTAGCTGGCA CTTATTGCAG CTTCTGAACC CAAAGGTTCC TGCCAAGAGA ATGGTTTTTC ATGAGATTAC AAAGGACGCA ATATCTCATG CTCTGACTCA GACAAGAGAT CTTGATATGG AATTGGTCCA TGCTCAAGAG ACCAGGCGTA TTCTTGACCG ATTAGTGGGA TACACCCTTT CACCTCTTCT TTGGAAAAAG GTTGCATGGG GACTTTCTGC TGGAAGAGTT CAATCTGTTG CTGTCCGACT CCTAGTTCTC CGTGAACGTG CCCGTAGAGC TTTTAGAAGC GCGAATTATT GGGATTTGAA AACTACTTTA CAAAATCAAG GGAACTCATT TGAGGCAAAA CTGGCAACTC TGGGCGGCCA GAAAATAGCT AGCGGGATTG ATTTTGATGA TTCCACTGGC TTGATGAAGG ATGGAAATAA TGCTCGCTTG GTCAGTGAAA AAGAAGCAAA AGATCTTGCA AAAACACTCC AATTGAATGA ATGGAAAGTT AGTTCGATTG AAGAAAAGCC TACAGTTAGG AGACCAGTAC CCCCATTTAC TACAAGTACT CTTCAACAGG AGTCAAACCG CAAGCTTAGG CTCTCGACTA GAGAAACGAT GAGATGTGCA CAGGGTCTAT ATGAGAGAGG TTTTATAACT TATATGCGAA CTGATTCAGT TCATCTCTCA GAGCAGGCAA TTAAAGCTTC TAGAAAATGT ATTGAGTCTA GATATGGAGC TGATTACTTA AGTAATAAAG TCCGTCAATT TAGCAATAAG TCTAGAAATG CTCAAGAAGC ACATGAAGCC ATTCGGCCTG CGGGAGCAAC TTTTAAGATG CCAGATCAAA CAGGGCTTGA AGGTAGAGAT CTGGCGCTTT ATGAATTGAT ATGGAAAAGA ACGGTTGCGA GTCAAATGCA AGAAGCTCGC TTGACAATGA TTGGAGTTGA AATAACAGTT GGTGATGCTG TGTTTCGTTC TTCTGGTAAG AGAATTGACT TTCCTGGTTT CTTCCGAGCT TATGTGGAAG GAAATGATGA TCCAGATGCA GCCTTAGAAG GTCAAGAAGT CCTTCTTCCA AGTTTGGAGG TGGGAGATAC ACCTATTGTC GAAAAGATAG AGCCGTTAGC TCATCAAACA CAACCACCTG CCCGCTATAG TGAAGCTTCC TTGGTAAAAA TGCTTGAGAA GGAAGGTATT GGAAGACCTT CTACGTATTC AAGTATTATT GGAACCATTG TGGATAGAGG TTATTCCTCT ATCCACAATA ATTCTTTAAT ACCTAGCTTT ACAGCATTTG CAGTAACGGC TTTATTAGAA GAACATTTCC CAGATCTAGT TGATACTAGG TTTACTGCTA GAATGGAATT GACTTTAGAT GAAATCTCTA CAGGTAAAGT GGAGTGGTTA CCATATTTAT CAGGCTTTTA TAAAGGAGAG GATGGCCTTG AAAATCAAGT CGAGAAAAAA GAAGGAGACA TAGACCCAGG TTTATCTAGA ACTATTGCTT TAGAAGGCCT AAAATGTGTT GTCAGGATTG GACGCTTTGG AGCCTATCTT GAATCGAGAC GTCTAGGAGA TAATGGCGAA GAAGAGTTGA TTAAAGCCAC ACTCCCTCAA GAAACGACCC CAGCAGATCT GGATGAAGAG AAAGCAGAGT TAATTCTGAA GCAAAAATCA GATCAACCTG ATCCTTTGGG GACGGATCCT GAAACAGGTG AAGAGATTTA TTTGCTCTTT GGTCAATATG GCCCTTATGT TCAGAGAGGG CAAGTAACTG ATGAGGTGCC GAAACCAAAA AGAGCTTCAG TGCCAAAAGC AGTTAAACCA GAAGATCTGA CTATAGAGGA AGCCCTTGGG TTGCTTAAAT TGCCACGTGC TCTTGGGGAG CATCCTGATG GGGGTAAGAT TGCTGCAGGT CTTGGGCGCT TTGGTCCTTA TATTGTTTGG AATAAAGGAA AAGGAGAAAA AGATTATCGA TCACTAAAAG GAGCTGATGA TGTTCTTGAA GTAAAGATTG AAAGAGCACT TGAGTTATTG GCTATGCCAA AGAGAGGTAG AGGTGGTAGG ACTGCATTGA AAGACCTTGG CATACCTAAG GGACAGAAAG ATAAGGTTGA GGTTTATAAC GGCCCCTATG GACTTTACGT AAAACAAGGG AAAGTCAATG CTTCTTTGCC AAAAGGTAAA AGTGCTGAAG AAATTACCAT CGAAGAAGCA GTTGAATTGC TCGAAGCTAA ACTTACAAGT AAAAAAACTA AAAAGAAAAA AGTAGCTAAA CCCAAAAGCT CTACAAAAAG CAAAGCAAAG TCCAATAAAG CGACTCCTAA AGCTCCATCT ACTACTAAGT CAGGACGATT AAGAGCAAGT GCGGTAAGAG TTATTAAGTC TGGTAATTAA
|
Protein sequence | MAQALVIVES PTKARTIKSF LPKDFEVVAS MGHVRDLPNS ASEIPAAQKG EKWATLGVNT TADFEPLYVV PKDKKKIVRE LKTALKGADQ LLLATDEDRE GESISWHLLQ LLNPKVPAKR MVFHEITKDA ISHALTQTRD LDMELVHAQE TRRILDRLVG YTLSPLLWKK VAWGLSAGRV QSVAVRLLVL RERARRAFRS ANYWDLKTTL QNQGNSFEAK LATLGGQKIA SGIDFDDSTG LMKDGNNARL VSEKEAKDLA KTLQLNEWKV SSIEEKPTVR RPVPPFTTST LQQESNRKLR LSTRETMRCA QGLYERGFIT YMRTDSVHLS EQAIKASRKC IESRYGADYL SNKVRQFSNK SRNAQEAHEA IRPAGATFKM PDQTGLEGRD LALYELIWKR TVASQMQEAR LTMIGVEITV GDAVFRSSGK RIDFPGFFRA YVEGNDDPDA ALEGQEVLLP SLEVGDTPIV EKIEPLAHQT QPPARYSEAS LVKMLEKEGI GRPSTYSSII GTIVDRGYSS IHNNSLIPSF TAFAVTALLE EHFPDLVDTR FTARMELTLD EISTGKVEWL PYLSGFYKGE DGLENQVEKK EGDIDPGLSR TIALEGLKCV VRIGRFGAYL ESRRLGDNGE EELIKATLPQ ETTPADLDEE KAELILKQKS DQPDPLGTDP ETGEEIYLLF GQYGPYVQRG QVTDEVPKPK RASVPKAVKP EDLTIEEALG LLKLPRALGE HPDGGKIAAG LGRFGPYIVW NKGKGEKDYR SLKGADDVLE VKIERALELL AMPKRGRGGR TALKDLGIPK GQKDKVEVYN GPYGLYVKQG KVNASLPKGK SAEEITIEEA VELLEAKLTS KKTKKKKVAK PKSSTKSKAK SNKATPKAPS TTKSGRLRAS AVRVIKSGN
|
| |