Gene Information |
Locus tag | PMN2A_1768 |
Symbol | |
ID | 3607178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | + |
Start bp | 433444 |
End bp | 436350 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637688659 |
Product | DNA topoisomerase I |
Protein accession | YP_292959 |
Protein GI | 72383604 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
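The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS that ends in a stop codon. Both are assumptions that match the values in this record, not something the record states. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon of the CDS is a stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Start bp and End bp as listed for PMN2A_1768.
gene_len, protein_len = cds_lengths(433444, 436350)
print(gene_len, protein_len)  # 2907 968
```

The result agrees with the listed Gene Length (2907 bp) and Protein Length (968 aa).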
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCACTG ACCATACTCT GGTGATTGTT GAAAGTCCTA CAAAGGCAAA AACTATTAGA GGGTTTTTGC CTAAGGACTT TCAGGTTCTT GCGTCAATGG GGCACATAAG AGACTTGCCT AACAATGCAT CTGAGATCCC TGCGAAGCAC AAAGGCGAAA AGTGGGCAAC GATTGGAGTT AATACAACTG CTGATTTTGA TCCTTTGTAC GTTGTACCCA AAGACAAGAA AAAAATTGTC AAGGAATTAA AACAATCTTT GAAGGGTGCT AGTGAATTGT TGCTTGCGAC TGATGAAGAT AGAGAAGGAG AAAGTATAAG TTGGCATTTA ATGAATGTGC TTGACCCGAA AATCCCTGTG AAGAGGATGG TCTTTCATGA GATAACTAAA GAAGCTATTT CCAAAGCTCT ATCGAAAACA AGAGCAATTG ATATGGAATT AGTTCATGCC CAAGAGACAA GGAGGATCTT AGACAGATTA GTTGGGTACA CGCTTTCTCC TCTTTTATGG AAGAAAGTTT CATGGGGTTT ATCTGCAGGA AGAGTTCAAT CAGTTGCAGT AAGATTGCTA GTTCTGAGAG AGAGAGCAAG GAGAGCTTTC AAAAGCGGGA GTTATTGGGA CTTAAAAGCA AAATTAGAGA AAGAAGGTAG TGAATTTGAG GTGAAAATGA CCTCAATTGG TGGGAAAAGA ATTGCTACAG GTAGTGATTT TGATGAGTCA ACGGGATTAT TGAAATCTGG CCGAAATGTC ATATTACTCA AGGAAGAGGA GTCTAAGGAA CTTGCACAAA AATTAACTAC TGATAAATGG AAAGTTGTTA ATGTCGAGGA AAAGCCGTCA ATCCGTAAAC CAGTTCCTCC TTTTACAACA AGCACATTAC AACAAGAGGC TAATAGAAAA CTTCGATTAT CAGCTAGGGA GACTATGAGA TGTGCTCAGG GTTTGTATGA AAGAGGTTTT ATTACATATA TGAGAACAGA TTCTGTTCAT CTGTCTGATC AGGCAATTAA TGCCTCACGA AATTGTGTTG AATCAAAATA TGGTGTTGAA TATTTAAGTA AAAAGCCCCG ACAATTCTCC AACAAGACGA GAAATGCTCA AGAAGCCCAT GAAGCAATAC GTCCTTCTGG TGAGAGCTTT AAAACACCCA AAGAGTCAAA CTTGCAAGGT AGGGATCTTT CTTTATACGA ACTTATTTGG AAACGGACAG TTGCTAGTCA AATGGCCGAT GCAAGGTTGA CAATGCTTGG AGTCGAATTA AAAGCATCGG ATGTATCTTT TCGGGCTAGT GGTAAACGAA TAGATTTTCC TGGATTCTTT AGAGCTTATG TTGAAGGTAC TGATGATCCT GATAGTGCAC TTGAAGGACA AGAAGTGCTT TTGCCTAAAT TAGAGGTAGG AGATTCTCCA ACAGCTAAGA ATGTAGAGGC ATTGGGGCAT CAGACTCAAC CTCCAGCTAG ATATAGCGAA GCTTCATTAG TTAAAACACT TGAGAAAGAA GGCATAGGTC GTCCGTCAAC TTATGCAAGC ATTATAGGAA CAATTGTAGA TCGAGGTTAT TCAGTCCTAA ATAACAATTC TTTAACTCCA AGCTTTACAG CATTTGCTGT GACGGCACTT CTTGAAGAAC ATTTTCCTGA TCTTGTAGAT ACTAGTTTTA CTGCTCGAAT GGAATCTACA CTTGATGAGA TCTCAACAGG AAAAGTGAGT TGGCTTCCAT ACCTTAAGGG CTTTTATAAG GGTGATACTG GCCTAGAGAA TCAGGTTCAA CAAAGGGAAG GGGATATTGA TGGAGGCGAG 
TTTAGAGCTG TTTCCTTGGA GGGACTTTCA TCTCTAGTTA GGTTGGGCAA ATTTGGAACA TATCTGGAAT CAAAGCAACT GGGTGAAAAT GGCAAGCCCA TAACAGCTAC TCTTCCACAG GAAATTACTC CCGCAGATTT GGATGAGGAT ATCGCAGAGA TGATTTTAAA ACAAAAAGCT GAGGGTCCTG AATCACTTGG GGTTGACCCT GATAGTGGAC AGAATCTATA TCTATTAAAT GGTAGATATG GTCATTTTGT TCAAAGGGGA TTAGTAGTCG AATTGAAAGA TCTTGGAATT CCAAAAGGTA AGAAAAAATT AGGAAATCTT CGCTTGTTCA AAAGCAGTCA ATATGGACTC TATTTGAAGC AGGATTCATC AAAGGTTCAG CTTTTGTTGC CAGAGAATAT AAAAGAGGAA GAGATAGATG TTGAAAAAGC ACTTGAGTAT TTAGATGATA AATCATTGAA AAAAGCTCCA AATCCAAAAA GAACTTCCTT GCCAAAGAGT TTAAAACCAG AGGACTTGAC CTTTGAGGAG GCCCTTGGAT TGATTCAATT ACCACGTCTA CTGGGAGAGC ATCCAGAGGG AGGTAGGATT CAATCAAGTT TAGGTAGATT TGGTCCCTAT GTGGTTTGGA GTAAAAATGG TGGTGAAAAA GATTATCGCT CAATTAAAGG TGACGATGAC GTTCTTCAAG TAAGCCTAGA AAGAGCTCTT GAGCTTTTAT CAATACCTAA AAGAGGAAGA GGCGGAAGAA CTGCGTTGAA GGAACTTGGT ATCCCAGAGG GAGAAAAAGA AACTATCCAA TTATTTGATG GTCCTTATGG TTTATATGTT AAACAGGGCA AAGTAAATGC TTCTCTACCA GAGGGAAAAA CCGCTGAAGA TATCACTATT GAGGTAGCTA TTGAATTATT GGCAGCTAAG AAATCAAGTA AAAAGACAAC ATCTAAGAAA AGAAAATCTA CACAAAAGAC AACCAAGTCA ACAAAGAAAG ATTTAAACTC ATCAGCATCA AAAAAAAGTA GTACTCAAAA AGCGCCCTCT ACAACTAAAA CAGGACGTCT AAGAGCCAGT AAAGTAAGGG TAATTAAAAC AAAATAA
Protein sequence | MPTDHTLVIV ESPTKAKTIR GFLPKDFQVL ASMGHIRDLP NNASEIPAKH KGEKWATIGV NTTADFDPLY VVPKDKKKIV KELKQSLKGA SELLLATDED REGESISWHL MNVLDPKIPV KRMVFHEITK EAISKALSKT RAIDMELVHA QETRRILDRL VGYTLSPLLW KKVSWGLSAG RVQSVAVRLL VLRERARRAF KSGSYWDLKA KLEKEGSEFE VKMTSIGGKR IATGSDFDES TGLLKSGRNV ILLKEEESKE LAQKLTTDKW KVVNVEEKPS IRKPVPPFTT STLQQEANRK LRLSARETMR CAQGLYERGF ITYMRTDSVH LSDQAINASR NCVESKYGVE YLSKKPRQFS NKTRNAQEAH EAIRPSGESF KTPKESNLQG RDLSLYELIW KRTVASQMAD ARLTMLGVEL KASDVSFRAS GKRIDFPGFF RAYVEGTDDP DSALEGQEVL LPKLEVGDSP TAKNVEALGH QTQPPARYSE ASLVKTLEKE GIGRPSTYAS IIGTIVDRGY SVLNNNSLTP SFTAFAVTAL LEEHFPDLVD TSFTARMEST LDEISTGKVS WLPYLKGFYK GDTGLENQVQ QREGDIDGGE FRAVSLEGLS SLVRLGKFGT YLESKQLGEN GKPITATLPQ EITPADLDED IAEMILKQKA EGPESLGVDP DSGQNLYLLN GRYGHFVQRG LVVELKDLGI PKGKKKLGNL RLFKSSQYGL YLKQDSSKVQ LLLPENIKEE EIDVEKALEY LDDKSLKKAP NPKRTSLPKS LKPEDLTFEE ALGLIQLPRL LGEHPEGGRI QSSLGRFGPY VVWSKNGGEK DYRSIKGDDD VLQVSLERAL ELLSIPKRGR GGRTALKELG IPEGEKETIQ LFDGPYGLYV KQGKVNASLP EGKTAEDITI EVAIELLAAK KSSKKTTSKK RKSTQKTTKS TKKDLNSSAS KKSSTQKAPS TTKTGRLRAS KVRVIKTK
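The GC content field in the gene table is a simple base-composition ratio. The sketch below shows one way such a value could be computed from a sequence laid out in space-separated 10-base blocks, as in this record. The function name `gc_fraction` is hypothetical, and the example runs on only the first two blocks of the gene sequence, so its result is not expected to match the 39% reported for the full gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces between blocks, line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence in this record.
first_blocks = "ATGCCCACTG ACCATACTCT"
print(gc_fraction(first_blocks))  # 0.5
```

Applying the same function to the complete gene sequence would give the whole-gene value.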