Gene PMN2A_1768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_1768 
Symbol 
ID3607178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp433444 
End bp436350 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content39% 
IMG OID637688659 
ProductDNA topoisomerase I 
Protein accessionYP_292959 
Protein GI72383604 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCACTG ACCATACTCT GGTGATTGTT GAAAGTCCTA CAAAGGCAAA AACTATTAGA 
GGGTTTTTGC CTAAGGACTT TCAGGTTCTT GCGTCAATGG GGCACATAAG AGACTTGCCT
AACAATGCAT CTGAGATCCC TGCGAAGCAC AAAGGCGAAA AGTGGGCAAC GATTGGAGTT
AATACAACTG CTGATTTTGA TCCTTTGTAC GTTGTACCCA AAGACAAGAA AAAAATTGTC
AAGGAATTAA AACAATCTTT GAAGGGTGCT AGTGAATTGT TGCTTGCGAC TGATGAAGAT
AGAGAAGGAG AAAGTATAAG TTGGCATTTA ATGAATGTGC TTGACCCGAA AATCCCTGTG
AAGAGGATGG TCTTTCATGA GATAACTAAA GAAGCTATTT CCAAAGCTCT ATCGAAAACA
AGAGCAATTG ATATGGAATT AGTTCATGCC CAAGAGACAA GGAGGATCTT AGACAGATTA
GTTGGGTACA CGCTTTCTCC TCTTTTATGG AAGAAAGTTT CATGGGGTTT ATCTGCAGGA
AGAGTTCAAT CAGTTGCAGT AAGATTGCTA GTTCTGAGAG AGAGAGCAAG GAGAGCTTTC
AAAAGCGGGA GTTATTGGGA CTTAAAAGCA AAATTAGAGA AAGAAGGTAG TGAATTTGAG
GTGAAAATGA CCTCAATTGG TGGGAAAAGA ATTGCTACAG GTAGTGATTT TGATGAGTCA
ACGGGATTAT TGAAATCTGG CCGAAATGTC ATATTACTCA AGGAAGAGGA GTCTAAGGAA
CTTGCACAAA AATTAACTAC TGATAAATGG AAAGTTGTTA ATGTCGAGGA AAAGCCGTCA
ATCCGTAAAC CAGTTCCTCC TTTTACAACA AGCACATTAC AACAAGAGGC TAATAGAAAA
CTTCGATTAT CAGCTAGGGA GACTATGAGA TGTGCTCAGG GTTTGTATGA AAGAGGTTTT
ATTACATATA TGAGAACAGA TTCTGTTCAT CTGTCTGATC AGGCAATTAA TGCCTCACGA
AATTGTGTTG AATCAAAATA TGGTGTTGAA TATTTAAGTA AAAAGCCCCG ACAATTCTCC
AACAAGACGA GAAATGCTCA AGAAGCCCAT GAAGCAATAC GTCCTTCTGG TGAGAGCTTT
AAAACACCCA AAGAGTCAAA CTTGCAAGGT AGGGATCTTT CTTTATACGA ACTTATTTGG
AAACGGACAG TTGCTAGTCA AATGGCCGAT GCAAGGTTGA CAATGCTTGG AGTCGAATTA
AAAGCATCGG ATGTATCTTT TCGGGCTAGT GGTAAACGAA TAGATTTTCC TGGATTCTTT
AGAGCTTATG TTGAAGGTAC TGATGATCCT GATAGTGCAC TTGAAGGACA AGAAGTGCTT
TTGCCTAAAT TAGAGGTAGG AGATTCTCCA ACAGCTAAGA ATGTAGAGGC ATTGGGGCAT
CAGACTCAAC CTCCAGCTAG ATATAGCGAA GCTTCATTAG TTAAAACACT TGAGAAAGAA
GGCATAGGTC GTCCGTCAAC TTATGCAAGC ATTATAGGAA CAATTGTAGA TCGAGGTTAT
TCAGTCCTAA ATAACAATTC TTTAACTCCA AGCTTTACAG CATTTGCTGT GACGGCACTT
CTTGAAGAAC ATTTTCCTGA TCTTGTAGAT ACTAGTTTTA CTGCTCGAAT GGAATCTACA
CTTGATGAGA TCTCAACAGG AAAAGTGAGT TGGCTTCCAT ACCTTAAGGG CTTTTATAAG
GGTGATACTG GCCTAGAGAA TCAGGTTCAA CAAAGGGAAG GGGATATTGA TGGAGGCGAG
TTTAGAGCTG TTTCCTTGGA GGGACTTTCA TCTCTAGTTA GGTTGGGCAA ATTTGGAACA
TATCTGGAAT CAAAGCAACT GGGTGAAAAT GGCAAGCCCA TAACAGCTAC TCTTCCACAG
GAAATTACTC CCGCAGATTT GGATGAGGAT ATCGCAGAGA TGATTTTAAA ACAAAAAGCT
GAGGGTCCTG AATCACTTGG GGTTGACCCT GATAGTGGAC AGAATCTATA TCTATTAAAT
GGTAGATATG GTCATTTTGT TCAAAGGGGA TTAGTAGTCG AATTGAAAGA TCTTGGAATT
CCAAAAGGTA AGAAAAAATT AGGAAATCTT CGCTTGTTCA AAAGCAGTCA ATATGGACTC
TATTTGAAGC AGGATTCATC AAAGGTTCAG CTTTTGTTGC CAGAGAATAT AAAAGAGGAA
GAGATAGATG TTGAAAAAGC ACTTGAGTAT TTAGATGATA AATCATTGAA AAAAGCTCCA
AATCCAAAAA GAACTTCCTT GCCAAAGAGT TTAAAACCAG AGGACTTGAC CTTTGAGGAG
GCCCTTGGAT TGATTCAATT ACCACGTCTA CTGGGAGAGC ATCCAGAGGG AGGTAGGATT
CAATCAAGTT TAGGTAGATT TGGTCCCTAT GTGGTTTGGA GTAAAAATGG TGGTGAAAAA
GATTATCGCT CAATTAAAGG TGACGATGAC GTTCTTCAAG TAAGCCTAGA AAGAGCTCTT
GAGCTTTTAT CAATACCTAA AAGAGGAAGA GGCGGAAGAA CTGCGTTGAA GGAACTTGGT
ATCCCAGAGG GAGAAAAAGA AACTATCCAA TTATTTGATG GTCCTTATGG TTTATATGTT
AAACAGGGCA AAGTAAATGC TTCTCTACCA GAGGGAAAAA CCGCTGAAGA TATCACTATT
GAGGTAGCTA TTGAATTATT GGCAGCTAAG AAATCAAGTA AAAAGACAAC ATCTAAGAAA
AGAAAATCTA CACAAAAGAC AACCAAGTCA ACAAAGAAAG ATTTAAACTC ATCAGCATCA
AAAAAAAGTA GTACTCAAAA AGCGCCCTCT ACAACTAAAA CAGGACGTCT AAGAGCCAGT
AAAGTAAGGG TAATTAAAAC AAAATAA
 
Protein sequence
MPTDHTLVIV ESPTKAKTIR GFLPKDFQVL ASMGHIRDLP NNASEIPAKH KGEKWATIGV 
NTTADFDPLY VVPKDKKKIV KELKQSLKGA SELLLATDED REGESISWHL MNVLDPKIPV
KRMVFHEITK EAISKALSKT RAIDMELVHA QETRRILDRL VGYTLSPLLW KKVSWGLSAG
RVQSVAVRLL VLRERARRAF KSGSYWDLKA KLEKEGSEFE VKMTSIGGKR IATGSDFDES
TGLLKSGRNV ILLKEEESKE LAQKLTTDKW KVVNVEEKPS IRKPVPPFTT STLQQEANRK
LRLSARETMR CAQGLYERGF ITYMRTDSVH LSDQAINASR NCVESKYGVE YLSKKPRQFS
NKTRNAQEAH EAIRPSGESF KTPKESNLQG RDLSLYELIW KRTVASQMAD ARLTMLGVEL
KASDVSFRAS GKRIDFPGFF RAYVEGTDDP DSALEGQEVL LPKLEVGDSP TAKNVEALGH
QTQPPARYSE ASLVKTLEKE GIGRPSTYAS IIGTIVDRGY SVLNNNSLTP SFTAFAVTAL
LEEHFPDLVD TSFTARMEST LDEISTGKVS WLPYLKGFYK GDTGLENQVQ QREGDIDGGE
FRAVSLEGLS SLVRLGKFGT YLESKQLGEN GKPITATLPQ EITPADLDED IAEMILKQKA
EGPESLGVDP DSGQNLYLLN GRYGHFVQRG LVVELKDLGI PKGKKKLGNL RLFKSSQYGL
YLKQDSSKVQ LLLPENIKEE EIDVEKALEY LDDKSLKKAP NPKRTSLPKS LKPEDLTFEE
ALGLIQLPRL LGEHPEGGRI QSSLGRFGPY VVWSKNGGEK DYRSIKGDDD VLQVSLERAL
ELLSIPKRGR GGRTALKELG IPEGEKETIQ LFDGPYGLYV KQGKVNASLP EGKTAEDITI
EVAIELLAAK KSSKKTTSKK RKSTQKTTKS TKKDLNSSAS KKSSTQKAPS TTKTGRLRAS
KVRVIKTK