Gene NATL1_04911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_04911 
SymboltopA 
ID4781077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp445813 
End bp448719 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content39% 
IMG OID640083766 
ProductDNA topoisomerase I 
Protein accessionYP_001014318 
Protein GI124025202 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.624674 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACTG ACCATACTCT GGTAATTGTT GAAAGTCCTA CAAAGGCAAA AACTATTAGA 
GGGTTTTTGC CTAAGGACTT TCAGGTTCTT GCGTCAATGG GTCACATAAG AGACTTGCCT
AACAATGCAT CTGAGATCCC TGCGAAGCAC AAAGGCGAAA AGTGGGCAAC GATTGGAGTT
AATACAACTG CTGATTTTGA TCCTTTGTAC GTCGTACCCA AAGACAAGAA AAAAATTGTC
AAGGAATTAA AACAATCTTT GAAGGGTGCT AGTGAATTGT TGCTTGCGAC TGATGAAGAT
AGAGAAGGAG AAAGTATAAG TTGGCATTTA ATGAATGTGC TTGACCCGAA AATCCCTGTG
AAGAGGATGG TCTTTCATGA GATAACTAAA GAAGCTATTT CCAAAGCTCT ATCGAAAACA
AGAGCAATTG ATATGGAATT AGTTCATGCC CAAGAGACAA GGAGGATCTT AGACAGATTA
GTTGGGTACA CGCTTTCTCC TCTTTTATGG AAGAAAGTTT CATGGGGGTT ATCTGCAGGA
AGAGTTCAAT CAGTTGCAGT AAGGTTGCTA GTTCTGAGAG AGAGAGCAAG GAGAGCTTTC
AAAAGCGGGA GTTATTGGGA CTTAAAAGCA AAATTAGAGA AAGAAGGTAG TGAATTTGAG
GTGAAAATGA CCTCAATTGG TGGAAAAAGA ATTGCTACAG GTAGTGATTT TGATGAGTCA
ACGGGATTAT TGAAATCTGG CCGAAATGTC ATATTACTCA AGGAAGAGGA GTCTAAGGAA
CTTGCAAAAA ATTTAACTAC TGATAAATGG AAAGTTGTTA ATGTCGAGGA AAAGCCGTCA
ATCCGTAAAC CAGTTCCTCC TTTTACAACA AGCACATTAC AACAAGAGGC TAATAGAAAA
CTTCGATTAT CAGCTAGGGA GACTATGAGA TGTGCTCAGG GTTTGTATGA AAGAGGTTTT
ATTACATATA TGAGAACAGA TTCTGTTCAT CTGTCTGATC AGGCAATTAA TGCCTCACGA
AATTGTGTTG AATCAAAATA TGGTGTTGAA TATTTAAGTA AAAAGCCCCG ACAATTCTCC
AATAAGACGA GAAATGCTCA AGAAGCCCAT GAAGCAATAC GTCCTTCTGG TGAGAGCTTT
AAAACACCCA AAGAGTCAAA CTTGCAAGGT AGGGATCTTT CTTTATACGA ACTTATTTGG
AAACGGACAG TTGCTAGTCA AATGGCCGAT GCAAGGTTGA CAATGCTTGG AGTCGAATTA
AAAGCATCGG ATGTATCTTT TCGGGCTAGT GGTAAACGAA TAGATTTCCC TGGATTCTTT
AGAGCTTATG TTGAAGGTAC TGATGATCCT GATAGTGCAC TTGAAGGACA AGAAGTGCTT
TTGCCTAAAT TAGCGGTAGG AGATTCTCCA ACAGCTAAGA ATGTAGAGGC ATTGGGGCAT
CAGACTCAAC CTCCAGCTAG ATATAGCGAA GCTTCATTAG TTAAAACACT TGAGAAAGAA
GGCATAGGTC GTCCGTCAAC TTATGCAAGC ATTATAGGAA CAATTGTAGA TCGAGGTTAT
TCAGTCCTAA ATAACAATTC TTTAACTCCA AGCTTTACAG CATTTGCTGT GACGGCACTT
CTTGAAGAAC ATTTTCCTGA TCTTGTAGAT ACCAGTTTTA CTGCTCGAAT GGAATCTACA
CTTGATGAGA TCTCAACAGG AAAAGTGAGT TGGCTTCCAT ACCTTAAGGG CTTTTATAAG
GGTGATACTG GCCTAGAGAA TCAGGTGCAA CAAAGGGAAG GGGATATTGA TGGAGGCGAG
TTTAGAGCTG TTTCCTTGGA GGGACTTTCA TCTCTAGTTA GGTTGGGCAA ATTTGGAACA
TATCTGGAAT CAAAGCAACT GGGTGAAAAT GGCAAGCCCA TAACAGCTAC TCTTCCACAG
GAAATTACTC CCGCAGATTT GGATGAGGAT ATCGCAGAGA TGATTTTAAA ACAAAAAGCT
GAGGGTCCTG AATCACTTGG GGTTGACCCT GATAGTGGAC AGAATCTATA TCTATTAAAT
GGTAGATATG GTCATTTTGT TCAAAGGGGA TTAGTAGTCG AATTGAAAGA TCTTGGGATT
CCAAAAGGTA AGAAGAAATT AGGAAATCTT CGCTTGTTCA AAAGCAGTCA ATATGGACTC
TATTTGAAGC AGGATTCATC AAAGGTTCAG GTTTTGTTAC CAGAGAATAT AAAAGAGGAA
GAGATAGATG TTGAAAAAGC ACTTGAATAT TTAGATGATA AATCATTAAA AAAAGCTCCA
AATCCAAAAA GGACTTCCTT ACCAAAGAGT CTAAAACCAG AGGACTTGAC ATTTGAGAAG
GCCCTTGGAT TAATCCAATT ACCACGTCTA CTTGGAGAGC ACCCAGAGGG AGGTAAAGTT
CAATCAAGCT TGGGTAGATT TGGTCCGTAT GTGGTTTGGA GTAAAAATGG TGGTGAAAAA
GATTATCGCT CAATTAAGGG GGAAGATGAC GTTCTTCAAG TAAGCCTAGA AAGAGCTCTT
GAGCTTTTAT CAATACCAAA AAGAGGAAGA GGCGGAAGAA CTGCGTTGAA AGAACTTGGT
ATCCCAGATG GAGAAAAAGA AACTATCCAA TTATTTGATG GTCCTTATGG TTTATATGTT
AAACAGGGTA AAGTAAATGC TTCTCTACCA GAGGGAAAAA CCGCTGAAGA TATTACTATT
GAGGTAGCTA TTGAATTATT GGCAGCTAAG AAATCAAGTA AAAAGACAAC ATCTAAGAAA
AGAAAATCTA CACAAAAGAC AACCAAGTCA ACAAAGAAAG ATTTAAACTC ATCAGCATCA
AAAAAAAGTA GTACTCAAAA AGCGCCCTCT ACAACTAAAA CAGGACGTCT AAGAGCCAGT
AAAGTAAGGG TAATTAAAAC AAAATAA
 
Protein sequence
MPTDHTLVIV ESPTKAKTIR GFLPKDFQVL ASMGHIRDLP NNASEIPAKH KGEKWATIGV 
NTTADFDPLY VVPKDKKKIV KELKQSLKGA SELLLATDED REGESISWHL MNVLDPKIPV
KRMVFHEITK EAISKALSKT RAIDMELVHA QETRRILDRL VGYTLSPLLW KKVSWGLSAG
RVQSVAVRLL VLRERARRAF KSGSYWDLKA KLEKEGSEFE VKMTSIGGKR IATGSDFDES
TGLLKSGRNV ILLKEEESKE LAKNLTTDKW KVVNVEEKPS IRKPVPPFTT STLQQEANRK
LRLSARETMR CAQGLYERGF ITYMRTDSVH LSDQAINASR NCVESKYGVE YLSKKPRQFS
NKTRNAQEAH EAIRPSGESF KTPKESNLQG RDLSLYELIW KRTVASQMAD ARLTMLGVEL
KASDVSFRAS GKRIDFPGFF RAYVEGTDDP DSALEGQEVL LPKLAVGDSP TAKNVEALGH
QTQPPARYSE ASLVKTLEKE GIGRPSTYAS IIGTIVDRGY SVLNNNSLTP SFTAFAVTAL
LEEHFPDLVD TSFTARMEST LDEISTGKVS WLPYLKGFYK GDTGLENQVQ QREGDIDGGE
FRAVSLEGLS SLVRLGKFGT YLESKQLGEN GKPITATLPQ EITPADLDED IAEMILKQKA
EGPESLGVDP DSGQNLYLLN GRYGHFVQRG LVVELKDLGI PKGKKKLGNL RLFKSSQYGL
YLKQDSSKVQ VLLPENIKEE EIDVEKALEY LDDKSLKKAP NPKRTSLPKS LKPEDLTFEK
ALGLIQLPRL LGEHPEGGKV QSSLGRFGPY VVWSKNGGEK DYRSIKGEDD VLQVSLERAL
ELLSIPKRGR GGRTALKELG IPDGEKETIQ LFDGPYGLYV KQGKVNASLP EGKTAEDITI
EVAIELLAAK KSSKKTTSKK RKSTQKTTKS TKKDLNSSAS KKSSTQKAPS TTKTGRLRAS
KVRVIKTK