Gene A9601_03081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_03081 
SymbolphrB 
ID4716995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp284525 
End bp285958 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content31% 
IMG OID640078010 
Productputative DNA photolyase 
Protein accessionYP_001008703 
Protein GI123967845 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAC CTAGAATACT TTTCTGGCAT AGAAAGGATT TAAGAATATT TGATAATCAA 
GCTTTAATCA AAGCATTTTC ATTATCAAAT GCTATTACTT CAACCTATAT ATTTGATAAA
AATTACTCAC ACGATTTCAA TGCAAGTTCA AGAGCTTGGT TTCTAGGAAA TTCACTTCAA
GAATTAGGAA ATAATTGGAA AAAAATGGGT AGTAGATTAG TTATGGAAGA AGGAGATCCG
GTATTAATAA TTCCCAAATT AGCAAAGAAA ATAAATGCTA AATTTGTTTT TTGGAATAGA
TCAATTGAAC CTTATGAGAT TAATCGCGAT TTACAAATAA AAAAAAATTT AAAAGAACAA
AACATTCAAG TTATTGAAAC TTGGGATCAC TTATTAGTAG AACCTTTAAA AATATTTTCA
GGGAATAATA ATCCTTATTC AGTTTATGGA CCTTTTTATA AAAACCTTAA ATCAAAAATG
AATTTATTAG GTTTATATGA ACAAGATAAA GTTGGTTTCC AGTTTAAAGA TATTGATAAT
AAACTCAAAG ATAAGACAAT AAATTCATCT GATTCGGTTT TAGAGAAATT TATCAAAAAT
ATCAAATTTC CTGGTTCGAA TATTTGTCCA TGTAAACCTG GAGAGAATGC TGCAGAAACA
TTATTAGAAA ACTTCATTAA CGAAAAAAAA ATATATTCTT ATAATTCTGC ACGAGATTTT
CCTTCCCATA ATGGGACATC TTTTCTAAGT GCATCTCTCA GATTCGGTAC CATCAGCATT
AGAAAAGTTT GGAACGCCAC TTTAAATTTA AATTCAGATT TGGAAAATCA AGGAAATTAT
CTATCAATTG AAACTTGGCA AAAAGAACTT GTTTGGCGTG AATTTTATCA ACATTGCTTA
TTCCATTTCC CAGAGCTAGA GAAAGGTCCC TATAGAAAAA AATGGGATCA CTTTCGATGG
CAAAACAATA ATGAATGGTT TCAGCATTGG AGCAACGGAG AGACCGGAGT ACCTATAGTT
GATGCTGCAA TGCGTCAACT AAATAGTACT GGCTGGATGC ATAACAGATG TAGGATGATA
GTCGCTTCAT TTCTGGTAAA AGATCTTATA TGCAATTGGC AAATGGGCGA GAAAAAATTT
ATGGAGACTT TGGTTGATGG AGACTTAGCT GCAAATAATG GGGGATGGCA GTGGAGCGCA
AGTAGCGGTA TGGATCCAAA ACCTCTTAGA ATTTTTAATC CATATACCCA AGCAAAAAAA
TTTGATCCTA TTTGCGAATA TATAAAATAT TGGATTCCTG AATTATCTAA AGTGTCAAAT
TCAGAATTAT TAAATGGGGA TATATCTAAT TTAGAAAAAA ATGATTATTC AAGCCCTATT
GTCAATCACA AGATACAACA AAGATTATTT AAATCACTTT ATGCTGAAAT TTGA
 
Protein sequence
MNKPRILFWH RKDLRIFDNQ ALIKAFSLSN AITSTYIFDK NYSHDFNASS RAWFLGNSLQ 
ELGNNWKKMG SRLVMEEGDP VLIIPKLAKK INAKFVFWNR SIEPYEINRD LQIKKNLKEQ
NIQVIETWDH LLVEPLKIFS GNNNPYSVYG PFYKNLKSKM NLLGLYEQDK VGFQFKDIDN
KLKDKTINSS DSVLEKFIKN IKFPGSNICP CKPGENAAET LLENFINEKK IYSYNSARDF
PSHNGTSFLS ASLRFGTISI RKVWNATLNL NSDLENQGNY LSIETWQKEL VWREFYQHCL
FHFPELEKGP YRKKWDHFRW QNNNEWFQHW SNGETGVPIV DAAMRQLNST GWMHNRCRMI
VASFLVKDLI CNWQMGEKKF METLVDGDLA ANNGGWQWSA SSGMDPKPLR IFNPYTQAKK
FDPICEYIKY WIPELSKVSN SELLNGDISN LEKNDYSSPI VNHKIQQRLF KSLYAEI