Gene P9301_03921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_03921 
Symbol 
ID4911828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp347540 
End bp349036 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content28% 
IMG OID640159968 
Productputative deoxyribodipyrimidine photolyase 
Protein accessionYP_001090616 
Protein GI126695730 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.322027 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGGAA TAAATATCTT ATGGTTTAAG AAAGATTTAA GAATTTTTGA TAACGAAGCT 
CTCTGTGAGG CTATAAAAGA TAATGATATT TTACCTATTT ATATTATTGA GTTAGATATT
TGGAACCAAA ATACTCATTC AGATAGACAA TGGCAATTTT GCAAAGAAAG TTTAATAGAT
TTAAGAAATG CACTTGCTGA GATTGGACAA CCATTAATTA TTAGGACTGG GAATGTTATT
AATATATTTG ATGAAATTAG TTCAAAATTT AAGATCAAAG GTTTATATAG CCATCAAGAA
ACCGGAGATT GGCTTACTTA TAAAAGAGAT CAAAAAGTAA GGGAATGGGC TTTAAGTAAA
AATATTATTT GGAAGGAATT TCTACAATTT TCAGTTTTCA GAGGAAATTT AGATAGGAAT
AATTGGTCTA AAAAGTGGCA AAAAAATTCT GAAAAAAACT TACTTAAAGC ACCATTAAGA
ATTAATTCTA TTAACTTAAA TATTGGAGAA ATACCCTCAG ACAAAATTTT TTCCTTTAAA
AAAGAAACTT GTCCAGGAAG AATGCAAGGT GGAAGAAAGA AAGGTTTAGA GAGAATGCAA
TACTTCTTTA GTAATAAATT AGATTCTTAT TCAAAAGATA TATCTAGCCC AGAAAAATCA
TTTGATAGTT GTACAAGACT ATCCCCATAT ATTTGTTGGG GATGCATTTC ATTAAAAGAA
ATTTTTAAAA GGGCAAATAT ATCAAAAAAC AATAATTCTA GGATGTTAAA AAGCAGATTA
ACTTGGCATT GTCATTTTAT TCAGAAACTT GAAAGTGAAC CAGAACTAGA GTTTAGGGAA
TACCATCCTT TTTTTAAAAA TATTAGAGAA AAAAATAATG AATTACTTTA TTCATGGAGT
TCAGGTAATA CGGGCTTTCC TTTTATAGAT GCATGTATGC GTTCATTAAA TTTCAATGGA
TGGATTAACT TCAGGATGCG AGCGATGTTA ATGTCTTTTG CTAGCTATAA TTTATGGCTA
CCATGGCAAG ATTCAGGTTC TGAATTAGCA AATAAATTTG TAGATTATGA GCCTGGAATA
CATTGGAACC AATGCCAAAT GCAATCTGGA ACTACGTCTA TAAATACGAA TAGAATTTAT
AATCCTATTA AGCAGGGAAA AGATCATGAT CCTCAAGGTA AATTTATAAA AAAATGGATA
CCAGAATTAA AAGATATATC ACTTAATTTC ATTCATGAAC CATGGCTACT ATCTATATTT
AATCAAGAAG AATATGAAAA AATTAATTAC ATAAGACCAA TAATTGACAT CCCAATTAGC
ACTAGAACTG CAAAGAAGAA AATTCAGGAA ATCACTAAAA AGGATGGATA TTGGGATATC
TCAAAAGAAA TTTATTTAAA GCATGGCTCA AGAAAAAGGC CTAGAAAAAA CATAAATAAT
AAAAAAAATG TTTCTAAGGA AAAGGAAAAA CAATACGAAC TGAAATTAGA TTTCTAA
 
Protein sequence
MKGINILWFK KDLRIFDNEA LCEAIKDNDI LPIYIIELDI WNQNTHSDRQ WQFCKESLID 
LRNALAEIGQ PLIIRTGNVI NIFDEISSKF KIKGLYSHQE TGDWLTYKRD QKVREWALSK
NIIWKEFLQF SVFRGNLDRN NWSKKWQKNS EKNLLKAPLR INSINLNIGE IPSDKIFSFK
KETCPGRMQG GRKKGLERMQ YFFSNKLDSY SKDISSPEKS FDSCTRLSPY ICWGCISLKE
IFKRANISKN NNSRMLKSRL TWHCHFIQKL ESEPELEFRE YHPFFKNIRE KNNELLYSWS
SGNTGFPFID ACMRSLNFNG WINFRMRAML MSFASYNLWL PWQDSGSELA NKFVDYEPGI
HWNQCQMQSG TTSINTNRIY NPIKQGKDHD PQGKFIKKWI PELKDISLNF IHEPWLLSIF
NQEEYEKINY IRPIIDIPIS TRTAKKKIQE ITKKDGYWDI SKEIYLKHGS RKRPRKNINN
KKNVSKEKEK QYELKLDF