Gene P9515_03991 details

Gene Information

Locus tag: P9515_03991
Symbol:
ID: 4719587
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9515
Kingdom: Bacteria
Replicon accession: NC_008817
Strand:
Start bp: 361515
End bp: 363026
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 29%
IMG OID: 640080072
Product: putative deoxyribodipyrimidine photolyase
Protein accession: YP_001010715
Protein GI: 123965634
COG category: [L] Replication, recombination and repair
COG ID: [COG0415] Deoxyribodipyrimidine photolyase
TIGRFAM ID:
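
The start, end, and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; names are illustrative only.
start_bp = 361515
end_bp = 363026

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(f"gene length: {gene_length_bp} bp")
print(f"protein length: {protein_length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.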


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATA TTAATATTTT ATGGTTTAAA AAAGACTTAA GAATAAACGA TAATGAGGCT 
CTTATCGAAT CTCTAAAGGA TAGAGACATT ATACCTATAT TCATAATTGA AAAAGAAATA
TGGAGTCAAA AAACTTATTC AGATAGACAA TGGCAATTTT GCAAGGAAAG TCTTTTAGAC
TTAAGAATTT CACTTGCAAA TATTGGTCAA CCTTTAATAA TTAGAACGGG TAAAGTAATT
GAGATTTTTG ATCAAATATC TAATAATTTT GAAATTAAAG CTATTTATAG CCATCAAGAA
ACTGGAGACT ATTTGACATA TAAAAGAGAT CAAGAAGTTA GAAAATGGGC CTCTATGAAA
AAAATTATTT GGAAAGAATA TTTACAATTT TCAGTATTTA GAGGAAAGCT AGATAGAAAT
AATTGGTCAA CAAAATGGAA AAAAAATATG GAGAGAAAAC TATTTACTGA ACCTTCAAAA
ATTAATCCTA TAGAAATTGA TCCTGGAGAA ATCCCACCAG ATAATTTCTT TTGCTTCAAA
GATGATTTCT GCAAAGGGAG ATTAAAGGGT GGAAGAGAAA TTGGTCTAAA GAGAATGGAA
TATTTTTTCA GTAATAAATT AAGTTATTAC TCAAAAGATA TTTCAAGCCC AGAGAAATCA
TTCGATAGCT GTTCAAGATT ATCTCCCTAT ATAAGTTGGG GATGTATTTC TATAAAAGAG
ATTATTCATA AAGCAAATTC AATAACAAAT CCCAATTCTA AAATGTTAAA AAGCAGATTA
ACTTGGCATT GTCACTTTAT ACAAAAACTT GAGAGTGAAC CAGAATTAGA GTTTAAAGAA
TTTCATCCTT ATTTCCAAAA AATCAGAAAA AAAGATAGTC ATTTACTCAA ATTATGGAGT
GAAGGTAAAA CCGGTTTTCC TTTTTTAGAT GCTTGTATGA GATCTTTAAA TTTTCATGGA
TGGCTTAATT TCAGAATGCG TGCAATGCTA ATGTCTTTTG CAAGTTATAA TTTATGGATA
CCTTGGCAAG AATCCGGTTC TGAATTGGCA AGCAAATTTC TCGATTATGA GCCAGGAATT
CACTGGAATC AGTGTCAAAT GCAAGCAGGT ACAACATCTA TAAATGTAAA TAGAATATAT
AATCCTATTA AGCAGGGGAA AGATCATGAT CCCAAAGGAA ATTTTATTAA GAAATGGGTT
CCTGAGATAC AAAATTATCC TGAAAATTTC GTCCATGAAC CTTGGTTGAT GGAAAAATTT
AATTCCAAAG AATATGAAAA CTCTGAATAC ATTAAACCAA TAATTAACCT CTCAGAAACA
ACTAAAAATG CACGAAAAAG AATTCAAGAA ATAACCCAAA GAGAAGGTTA CTGGGATATT
TCAAAAGGAA TTTATATGAA ACATGGCTCA AGAAGAAAAG CTCTAAATAA GAGAAAAAAT
TATCCTAAAA AGAGTAATTT GAGAAAAAAA AGAGAACTTC AAAATCAATT AAATCTTGAT
TTACTAATTT AA
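
A GC content figure like the one in the gene record can be recomputed from a sequence printed in this 10-base block layout. The sketch below is illustrative only: `gc_fraction` is a hypothetical helper, not part of any database tool, and it is applied to the first line of the gene sequence rather than the full gene, so its result is not expected to equal the whole-gene value.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the fraction of G and C bases in a sequence.

    Whitespace (the spaces and newlines separating 10-base blocks)
    is removed before counting.
    """
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as printed (60 bases).
first_line = "ATGAAAAATA TTAATATTTT ATGGTTTAAA AAAGACTTAA GAATAAACGA TAATGAGGCT"
print(f"GC fraction of first 60 bases: {gc_fraction(first_line):.2f}")
```

Passing the complete gene sequence as one multi-line string would give the whole-gene figure to compare against the GC content field.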
 
Protein sequence
MKNINILWFK KDLRINDNEA LIESLKDRDI IPIFIIEKEI WSQKTYSDRQ WQFCKESLLD 
LRISLANIGQ PLIIRTGKVI EIFDQISNNF EIKAIYSHQE TGDYLTYKRD QEVRKWASMK
KIIWKEYLQF SVFRGKLDRN NWSTKWKKNM ERKLFTEPSK INPIEIDPGE IPPDNFFCFK
DDFCKGRLKG GREIGLKRME YFFSNKLSYY SKDISSPEKS FDSCSRLSPY ISWGCISIKE
IIHKANSITN PNSKMLKSRL TWHCHFIQKL ESEPELEFKE FHPYFQKIRK KDSHLLKLWS
EGKTGFPFLD ACMRSLNFHG WLNFRMRAML MSFASYNLWI PWQESGSELA SKFLDYEPGI
HWNQCQMQAG TTSINVNRIY NPIKQGKDHD PKGNFIKKWV PEIQNYPENF VHEPWLMEKF
NSKEYENSEY IKPIINLSET TKNARKRIQE ITQREGYWDI SKGIYMKHGS RRKALNKRKN
YPKKSNLRKK RELQNQLNLD LLI
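
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below shows one way to check that correspondence without external libraries. It assumes the amino acid assignments of NCBI translation table 11 (which are the same as those of the standard code) and ignores the table's alternative start codons, which does not matter here because the gene begins with ATG. `translate` and the constant names are hypothetical helpers introduced for illustration.

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G, first base varying slowest.
BASES = "TCAG"
# Amino acid assignments for translation table 11 ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence, copied as printed.
gene_start = "ATGAAAAATA TTAATATTTT ATGGTTTAAA"
gene_end = "TTACTAATTT AA"
print(translate(gene_start))  # compare with the first 10 residues of the protein
print(translate(gene_end))    # compare with the last 3 residues of the protein
```

The two fragments are used only to keep the example short; the full gene sequence can be passed to `translate` in the same way.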