Gene A9601_15621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_15621 
Symbol 
ID4718289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1337981 
End bp1339135 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content27% 
IMG OID640079288 
ProductDNA photolyase-like protein 
Protein accessionYP_001009952 
Protein GI123969094 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.299984 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATTTT TATTAAAAGC ACAAAATACC TGGGAAAATT TTGCGAAATA CAAAATTAAT 
GATTATGCAA AATTAAGAAA TTTTGATTTT GGGCCAAATA ACGAAAGTTC AGTTTCAAAA
TTATCGCCTT TCATTACTCA TAGAATATTA TCGGAATATG ATCTGATCCA TGATATTAAA
AGTAAGTACA AAATCAAAAA TTCAACTAAA TTTGTTGAAG AAATATTTTG GAGAGTTTAC
TGGAAAGGGT GGATGGAAAA TAGACCTAAA GTTTGGAGAA ATTTTATTTC AGAAAATAAT
CTCGATTTTG ATTATGAGCT ATATGAAAAT GCAATTAATG GCAATACAGA ATTAGATTTT
TTTAATTCTT GGGTTCATGA ATTAAAGCAG TACAACTATT TGCATAATCA TACAAGAATG
TGGTTTGCGA GTACTTGGAT ATTTAATTTA GGCCTCCCAT GGCAATTAGG AGCAAAGTTT
TTCTTTAAAT ATCTTTTTGA TGGAGATGCT TCATCTAATC TCCTTAGCTG GAGATGGGTT
GGAGGATTGC AAACGAAGGG AAAACAATAT CTTTTTTCAT CATCAAACCT CAGAAAGTTT
TCTAATAATA GATTTAATGT GGAAAAAATA AGTAATCAAC AAATTTTTCT TGAAGAATCT
AATCAAATAC CATTTGAAGA TGAGATTTAT AAAAATGATA TGGATCCTAA ATCAGATAAT
CTGATTATGT TTGAAAATGA TCTGCACCTT GCAACTCTTA AAAATTTACT TCCAAGCTAT
AAAAAAGTAT TTATTATCCT TTTAAAAAAT GAACAAAGAC AAATTAAATT GTCTGAATCT
GTTTTGAAAT TTAAACAAGA TTTGGTCTCT GAATTTGTAG AGCAATTTGA TAATGTTAAA
CAGATTGATC CTTATTCACT GGAAAATACT TTTAAAAATA CCAATGAAAT AGACATTATT
TATCCTGGAG TGGGAGAAAA TTATGATTTC ATAACTGAGT TTAAAAATTT ACACCATAAA
GAAATTTTTA ATCTTGTGAG GGATGAAGAT TTATTTGCTT GGAAATTTGC TAAAAGAGGG
TTTTTTAAAT TTAAAGAAAA TATTCCAAAA ATAAATCAGA GAATATTAGA AAATTTTTCA
AAAAACAATT TTTAA
 
Protein sequence
MSFLLKAQNT WENFAKYKIN DYAKLRNFDF GPNNESSVSK LSPFITHRIL SEYDLIHDIK 
SKYKIKNSTK FVEEIFWRVY WKGWMENRPK VWRNFISENN LDFDYELYEN AINGNTELDF
FNSWVHELKQ YNYLHNHTRM WFASTWIFNL GLPWQLGAKF FFKYLFDGDA SSNLLSWRWV
GGLQTKGKQY LFSSSNLRKF SNNRFNVEKI SNQQIFLEES NQIPFEDEIY KNDMDPKSDN
LIMFENDLHL ATLKNLLPSY KKVFIILLKN EQRQIKLSES VLKFKQDLVS EFVEQFDNVK
QIDPYSLENT FKNTNEIDII YPGVGENYDF ITEFKNLHHK EIFNLVRDED LFAWKFAKRG
FFKFKENIPK INQRILENFS KNNF