Gene NATL1_03651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_03651 
SymbolphrB 
ID4780782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp337695 
End bp339209 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content32% 
IMG OID640083633 
Productputative DNA photolyase 
Protein accessionYP_001014194 
Protein GI124025078 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.305782 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCAGA GCATGCCAAC TCTTGAATAT AAGATGACAA AATTTCAATC AATATTTTGG 
CATAGGAGAG ATTTAAGATT TGGAGATAAT ATCGGCCTAT TCGAAGCATC AAAAAATTCA
AAAAGCCTCA TAGGAGTATA TGTTTTAGAT CCCAACCTCT TAGATCTAAA TAGAACTACA
TCTGAAGCAA AAAACTGGTT TTTAGGTGAA AGTCTTTTAG AACTACAAAA GAATTGGGAA
ATTAGAGGAA GTCTTTTATT AATATTAAAT GGAGATCCTA TTGAATTAAT ATCTAAATTA
GCTGAGTTAG TTCATGCTGA ATGTATTTAC TGGAACGAGA ATATTGAACC TTATGAAATT
AATAGAGACA AACAAATTGC AGAAAAACTT TCAAAAGAAA AAAGGAAAGT TTATACATTT
CTAGATCAAT TAATTGTTAA CCCAAGTGAT ATCAAAACAA ACAACGATGA ACCATACAAG
GTATATGGAC CTTTTTATCG CAAATGGATT GATATAATCA ATAGAACCAA ATCATCAGAT
AATAATTTAA TACAAACCTC AGAGACAGCT AAAAAACTTA CAGGCCTGAA TGAAAGAGAA
TTATCATCAA TTAAAAACTC TGATTTAAAC TACTGCATTA CAAAAAAAAG CAAATCTATT
TACGAATTAC TTTCTTCAAA TAGATTCAGC AATACCAGTC TATGTCCTTG CAAGCCGGGA
GAATCGGAAT CAATAAAACA ATTAAACTCT TTTATACATT CAGGAGTTAT AAATTCATAC
AATCAAGCAA GAGATATTCC ATCTTTAGAG AATACTTCTA ATCTAAGTGC TGCTTTAAGT
TTAGGCACTA TAAGTTGTAG AGCAGTATGG AATGGGGCTC AAGTATCAAA AAGTTCGACA
CATGACGAAT ATAAAATAAA TTCTATTGAT ACATGGATAA AGGAACTTGC TTGGAGAGAG
TTTTATCAAA ATGCTCTCAT TAATTTTCCA GAACTTGAGA AAGGTCCATA TAGGGAGAAA
TGGTTAAATT TTCCATGGCA AAATAGACCT GATTGGTTTG AAGCGTGGGG AGATGGATTG
ACTGGTATCC CAATAATTGA TGCTGCAATG AGACAACTTA AGTGTTCTGG ATGGATGCAT
AATCGTTGCA GAATGATTGT TGCCTCTTTT CTAGTAAAAG ACCTTTTAAT TGATTGGAGA
TTGGGAGAAC TTTTCTTCAT GAAAAGTTTA GTTGATGGTG ACTTGGCAGC AAACAACGGA
GGTTGGCAAT GGAGTGCTAG CAGCGGAATG GATCCAAAAC CTATGAGAAT ATTTAATCCT
TTTAGACAAG CTTCTAAGTT CGACGAAGAT GGTGAATATA TCCGAAAATG GATTCCTGAG
TTATCACATA TTTCAACACC TAATTTACTT TCAGGTGAAA TAAGTTCTGC GGAGAGAAAT
AGCTATCCCA ACCCAATAAT AAATCACAAA AATCAAACAT CAATCTTCAA AGAATTATAT
TCGAATATTA AATAA
 
Protein sequence
MVQSMPTLEY KMTKFQSIFW HRRDLRFGDN IGLFEASKNS KSLIGVYVLD PNLLDLNRTT 
SEAKNWFLGE SLLELQKNWE IRGSLLLILN GDPIELISKL AELVHAECIY WNENIEPYEI
NRDKQIAEKL SKEKRKVYTF LDQLIVNPSD IKTNNDEPYK VYGPFYRKWI DIINRTKSSD
NNLIQTSETA KKLTGLNERE LSSIKNSDLN YCITKKSKSI YELLSSNRFS NTSLCPCKPG
ESESIKQLNS FIHSGVINSY NQARDIPSLE NTSNLSAALS LGTISCRAVW NGAQVSKSST
HDEYKINSID TWIKELAWRE FYQNALINFP ELEKGPYREK WLNFPWQNRP DWFEAWGDGL
TGIPIIDAAM RQLKCSGWMH NRCRMIVASF LVKDLLIDWR LGELFFMKSL VDGDLAANNG
GWQWSASSGM DPKPMRIFNP FRQASKFDED GEYIRKWIPE LSHISTPNLL SGEISSAERN
SYPNPIINHK NQTSIFKELY SNIK