Gene NATL1_15661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15661 
Symbol 
ID4780515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1271884 
End bp1272999 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content30% 
IMG OID640084848 
ProductDNA photolyase-like protein 
Protein accessionYP_001015388 
Protein GI124026272 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.675982 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.652082 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATTT TTGATGAGGC TAATGATTTA TTAGATAATT TCATTTTCAA TCACCTTAAT 
TCATACCATC AACTAAGAAA TTACGATTAT GGGATCGAAG ATAGGACTAA TGTTTCTCAG
ATTTCTAAAT ATACATCACA TAGAATTCTC TACGAATTTG ACATAATTGA AAAATTAAAA
AAGTATGATA AGAAACAAAA GTACACTGAT GAAATCCTTT GGAGAATCTA TTGGAAGGGT
TACCTTGAAA ATTACAAATC TATATGGTTT GAATATATAA ATTTCAAAGA AAACTCAAAT
AATTCATACT TAATTAGTTC TGCGATAAAT GGTAAGACAG GTATAGATTG CTTTGATACA
TGGATAGAGG AGCTTAGAGA GAATAACTAT TTGCATAATC ACGCAAGGAT GTGGTTCGCA
AGTATATGGA TTTTCACTCT GGGACTTCCA TGGCAATTAG GAGCGAGGCT TTTTATGAAA
CATCTGCTAG ATGGGGATGC TGCATCAAAC ACCCTTAGCT GGAGATGGGT GGCTGGAATG
CACACTAATA AGAAGCCTTA CTTAGCATCT AAAGAAAACA TCAACAAGTA CACAGTTAAT
CGATTTAGAG ATACATCAAT TAGCCTCTCG AGCAAAATTA ATATCATAAA ACATAGTCAA
CATAAATCGA ATAAACTTCC AGTTCAAAGA AGTTTTCCTA ATAGTAATAT TCTAATAATG
TTTGATAATG ATATGGATAT CATGAGCAGA TCTACGTTAT TTAATTCATA CTCAAAAGTT
TATATATTGC GTAATATAGC CATAAATAAT GAATTTGATC TAAGTGAGAA CGTGAGTCAA
TTCAAACGAG GTTTAATTGA TAAAGTAAAT AAGTTAATTC CAAACTCAGA AGTATTAAAA
TCAACTGACC TAGGAATTAA TTTGTCTGGT CACAATTTTA TTGATGTTAT CTACCCAGGA
GTTGGTCATA ATCTAGATTT AATAAATAAA TTCGCCAATC AAAATCAAAT CATTTTTAAT
TATATATATA GAGAAGATGA CCTTAAATAT TGGAATTATG CAAATTCAGG ATTTTACAAA
TTTAAGACTT CATTTAATAG AATTAATATG ATCTAA
 
Protein sequence
MNIFDEANDL LDNFIFNHLN SYHQLRNYDY GIEDRTNVSQ ISKYTSHRIL YEFDIIEKLK 
KYDKKQKYTD EILWRIYWKG YLENYKSIWF EYINFKENSN NSYLISSAIN GKTGIDCFDT
WIEELRENNY LHNHARMWFA SIWIFTLGLP WQLGARLFMK HLLDGDAASN TLSWRWVAGM
HTNKKPYLAS KENINKYTVN RFRDTSISLS SKINIIKHSQ HKSNKLPVQR SFPNSNILIM
FDNDMDIMSR STLFNSYSKV YILRNIAINN EFDLSENVSQ FKRGLIDKVN KLIPNSEVLK
STDLGINLSG HNFIDVIYPG VGHNLDLINK FANQNQIIFN YIYREDDLKY WNYANSGFYK
FKTSFNRINM I