Gene NATL1_15641 details

Gene Information

Locus tag: NATL1_15641
Symbol:
ID: 4780521
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1269802
End bp: 1271292
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 29%
IMG OID: 640084846
Product: hypothetical protein
Protein accession: YP_001015386
Protein GI: 124026270
COG category: [R] General function prediction only
COG ID: [COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase
TIGRFAM ID:
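
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1269802, 1271292)
print(length, protein_length_aa(length))  # 1491 496
```

Under these assumptions the result matches the reported gene length of 1491 bp and protein length of 496 aa.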


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.681049
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGAAAGA TTTTCCTGAT TTTTCCAAAT CAATTATTCA AAATAAAAAA ACAATTTACT 
GACGTTAGTC ATATTGCTCT GATCCAAGAT AGCTTATTTT TTGGTTGTGA TTCTCAATGG
CAACAAAAGT TTCATTGTAA CAAAATAATT TTCCACAAAG CTACTATGGA CTCTTACGAA
GAAGATCTCA AATCTCAGGG GTTCAATGTA ATTTATTTAA AACATCAAAG AGAAAGCAGA
ACAGAAGATA ATCTCAATTA TCTCTCAGAA AAAGGTTTTA ACTATTTCAT CACTTATGAA
GCGTTTGATT GGTCGCTAGA AAAAAGAATT AAGGATTTCT CTTTGAAAAA GAATATCAAG
TTGGAAATAA GAAAAAATGA TATGTTTTTA ACTTGTAAAG ATATATCTGA AGAAATACTT
AATCAAAAAA AAATTTATGG AATGCAGAAA TTTTATAAGA TTCAAAGAAA AAGCCTAAAT
ATACTTATCG AAAAAGATGG TTCGCCAACA GGGGGGACAT GGAGTTTTGA CAAAATGAAC
AGAAAGAAGC TTCCAAATTC AATTGAAGTT CCTAGAATAC CAACTATAAA AACAAGCAGA
TTACTAGATA AAGCTAAGAA AGAAGTTTCT ATAAATTATA AAGATTATTA TGGGAGCACA
GAAAACTTTA ATTATCCATT GTCTCATAAA GATGCTGAAG AATGGTTAGA TAATTTTTTA
ATTGAAAGAT TTAATTTATT TGGAGATTAT GAAGATGCAA TACATTCAAA TCATAGGACA
CTTTGGCATA GTGTTCTTTC TCCATTAATT AATTCCGGAT TACTTACTCC GAGACAAATA
ATAGATAAAT CATGGGAGTT TTATCAATCA AACAATATTG GGATTAATTG CTATGAAGGA
TTTGTTAGAC AAATTATTGG CTGGCGTGAA TTTATCCTAT TAATGTATAA ACGAAATAGT
TTAGAACTAA GAAATGGAAA TTTCTGGGAT TTTGAGGACA AACCAATACC CTTAAGTTTT
TACACTGGTC AAACAGGAAT AAGGCCTTTA GATGACTCAA TAAAAAATAT TTTAGAGACA
GGATATGCTC ATCATATAGA AAGACTAATG ATAGTTGGAA ATTTAATGCT TCTATGCAGA
TTTCATCCAA ATCAAGTATA CAAATGGTTT ATGGAATTAT TTATAGATTC ATATGATTGG
GTTATGGTTC CAAATGTTTA TGGAATGAGT CAATTTTCAG ATGGAGGACT ATTTACAACC
AAACCATATA TTTCTGGCTC TAATTATATT CGAAAAATGT CTAACTATAA ATCTGAAGAT
TGGTGCTCAA CTTGGGATAG TCTTTTTTGG ACATTTATAG ATGATTATAA AAATAAGTTC
AAGGACCAAT ATCGTTTGTC AATGATTTTA AGGAATTTAG AAAAAATGGA CCCTAATAAA
AAAATGAACC ACAGACGTAA TGCTAATGAA TTCTTGTCTA AACTAAATTA A
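
The gene sequence is printed in blocks of ten bases, so whitespace has to be removed before its length or GC content can be compared with the values in the record. The following is a minimal sketch of that step; the helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed fraction describes that excerpt rather than the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence, copied as printed (blocks of ten bases).
excerpt = "TTGAGAAAGA TTTTCCTGAT TTTTCCAAAT CAATTATTCA AAATAAAAAA ACAATTTACT"
seq = clean_sequence(excerpt)
print(len(seq), round(gc_fraction(seq), 2))
```

Pasting the full sequence into `excerpt` would allow the length and GC percentage to be compared with the reported 1491 bp and 29%.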
 
Protein sequence
MRKIFLIFPN QLFKIKKQFT DVSHIALIQD SLFFGCDSQW QQKFHCNKII FHKATMDSYE 
EDLKSQGFNV IYLKHQRESR TEDNLNYLSE KGFNYFITYE AFDWSLEKRI KDFSLKKNIK
LEIRKNDMFL TCKDISEEIL NQKKIYGMQK FYKIQRKSLN ILIEKDGSPT GGTWSFDKMN
RKKLPNSIEV PRIPTIKTSR LLDKAKKEVS INYKDYYGST ENFNYPLSHK DAEEWLDNFL
IERFNLFGDY EDAIHSNHRT LWHSVLSPLI NSGLLTPRQI IDKSWEFYQS NNIGINCYEG
FVRQIIGWRE FILLMYKRNS LELRNGNFWD FEDKPIPLSF YTGQTGIRPL DDSIKNILET
GYAHHIERLM IVGNLMLLCR FHPNQVYKWF MELFIDSYDW VMVPNVYGMS QFSDGGLFTT
KPYISGSNYI RKMSNYKSED WCSTWDSLFW TFIDDYKNKF KDQYRLSMIL RNLEKMDPNK
KMNHRRNANE FLSKLN
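
The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where TTG is one of several permitted start codons and an initiating codon is read as methionine. The sketch below illustrates this with a hand-built codon table; it is not the database's own pipeline, the function name is hypothetical, and it assumes an unambiguous DNA sequence (A, C, G, T only) that is already in frame.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate an in-frame CDS, reading a table 11 start codon as M
    and stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if pos == 0 and codon in TABLE11_STARTS:
            residue = "M"
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First six codons of the gene sequence.
print(translate_cds("TTGAGAAAGA TTTTCCTG"))  # MRKIFL
```

The output matches the first six residues of the protein sequence, MRKIFL.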