Gene NATL1_03891 details


Gene Information

Locus tag: NATL1_03891
Symbol: cpeY
ID: 4780219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 359884
End bp: 361188
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 28%
IMG OID: 640083657
Product: putative bilin biosynthesis protein CpeY
Protein accession: YP_001014218
Protein GI: 124025102
COG category:
COG ID:
TIGRFAM ID:
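
The coordinates and lengths in this record agree with each other, which a little arithmetic can confirm. The Python sketch below assumes that the start and end positions are 1-based and inclusive (the record does not say so, but the numbers only agree under that convention) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(359884, 361188)
print(length_bp)                  # 1305
print(protein_length(length_bp))  # 434
```

Both values match the Gene Length and Protein Length fields above.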


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.312651
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0450059
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAATCTA ATCCATTTAA TAATCTACCG AAAATCAATA AAATAGACGC TATCAATATT 
CTTAGAAGAC CAATTTCTGA GGTTAAGCTT TTAGCAGATT ATTATAAGGC TGTATTCCAC
TTAGCAAATT TTCCTTGTGA AGAATCAGAA CTGGTCCTTC TTGACTTTAT TAAACATGAC
TGTGAAAAAC TTGAATATAA GATAGCTAAA AGGAAAGCAA TTGAGGTACT CGCTAATTTC
GGTTGCAAAA AAGCCATCCA AGCTATTGCG GAGTTTCTAG AAAATGATGA TGATTATCTT
GTTGAGACAG TTATTTGGTC ATTAGCTAAA CTTAAATGTA ATGATATTGA TATCATTAAC
AAGATTTGTT CAAAATTATA TAAGCAATTT AATAATAAAA GAGTAGTAAT ACAAACATTA
ACTCATCTAG GAGTTAGAAA AGAAATAGAT ATGATTAGAT CATTATCAAG AGATAAACAA
TCCTCCAATG GAGTTAAAGG AGCCTCTTTT GCGGCATTAA TAAAACTTGC TGGTGAAGAG
GATAAGCTGA CTGATCTGAA AAAGTTTTTG AGACTATCAA ATCAAAACGA TAGGCATTGT
GCAGTTCAAG ATATTATAAA TGCTGGTCAT TTATCTGTTT TACCTGATTT AATTAAGGCG
CCACTTTCTC CATCATTTAA ATTACAGGCA ATAGATTCTC TTTGGATTAA TGAAGTAGTA
TTATGTGAAA ATATAAATCT ATTTAATTGT ATAGACTCAG TAGTTGTTGA TGATCCAAGG
AATATAGATA CTTTAAAAGT TAATAATTTT AATAAAGACT TGAGTTTTCT TATTGAGCAA
CTTTTTCATA CAGATTTTAA TAGATGTTAT CAGTCAATCA AAGAATTACT AAAATTCCCT
TTAGATAAAG TTTTATATTA TCTAAACAAT AATTGGGATA GAGCCAAATC AGACTATGGA
GCTATATATT TCTTTATTAA TGTATATAAA CTACTATTAG ATCAGCAATT ATATGATGAA
TTTCTTTTAG ATAAAGTAGG TTTTTTGCTA TCCGATGATT GGCCTGATTA TATGAAATTT
AAATCTTCAG CAATACAAGT ATTGGGTTGC TTAAATGAAA ATAAATTTTA TAATAATATA
ATTTATTTTT CAGATGAGAG TCATACACCT TATTGGAAAA ATAGATATAC TGCTTTGCTT
GTATTACAAA ATAAGCAAAT TCATATTAAA AATAAATTCG CTAAATTATT TTTCAATGAC
AGTCATAGAT TTGTGAGATT CAAAGCAAAA GAAATTAGTA CTTAG
 
Protein sequence
MQSNPFNNLP KINKIDAINI LRRPISEVKL LADYYKAVFH LANFPCEESE LVLLDFIKHD 
CEKLEYKIAK RKAIEVLANF GCKKAIQAIA EFLENDDDYL VETVIWSLAK LKCNDIDIIN
KICSKLYKQF NNKRVVIQTL THLGVRKEID MIRSLSRDKQ SSNGVKGASF AALIKLAGEE
DKLTDLKKFL RLSNQNDRHC AVQDIINAGH LSVLPDLIKA PLSPSFKLQA IDSLWINEVV
LCENINLFNC IDSVVVDDPR NIDTLKVNNF NKDLSFLIEQ LFHTDFNRCY QSIKELLKFP
LDKVLYYLNN NWDRAKSDYG AIYFFINVYK LLLDQQLYDE FLLDKVGFLL SDDWPDYMKF
KSSAIQVLGC LNENKFYNNI IYFSDESHTP YWKNRYTALL VLQNKQIHIK NKFAKLFFND
SHRFVRFKAK EIST
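
The protein sequence can be reproduced from the gene sequence with translation table 11. The Python sketch below is illustrative only. It builds the codon table from the NCBI base ordering (TCAG), translates the first codon as M when it is one of a few common bacterial start codons (a simplification, and the reason the gene begins with GTG while the protein begins with M), and stops at the first stop codon. `clean` and `translate_table11` are hypothetical helper names, not functions from any library.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order; table 11 assigns the same
# residues to codons as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplifying assumption: only these common initiators are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate_table11(dna: str) -> str:
    """Translate a CDS, reading an initiator first codon as M."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_table11("GTGCAATCTA ATCCATTTAA TAATCTACCG"))  # MQSNPFNNLP
```

Passing the full gene sequence to `translate_table11` should give a string that can be compared with the protein sequence after running it through `clean`.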