Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_03891 |
Symbol | cpeY |
ID | 4780219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 359884 |
End bp | 361188 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 640083657 |
Product | putative bilin biosynthesis protein CpeY |
Protein accession | YP_001014218 |
Protein GI | 124025102 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.312651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0450059 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAATCTA ATCCATTTAA TAATCTACCG AAAATCAATA AAATAGACGC TATCAATATT CTTAGAAGAC CAATTTCTGA GGTTAAGCTT TTAGCAGATT ATTATAAGGC TGTATTCCAC TTAGCAAATT TTCCTTGTGA AGAATCAGAA CTGGTCCTTC TTGACTTTAT TAAACATGAC TGTGAAAAAC TTGAATATAA GATAGCTAAA AGGAAAGCAA TTGAGGTACT CGCTAATTTC GGTTGCAAAA AAGCCATCCA AGCTATTGCG GAGTTTCTAG AAAATGATGA TGATTATCTT GTTGAGACAG TTATTTGGTC ATTAGCTAAA CTTAAATGTA ATGATATTGA TATCATTAAC AAGATTTGTT CAAAATTATA TAAGCAATTT AATAATAAAA GAGTAGTAAT ACAAACATTA ACTCATCTAG GAGTTAGAAA AGAAATAGAT ATGATTAGAT CATTATCAAG AGATAAACAA TCCTCCAATG GAGTTAAAGG AGCCTCTTTT GCGGCATTAA TAAAACTTGC TGGTGAAGAG GATAAGCTGA CTGATCTGAA AAAGTTTTTG AGACTATCAA ATCAAAACGA TAGGCATTGT GCAGTTCAAG ATATTATAAA TGCTGGTCAT TTATCTGTTT TACCTGATTT AATTAAGGCG CCACTTTCTC CATCATTTAA ATTACAGGCA ATAGATTCTC TTTGGATTAA TGAAGTAGTA TTATGTGAAA ATATAAATCT ATTTAATTGT ATAGACTCAG TAGTTGTTGA TGATCCAAGG AATATAGATA CTTTAAAAGT TAATAATTTT AATAAAGACT TGAGTTTTCT TATTGAGCAA CTTTTTCATA CAGATTTTAA TAGATGTTAT CAGTCAATCA AAGAATTACT AAAATTCCCT TTAGATAAAG TTTTATATTA TCTAAACAAT AATTGGGATA GAGCCAAATC AGACTATGGA GCTATATATT TCTTTATTAA TGTATATAAA CTACTATTAG ATCAGCAATT ATATGATGAA TTTCTTTTAG ATAAAGTAGG TTTTTTGCTA TCCGATGATT GGCCTGATTA TATGAAATTT AAATCTTCAG CAATACAAGT ATTGGGTTGC TTAAATGAAA ATAAATTTTA TAATAATATA ATTTATTTTT CAGATGAGAG TCATACACCT TATTGGAAAA ATAGATATAC TGCTTTGCTT GTATTACAAA ATAAGCAAAT TCATATTAAA AATAAATTCG CTAAATTATT TTTCAATGAC AGTCATAGAT TTGTGAGATT CAAAGCAAAA GAAATTAGTA CTTAG
|
Protein sequence | MQSNPFNNLP KINKIDAINI LRRPISEVKL LADYYKAVFH LANFPCEESE LVLLDFIKHD CEKLEYKIAK RKAIEVLANF GCKKAIQAIA EFLENDDDYL VETVIWSLAK LKCNDIDIIN KICSKLYKQF NNKRVVIQTL THLGVRKEID MIRSLSRDKQ SSNGVKGASF AALIKLAGEE DKLTDLKKFL RLSNQNDRHC AVQDIINAGH LSVLPDLIKA PLSPSFKLQA IDSLWINEVV LCENINLFNC IDSVVVDDPR NIDTLKVNNF NKDLSFLIEQ LFHTDFNRCY QSIKELLKFP LDKVLYYLNN NWDRAKSDYG AIYFFINVYK LLLDQQLYDE FLLDKVGFLL SDDWPDYMKF KSSAIQVLGC LNENKFYNNI IYFSDESHTP YWKNRYTALL VLQNKQIHIK NKFAKLFFND SHRFVRFKAK EIST
|
| |