Gene Information |
Locus tag | P9303_22321 |
Symbol | cpeY |
ID | 4778244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1976042 |
End bp | 1977391 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640087749 |
Product | putative bilin biosynthesis protein CpeY |
Protein accession | YP_001018232 |
Protein GI | 124023925 |
COG category | |
COG ID | |
TIGRFAM ID | |
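The coordinates and lengths in this table can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1976042
end_bp = 1977391
reported_gene_length = 1350    # bp
reported_protein_length = 449  # aa

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends in one stop codon that is not
# counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length == reported_gene_length)        # True
print(remainder)                                  # 0
print(protein_length == reported_protein_length)  # True
```

Under these assumptions the reported gene length (1350 bp) and protein length (449 aa) agree with the coordinates.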
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAGAGC TGAAGAGCCC ATTCGATAAT CTTCCATCTC TCAGCCAGGA GGAGGCTCTC AAGATTCTTT GTACGCCTAT CAATAAGTTG GAGTTGGCCA GTGATTACTA CAAGGCAGCC TTTCACCTAA GCAAATTCCC TGGACCCATG ACTGAACGGG CCTTGTTGCG CCTGATTGAG TCAGAGTCCT CTGAATTTCC TGTTGTGATT GCCCGTAAAA AGGCCGTTGA GGGCTTGGCT CGTCTTCGAT GTACCGCTGC AATTCCTGCC ATTGGTCGCT GCCTCACAAG CAGTGATCCA TATTTGGTTG AGATTTCGGC TTGGGCACTA CAAGAGTTGG ATTGTCAGGA TCCTGATCTA CATCAACTCA TGATGTCCTT GCTAGATGAT CCCAAGCAGC ATCGGCGCGT GTTGATTCAG AGCCTGTCTT GTCTTGGGGT GGTTTCAGCG GCTCCAAGAA TTAAGTCTTT GCAGGATGAT GCCAACCCTG GGGTTAGTGG TGCTGCTCTT GCTGCAGTGT TTAAGCTTTG TGGGCAAAGA GCTCGATTGG TGGAGCTGGA GTTGCATCTT GCCCTTCCTG TTCAGATGGA TCGTCATCTT GCAATTCAGG ATGTTATCGA TGCAGGTGAG TTTGACTTAC TTAAGGCCAC GTTAAGGGCA CCTGTATCTC CGACCTTTCG AATGCGGGCC TTGAATGCTC TTTGGCCTGA AGAGGTGGGG CAGCAGAATG GTCTTGATCT GTTGGTTATT TTGGATGGGC TGATGCGTGA TGACCCAGAT GATCTTGATC TCGTGCATCA TTATGATGAA TCTCCTACAG ATGCCTTCCT GATAGAGGAA CTATTCGCGA CTGATTTCAG TCGCTGTTAT CTTGCTGTTC AGACCTTGCG TAGTCGTAAT CCAAAAGAGC TTTGGCCTTT GTTGTTGAAA TGCTGGCAAC GAGCCGAGAA GGATTATGGC GCGCTCTATT TCTTTATGCT TCTTTTTAGA TGTATGACCG ATTGGCCGGA AACAGCACAA CAGAAAATTC AGGATTTATG TTTTTTTGCA CTTGATAGGC GTTGGCCTGA TTTTATTAAA TTCAAACCGG CGTCCATTCT CACACTGATG CAATATAGCC CCGAGATTGG TTGTTCCTAT TTGTCTCAAT GGCTGAACCC AGGTAAGTCT CCTTACTGGG CCTGTCGATA TGCAGCATTA TTGGCTATTG AGCCTTTGCT TCATGTAGAA GAATGGGGCA CATTGGTAGA GAATGTGGCT AGAAATAAGG AGGACCCACA TCGCTTTGTT CGAGCTAAAG TGAACAGTCT TGAGATGAAT CGTATCGGGG CTTCTCCTCC TGTAATTTAG
|
Protein sequence | MKELKSPFDN LPSLSQEEAL KILCTPINKL ELASDYYKAA FHLSKFPGPM TERALLRLIE SESSEFPVVI ARKKAVEGLA RLRCTAAIPA IGRCLTSSDP YLVEISAWAL QELDCQDPDL HQLMMSLLDD PKQHRRVLIQ SLSCLGVVSA APRIKSLQDD ANPGVSGAAL AAVFKLCGQR ARLVELELHL ALPVQMDRHL AIQDVIDAGE FDLLKATLRA PVSPTFRMRA LNALWPEEVG QQNGLDLLVI LDGLMRDDPD DLDLVHHYDE SPTDAFLIEE LFATDFSRCY LAVQTLRSRN PKELWPLLLK CWQRAEKDYG ALYFFMLLFR CMTDWPETAQ QKIQDLCFFA LDRRWPDFIK FKPASILTLM QYSPEIGCSY LSQWLNPGKS PYWACRYAAL LAIEPLLHVE EWGTLVENVA RNKEDPHRFV RAKVNSLEMN RIGASPPVI
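The gene sequence is listed in coding orientation (it begins with GTG and ends with the stop codon TAG), so it can be translated directly even though the gene lies on the minus strand. The following is a minimal sketch of that translation, not the database's own pipeline. It relies on translation table 11 sharing its amino-acid assignments with the standard code, and it assumes a deliberately small set of alternative start codons (`ALT_STARTS`), which is a simplification. The function name `translate_cds` is hypothetical.

```python
# Standard genetic code in NCBI order (bases T, C, A, G). Translation
# table 11 uses the same amino-acid assignments and differs only in
# which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a simplified set of common bacterial start codons.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in ALT_STARTS:
            protein.append("M")  # a start codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_30_bases = "GTGAAAGAGC TGAAGAGCCC ATTCGATAAT"
print(translate_cds(first_30_bases))  # MKELKSPFDN
```

The output matches the first block of the protein sequence, including the leading M that results from the GTG start codon.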