Gene P9303_22321 details


Gene Information

Locus tag: P9303_22321
Symbol: cpeY
ID: 4778244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1976042
End bp: 1977391
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 47%
IMG OID: 640087749
Product: putative bilin biosynthesis protein CpeY
Protein accession: YP_001018232
Protein GI: 124023925
COG category:
COG ID:
TIGRFAM ID:
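
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1976042
END_BP = 1977391
GENE_LENGTH_BP = 1350
PROTEIN_LENGTH_AA = 449


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the span is 1350 bp, which corresponds to 450 codons, one of which is the stop codon.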


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAGAGC TGAAGAGCCC ATTCGATAAT CTTCCATCTC TCAGCCAGGA GGAGGCTCTC 
AAGATTCTTT GTACGCCTAT CAATAAGTTG GAGTTGGCCA GTGATTACTA CAAGGCAGCC
TTTCACCTAA GCAAATTCCC TGGACCCATG ACTGAACGGG CCTTGTTGCG CCTGATTGAG
TCAGAGTCCT CTGAATTTCC TGTTGTGATT GCCCGTAAAA AGGCCGTTGA GGGCTTGGCT
CGTCTTCGAT GTACCGCTGC AATTCCTGCC ATTGGTCGCT GCCTCACAAG CAGTGATCCA
TATTTGGTTG AGATTTCGGC TTGGGCACTA CAAGAGTTGG ATTGTCAGGA TCCTGATCTA
CATCAACTCA TGATGTCCTT GCTAGATGAT CCCAAGCAGC ATCGGCGCGT GTTGATTCAG
AGCCTGTCTT GTCTTGGGGT GGTTTCAGCG GCTCCAAGAA TTAAGTCTTT GCAGGATGAT
GCCAACCCTG GGGTTAGTGG TGCTGCTCTT GCTGCAGTGT TTAAGCTTTG TGGGCAAAGA
GCTCGATTGG TGGAGCTGGA GTTGCATCTT GCCCTTCCTG TTCAGATGGA TCGTCATCTT
GCAATTCAGG ATGTTATCGA TGCAGGTGAG TTTGACTTAC TTAAGGCCAC GTTAAGGGCA
CCTGTATCTC CGACCTTTCG AATGCGGGCC TTGAATGCTC TTTGGCCTGA AGAGGTGGGG
CAGCAGAATG GTCTTGATCT GTTGGTTATT TTGGATGGGC TGATGCGTGA TGACCCAGAT
GATCTTGATC TCGTGCATCA TTATGATGAA TCTCCTACAG ATGCCTTCCT GATAGAGGAA
CTATTCGCGA CTGATTTCAG TCGCTGTTAT CTTGCTGTTC AGACCTTGCG TAGTCGTAAT
CCAAAAGAGC TTTGGCCTTT GTTGTTGAAA TGCTGGCAAC GAGCCGAGAA GGATTATGGC
GCGCTCTATT TCTTTATGCT TCTTTTTAGA TGTATGACCG ATTGGCCGGA AACAGCACAA
CAGAAAATTC AGGATTTATG TTTTTTTGCA CTTGATAGGC GTTGGCCTGA TTTTATTAAA
TTCAAACCGG CGTCCATTCT CACACTGATG CAATATAGCC CCGAGATTGG TTGTTCCTAT
TTGTCTCAAT GGCTGAACCC AGGTAAGTCT CCTTACTGGG CCTGTCGATA TGCAGCATTA
TTGGCTATTG AGCCTTTGCT TCATGTAGAA GAATGGGGCA CATTGGTAGA GAATGTGGCT
AGAAATAAGG AGGACCCACA TCGCTTTGTT CGAGCTAAAG TGAACAGTCT TGAGATGAAT
CGTATCGGGG CTTCTCCTCC TGTAATTTAG
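
The GC content field reported for this gene can be recomputed from a sequence laid out in space-separated 10-base blocks like the one above. This is a minimal sketch; `gc_percent` is a hypothetical helper name, and how the source database rounds its percentage is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First 10-base block of the gene sequence: 4 G + 1 C out of 10 bases.
    print(gc_percent("GTGAAAGAGC"))
```

Passing the full gene sequence (all lines joined) to the same function gives a figure that can be compared with the GC content field.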
 
Protein sequence
MKELKSPFDN LPSLSQEEAL KILCTPINKL ELASDYYKAA FHLSKFPGPM TERALLRLIE 
SESSEFPVVI ARKKAVEGLA RLRCTAAIPA IGRCLTSSDP YLVEISAWAL QELDCQDPDL
HQLMMSLLDD PKQHRRVLIQ SLSCLGVVSA APRIKSLQDD ANPGVSGAAL AAVFKLCGQR
ARLVELELHL ALPVQMDRHL AIQDVIDAGE FDLLKATLRA PVSPTFRMRA LNALWPEEVG
QQNGLDLLVI LDGLMRDDPD DLDLVHHYDE SPTDAFLIEE LFATDFSRCY LAVQTLRSRN
PKELWPLLLK CWQRAEKDYG ALYFFMLLFR CMTDWPETAQ QKIQDLCFFA LDRRWPDFIK
FKPASILTLM QYSPEIGCSY LSQWLNPGKS PYWACRYAAL LAIEPLLHVE EWGTLVENVA
RNKEDPHRFV RAKVNSLEMN RIGASPPVI
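
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below uses only the Python standard library. It assumes the input is a complete CDS on the coding strand, reads the alternative start codon (GTG here) as methionine, and stops at the first stop codon. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are illustrative.

```python
from itertools import product

# Amino acid assignments of table 11 (same as the standard code), listed in
# the conventional TCAG order for first, second and third codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for table 11; any of them is read as Met at the start.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS given as plain or space-blocked DNA text."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]  # the initiator codon is translated as methionine
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("GTGAAAGAGC TGAAGAGCCC ATTCGATAAT"))  # MKELKSPFDN
```

The first 30 bases yield MKELKSPFDN, which matches the start of the protein sequence.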