Gene P9303_30181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_30181 
Symbol 
ID4778854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2672542 
End bp2674059 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content52% 
IMG OID640088542 
Productesterase/lipase/thioesterase family protein 
Protein accessionYP_001019013 
Protein GI124024706 
COG category[R] General function prediction only 
COG ID[COG4188] Predicted dienelactone hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.157772 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTTTT TTCGTTCCCG TCTTTGGCTG CTTGCAATCA GTTTTGGAGC AGGGCTTGGA 
CTTGTTCCAA GCTCTGCTCC TGCACTTGAA CGCTTGGTGT TTGATCTGCC TGTGCTGGAA
AGTCAGATTG AGTTTGAGCT TGGCGCTTCT CAGAGCGCTG GCGATCTAAT TGATGCCAAT
CCCGATTTTG TGGAGTTGGA TCGGGCTACG GATGGTGCTT TTGTGCGACT TCTCAATCAG
GTCTTTAACG CTCCCCTGCC AGCGCAGATT GAGAAGGTGG TTGAGAAGTC TGTCGGGCAG
CCTCTTTTGG AGCAGGCTCT GATCGCAGTA TCCAAGTTGG TTCAAGTTGA GGGGTTGCCC
AAAGACACCA GTGGAAGGAT GTTGCTTGAG GCGCTTTCGC GTGCTTCCAA GAGTGGTCAG
CCAACTGTGC TTGGTTTGTT GCGACAAATC CCTGGTCAAG CTGCATCCAT CAACTTGTCG
AAATTGGCCA GCTATGTCTC ACGGCTACAA CGTAATCAGC TAGCAGCAAA TCTGCTTGTG
GAGAAAGAGG CTTCTGTTCA GATTAAACCT GGATTACGCA TGCCGCTTAG CGGGTTGTGG
TTAAGTCAGC AAGTTGATTT TCAGGCTTCC CATCGCTCTA AACCGATACG GGTGGTGGTG
ATACAACCAA AGTCTCGCTC AAATGGTCGC TTGGTGGTCA TTTCACATGG GCTTTGGGAG
TCTCCGAGAG ATCTTCAGGG TTGGGCTGAA TATCTTTCTG CTAACGGTTA TACGGTGTTG
CTGCCGGAGC ATCAGGGCAG TGATGCTGAT CAGCAGAAGG CGATGTTGGC GGGGGATCAA
CCTCCACCGG GACCTCAAGA GTTGCGTCTT CGTGCGATGG ATGTGACTGC GATGCTCTCT
GCTGTTGAGT CAGGTGGTTT GTTGTCAAGA CTTTCCCTCA ATACAGATGA GGTCGCTGTT
GTTGGTCATT CATGGGGGGC GACTACAGCG ATTCAATTGG CTGGGGCACG CTCAACGGAT
GTGAAGCTCT CTGCTCGTTG TCATAACCAG GATGACCCTG AGCGCAATAT CAGCTGGATA
CTGCAGTGCA GTTGGCTTTC CAAAATCAAT GAGTCTTCTT TTGAAGACTC ACGGGTCAAG
GCAGTTGTGG CGGTGAGTCC GCCGTTACGT CTTCTATTTG ATCCCAGCAG AACTTCAGTT
TTGACGGCCA AGGTTTTGTT GGTTAGTGGC ACTCGTGATT GGGTGGTTCC TCCCGTGCCT
GAGGCTCTGA TGCCCATGCG TGATAGTGGT GCTTTGGAGT TTGGCCATCG CTTGGTGCTT
GCCCAAGATG GTGGTCACTT CAACTTGATG GCACCTGCAA ATCAGCCTCA GCCGGCGATT
TTGGCGCCCC TCATTCTTGC TTGGATTAAT GAACAGCTTG CAAATCCTGG TGTTGTCACC
TTCAGTGGCG GCGGTTGGGG TGATGCCGTG CATCCTTTAG TGGATGTGAC TGATGCGGCT
CTGAATTTGT ATCGCTGA
 
Protein sequence
MMFFRSRLWL LAISFGAGLG LVPSSAPALE RLVFDLPVLE SQIEFELGAS QSAGDLIDAN 
PDFVELDRAT DGAFVRLLNQ VFNAPLPAQI EKVVEKSVGQ PLLEQALIAV SKLVQVEGLP
KDTSGRMLLE ALSRASKSGQ PTVLGLLRQI PGQAASINLS KLASYVSRLQ RNQLAANLLV
EKEASVQIKP GLRMPLSGLW LSQQVDFQAS HRSKPIRVVV IQPKSRSNGR LVVISHGLWE
SPRDLQGWAE YLSANGYTVL LPEHQGSDAD QQKAMLAGDQ PPPGPQELRL RAMDVTAMLS
AVESGGLLSR LSLNTDEVAV VGHSWGATTA IQLAGARSTD VKLSARCHNQ DDPERNISWI
LQCSWLSKIN ESSFEDSRVK AVVAVSPPLR LLFDPSRTSV LTAKVLLVSG TRDWVVPPVP
EALMPMRDSG ALEFGHRLVL AQDGGHFNLM APANQPQPAI LAPLILAWIN EQLANPGVVT
FSGGGWGDAV HPLVDVTDAA LNLYR