Gene OSTLU_42354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42354 
Symbol 
ID5003323 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp690181 
End bp692178 
Gene Length1998 bp 
Protein Length631 aa 
Translation table 
GC content59% 
IMG OID640418744 
Productpredicted protein 
Protein accessionXP_001419459 
Protein GI145350096 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0774885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGAGA TCCTGCGCGG CGCCGATGGA CCGCCGAGGA TGTTCAGTCC GTTGGTGACG 
CGCGGCGGCG CGAGACGCGG CGACGCGCCG TTAGCGGTGT ACTTGCCGGG ATTAGACGGC
ACTGGATTCA GCGCGGCGTC GCAGTTTGAG TACATCGCCG ATGAATTCAA TCTCATCGCG
CTGAACGTGC CCGCGGGCGA TCGTGGTGAC GTTTTCGATT TAGTGAAAGC GACGACGGCT
TACTTGGACA CGCACGTCGC GGCGGCGCGC GCGAACGGTG AGAACGAGGA CGTCTATCTC
ATCGGAGAGT CGATGGGTGG TATGCTGTCT TTGTGCGTCG CAAGTGAGCG TCCAGATTTG
ATCACGCGCT TGATTTTGGT CAATCCCGCG AGTTCGTTCG ATCGAAGCGC GTGGCCGGCG
CTCGGCCCGT TGCTGCCGAA CGTCCCGAGC GAATTGTGGG GCGCCGTGCC GTACGCGCTG
ACGCCGGTGC TGATCGATCC CGTACGCATG GCACGCGGTA TGATGGATAA AGTCATGTCG
TCCGCGGTGT CGGACGATCC GTTGACAACC ATCGCGGCGG GGGTGGAAGA GCTCGCCGGA
TTGCTACCGG CGCTTGGCGC GCTGGCCGAA ATCATCCCGC GCGAGACGCT CGCACATCGA
TTGGATAAAG TCCTTCGCAT GGGATGTGAA TACTTGAACA GCGATGATTA CGCCAAGCTG
ACAGCGATTG ACGTGCCCAC GCTCGTCATC GCAAGTGAGA ACGATAATCT GATACCGAGT
TTGGCCGAGA GCGAACGTCT CAGGAAGTTT TTGCCCCGCG CCAAAGTCGA GGTATTGAAA
GGTGCGTCGC ACGCGGCACT TCAGGAGCCG GGGGTCAATG TAATGACCAT CGCGCGTCGA
AATGGGTTCG TTCCAAAGCG TGCAGATGCG CCGGTGATGA CGCGTGACGC AAAGTTTGAT
CCACCGTCGC CGGCGGACAT CGAACGCGCT CGCGAAAGTC TCGCAGGTTT GCGAGCGCTG
ACGTCACCGG TGTTTTTTAG CACGCGACCG GATGGGAAAA TTGTGCGCGG TCTCAGCGCG
GTACCAATAC GCCAACGTGG TTCGCGACCG ATCTTGCTAG TTGGGAACCA CCAAACGATG
GCGCCGGATC TCGGATTTCT AGTAGATGAA TTCTTGCGTG AATACGACGT CTGCCTTCGC
GGCTTGGCGC ATCCTGTGGT GTCGCGCGAA GGCGGTGGCG ATGGATTCGG CGGCGAAGAC
GCACCGCGCT CGTTCGAAGA TACGCTTCGT GACGCTGTGA AGAACACGCC CGTGGAACCG
TTACTGCCGC GTCGAGAGCC GAAGCCCCCG CGGCGCGCGA TGAATATTGT CGGCGGCGGG
TCATCATTCA CGTCTTTCGG CGCCGTGCCC GTCAGTGGCT TCGCGTTGTT TCGCCTACTA
AAACAAGGCG AGGCCGTGTT GCTCTTTCCG GGTGGCGTTC GCGAAGCGTT CAAACGAAAA
AACGAAAAGT ACAAACTCTT TTGGCCTTCC AAGCCAGAGT TCATTCGCAT GGCAATCAAG
CACGACGCGA TAATCGTCCC GTTCGCGGCG ATCGGCGCCG AGGACTCCAT CGACATCGTC
GCCGACGCCA ACGACTTGAT GAATAACCCT ATCGTGGGCG ATTCCGTCCG TAAACGCTCG
CAAAGCGTTC CGAAGGCGCG CGCCGTCGAC ACTCGCGTCA CCGCGGACGC GGGAGAAGAG
GAGTTATTCA TCCAGCCTGT CGTCGTACCC AAAGCCCCTG AGCGCTTCTA CTTTCGTTTC
ATGGCGCCTA TTGACGTGAG TGGAGCGGAT TTGGATGACG AAGAGCGCGT CAAGGCGATT
TACGAGCGAG TATACGGTGA AGTTGAAGGC GGTATACAGT ATCTGTTGCG CGAACGCGAG
AGCGATCCAT TCAAAGAGCT TGCGCCGAGA ATAGTGTTCG AAGCGGCGAC CTCTACGCAG
GCGCCGACGT TTCGTTAA
 
Protein sequence
MREILRGADG PPRMFSPLVT RGGARRGDAP LAVYLPGLDG TGFSAASQFE YIADEFNLIA 
LNVPAGDRGD VFDLVKATTA YLDTHVAAAR ANGENEDVYL IGESMGGMLS LCVASERPDL
ITRLILVNPA SSFDRSAWPA LGPLLPNVPS ELWGAVPYAL TPVLIDPVRM ARGMMDKVMS
SAVSDDPLTT IAAGVEELAG LLPALGALAE IIPRETLAHR LDKVLRMGCE YLNSDDYAKL
TAIDVPTLVI ASENDNLIPS LAESERLRKF LPRAKVEVLK GASHAALQEP GVNVMTIARR
NGFVPKRADA PVMTRDAKFD PPSPADIERA RESLAGLRAL TSPVFFSTRP DGKIVRGLSA
VPIRQRGSRP ILLVGNHQTM APDLGFLVDE FLREYDVCLR GLAHPVVSRE GEPKPPRRAM
NIVGGGSSFT SFGAVPVSGF ALFRLLKQGE AVLLFPGGVR EAFKRKNEKY KLFWPSKPEF
IRMAIKHDAI IVPFAAIGAE DSIDIVADAN DLMNNPIVGD SVRKRSQSVP KARAVDTRVT
ADAGEEELFI QPVVVPKAPE RFYFRFMAPI DVSGADLDDE ERVKAIYERV YGEVEGGIQY
LLRERESDPF KELAPRIVFE AATSTQAPTF R