Gene OSTLU_48152 details


Gene Information

Locus tag: OSTLU_48152
Symbol:
ID: 5006953
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009375
Strand:
Start bp: 98121
End bp: 101332
Gene Length: 3212 bp
Protein Length: 979 aa
Translation table:
GC content: 58%
IMG OID: 640422374
Product: predicted protein
Protein accession: XP_001422806
Protein GI: 145357194
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5096] Vesicle coat complex, various subunits
TIGRFAM ID:
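
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the listed gene length). The function names `gene_length` and `coding_nucleotides` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_length_aa + 1)


length = gene_length(98121, 101332)   # values from the record above
coding = coding_nucleotides(979)      # protein length from the record above
print(length, coding, length - coding)  # 3212 2940 272
```

Under these assumptions, the 272 bp difference is sequence that does not encode the protein, which fits a gene flagged as spliced, although the record itself does not say how that remainder is divided between introns and flanking sequence.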


Plasmid Coverage information

Num covering plasmid clones: 49
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0169632
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGCGACGCGA CGCGCGCGCG CGGCGACATG GCGCCGTTTC TCGGTGGTAT GCGCGGCCTG 
ACGGTGTTCG TGCAAGACGT GCGGAACTGC AGTAATAAGG TGCGGCGAGC GCGCGCGCGC
GAGGGATGAC GACGACGGGC GAGGGCATTC GATCGAACGA AACGACGCGC GGTGACGCGT
TGAGCGGCGA GAGCGCGAGA GAGAGGCGCG CGGACTGACG ACGAGGGTCG CGCGCGTCGC
GAACGCGAAC GCAGGAGCAA GAGCGCGCGC GCGTGGAGAA GGAACTGGCG AATATTCGAC
GGAAATTCAA TAAGACCCAC CGCGCGCTCA CGGCGTACGA ACGGAAAAAG TACGTGCTGA
AGCTGCTGTA CATATACATG CTCGGGTATA ACGTGGACTT TGGACACACC GAGGCGCTGA
AGTTGATATC GGCGTCGTCG TACGCGGAGA AACAGGTTGG GTACATGACG ACGTCGGTGA
TATTGAACGA GAGAAACGAG TTTTTGAGAA TGGCCATCAA TAGCATACGC ACGGATGTGA
TCTCGAGCAA TGAGACGAAT CAGTGCTTGG GGTTGTCGTG CATCGCGAAC GTCGGGGGGC
GGGAGTTCGC GGATTCGTTA GCTGGGGACG TGGAGACGAT TGTGATGACG CCGACGATTC
GGCCGGTGGT TCGGAAAAAG GCGGCGCTGT GTCTGTTGAG GTTGTTTCGT AAGAATCCTG
AAATTTTACT CGCGGAAACG TTCGCGTCAA AAATGACCGA CTTACTCGAC GCCGAGCGCG
ATTTGGGGGT GCTCATGGGC GTCTTGGGTT TGTTGCTGGG TCTCGTGCAG CACGATTACC
GAGGGTACGA GGCGTGCGTG CCCAAGGTCA TCGCGTTGTT GGAACGATTG ACGAGGAATA
AGGACATTCC GCCCGAGTAT TTGTACTACG GTATTCCCTC TCCATGGTTA CAGGTGAAGT
GCATGAAGAT TTTGCAGTAC TTTCCCACAC CAGACGATCA GGCGCTGCTC GATTCGCAGC
TCATCGCCAT GCGAAACATC CTCACCAAGA CGGACACGGT GAAAAACTTC AATAAGAACA
ATGCGCTGCA CGCCATCTTG TTCGAGGCGA TCAATTTAGT TACTAGCATG GACTACGCGC
ACGAACTGTT GGACCCGTGC GTGGAGATTC TCGGGAATTT TCTCGACATG AAGGAACCGA
ATATTCGCTA CTTGGCTCTC AACACGCTCA ACGCCCTCGC GGCGATGGCG GATTTGCGAG
AAGCCATAAA GGTGTACCAA GAGCAAGTCG TGGCTGCGTT GCACGACGCG GACATTTCCA
TTCGTCGCCG CGCGTTGACT TTATTGTTTT CTATGTGCGA TGCTTCCAAC GTGCACTCTG
TCATCGAGGA GCTCATCAAG TACTTCGTCA CCGCTGATTT TGACATTCGC GAGGAACTGG
CGCTCAAAAC GGCCATCTTG GCCGAGCGCT ACAGCGTGAA CGATCGCATG TGGTTCATTG
AGATCGCGAT GCAAATGATA GACAAGGCGG GCGATTTCAT CAACGACGAC TTGTGGCATC
GCATGGTGCA AATCGCAACC AACGACGCGT CGCTTCACGG TCGCACGGCG CAATTGATGT
TCGTCAAGTT GCGCGACGAG GGCGCGTCGA ACGAACTCAT GCTTCGCGCG ATGTCGTACT
GCATCGGAGA GTTTGGGTAT TTGCTTCCCA TTCCCGCGTC GCAGTACGTC GATCTCTTAG
TGCCACTGTT CCAGGATACG GATGAGGTCA CGCAGGGCAT CATGCTCACA GCCTTCGTCA
AGGTTGCGAT GCACAAGAAT TGCGATCAGG CGTCGATGGG TAAGATCGTG AAGGTGTTCA
CCGACATGAG CTCATCGTTT GACGTCGAGT TGCAGCAACG TGCAAACGAA TATCTGAAGC
TCTTGCGTCT CGGACCGAAC ATGCGACCGA TTCTCGAGCC CATGCCCGAG TACCCTGAAC
GTTCGAGCGT GTTGGAGAAG CACATACAAG TAGAAAACGT CGCCTCGGAC GTCGCCGCGG
GAGTTCGTAA ACTTGCCATG AGTGGTGTCG TGACGGCGAG AGAGCAACCC CGCGCTCAGG
CGCGGTCGGC GCCGGCGCTT CCTGCAGCCG CAGCACCGCC GGTCGATGCC GTCACAGATT
TGCTCGGCAA CTTGATGGGC GACGGATCTT CGGCGCCGGC GGCGCTTCCG CCGTCGTCGA
CTGGGATGAA TCTCGACGAG CTTCTCGGAA ACGCCCCTCC CGCACTTCCA GCGGTAGAAG
AGCGTCTCGC ACTTCCGAGT TCCACGTCAC CTCCCGCGGG GCCGGTGACG ACCACATCGT
CCGCAGACGC TTTAGACGAT TTACTAGGCT TAGGCGCGCT CGCGGCGACG CAACCGCCGC
CGGCGACGCA CGGCGACGCC TTAGACGCCT TTGGAGCTCT GGGTGCGCCG GCGCCCGCGG
CGCCGGCACC GACGCAACCC GTGGCACCGG TACAATCATT GACGTCGAGC GACGGTATTC
AACCCACGGT GAACGTTCAA GACTGCGCGA AACGGTTCCT CATCGCCGAC AACGGCTTGC
TGTATGAAGA CGCGAACGTA CAGATTGGCG TGAAATCGCA GTGGCAAGGG TCTCAAGGTC
GCGTGATGTT CTACGTTGGG AACAAGTCCG CGAGCGCGGA TCTGCAAAAC TTCAGAATGG
TCATACCGTC GATCGAAGGC TTGCGTCACA GTCTTCAACC CTTCCCCGCG TCCATCGGAC
CGAAGCGCCA GGTGCAGTTG ATGTTGCAAG TAGCGATTAC GTCGGCGTTT GCCTCGGCGC
CAAAACTCGA GTTTTCGTAC ACGTCCACCG CCGTCGCGGC GGCGTGTGCC AGGTCTCTGG
AGTTGCCCGT ACGTGTGACC AAGTTTTTGA GCCCGATGAC CATCGCTTCG CCGCAAGAGT
TCATCGCCAA GTGGCACCAG ATGGCGTCCG CCGGGCAGCA ACAGAAAATT ATGGACGTGT
CGCAGCAGTA CGCGACGAGC ATCGAAAGCG TGTCAAACGC CTTCTCGGGC ATGCGGCTCG
TCGTACATAA AGGCTTAGAT CCAAACCCCG CAAACTTAAT CGCGGGAAGC CGGTTCGTCG
GCGAACGATG CGGTGAAGTC TTTGTGGGCG TTCGCGTGGA GAGCGACGCG AACGTGCGCG
GACGATATAG ATTCACCGTC GCTTCGATGG AC
 
Protein sequence
MAPFLGGMRG LTVFVQDVRN CSNKEQERAR VEKELANIRR KFNKTHRALT AYERKKYVLK 
LLYIYMLGYN VDFGHTEALK LISASSYAEK QVGYMTTSVI LNERNEFLRM AINSIRTDVI
SSNETNQCLG LSCIANVGGR EFADSLAGDV ETIVMTPTIR PVVRKKAALC LLRLFRKNPE
ILLAETFASK MTDLLDAERD LGVLMGVLGL LLGLVQHDYR GYEACVPKVI ALLERLTRNK
DIPPEYLYYG IPSPWLQVKC MKILQYFPTP DDQALLDSQL IAMRNILTKT DTVKNFNKNN
ALHAILFEAI NLVTSMDYAH ELLDPCVEIL GNFLDMKEPN IRYLALNTLN ALAAMADLRE
AIKVYQEQVV AALHDADISI RRRALTLLFS MCDASNVHSV IEELIKYFVT ADFDIREELA
LKTAILAERY SVNDRMWFIE IAMQMIDKAG DFINDDLWHR MVQIATNDAS LHGRTAQLMF
VKLRDEGASN ELMLRAMSYC IGEFGYLLPI PASQYVDLLV PLFQDTDEVT QGIMLTAFVK
VAMHKNCDQA SMGKIVKVFT DMSSSFDVEL QQRANEYLKL LRLGPNMRPI LEPMPEYPER
SSVLEKHIQV ENVASDVAAG VRKLAMSDLL GNLMGDGSSA PAALPPSSTG MNLDELLGNA
PPALPAVEER LALPSSTSPP AGPVTTTSSA DALDDLLGLG ALAATQPPPA THGDALDAFG
ALGAPAPAAP APTQPVAPVQ SLTSSDGIQP TVNVQDCAKR FLIADNGLLY EDANVQIGVK
SQWQGSQGRV MFYVGNKSAS ADLQNFRMVI PSIEGLRHSL QPFPASIGPK RQVQLMLQVA
ITSAFASAPK LEFSYTSTAV AAACARSLEL PVRVTKFLSP MTIASPQEFI AKWHQMASAG
QQQKIMDVSQ QYATSIESVS NAFSGMRLVV HKGLDPNPAN LIAGSRFVGE RCGEVFVGVR
VESDANVRGR YRFTVASMD
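
Both sequences are printed in space-separated blocks of ten characters, so the spacing has to be removed before the sequences are used. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first line of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "CGCGACGCGA CGCGCGCGCG CGGCGACATG GCGCCGTTTC TCGGTGGTAT GCGCGGCCTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2f}")
```

Applied to the complete gene and protein sequences, `len(clean_sequence(...))` should reproduce the 3212 bp and 979 aa lengths listed under Gene Information.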