Gene OSTLU_278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_278 
Symbol 
ID5005208 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp427138 
End bp430304 
Gene Length3167 bp 
Protein Length971 aa 
Translation table 
GC content58% 
IMG OID640420629 
Productpredicted protein 
Protein accessionXP_001421149 
Protein GI145353712 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5096] Vesicle coat complex, various subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.385886 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GGTATGCGCG GCCTGACGGT GTTCGTGCAA GACGTGCGGA ACTGCAGTAA TAAGGTGCGG 
CGAGCGCGCG CGCGCGAGGG ATGACGACGA CGGGCGAGGG CATTCGATCG AACGAAACGA
CGCGCGGTGA CGCGTTGAGC GGCGAGAGCG CGAGAGAGAG GCGCGCGGAC TGACGACGAG
GGTCGCGCGC GTCGCGAACG CGAACGCAGG AGCAAGAGCG CGCGCGCGTG GAGAAGGAAC
TGGCGAATAT TCGACGGAAA TTCAATAAGA CCCACCGCGC GCTCACGGCG TACGAACGGA
AAAAGTACGT GCTGAAGCTG CTGTACATAT ACATGCTCGG GTATAACGTG GACTTTGGAC
ACACCGAGGC GCTGAAGTTG ATATCGGCGT CGTCGTACGC GGAGAAACAG GTTGGGTACA
TGACGACGTC GGTGATATTG AACGAGAGAA ACGAGTTTTT GAGAATGGCC ATCAATAGCA
TACGCACGGA TGTGATCTCG AGCAATGAGA CGAATCAGTG CTTGGGGTTG TCGTGCATCG
CGAACGTCGG GGGGCGGGAG TTCGCGGATT CGTTAGCTGG GGACGTGGAG ACGATTGTGA
TGACGCCGAC GATTCGGCCG GTGGTTCGGA AAAAGGCGGC GCTGTGTCTG TTGAGGTTGT
TTCGTAAGAA TCCTGAAATT TTACTCGCGG AAACGTTCGC GTCAAAAATG ACCGACTTAC
TCGACGCCGA GCGCGATTTG GGGGTGCTCA TGGGCGTCTT GGGTTTGTTG CTGGGTCTCG
TGCAGCACGA TTACCGAGGG TACGAGGCGT GCGTGCCCAA GGTCATCGCG TTGTTGGAAC
GATTGACGAG GAATAAGGAC ATTCCGCCCG AGTATTTGTA CTACGGTATT CCCTCTCCAT
GGTTACAGGT GAAGTGCATG AAGATTTTGC AGTACTTTCC CACACCAGAC GATCAGGCGC
TGCTCGATTC GCAGCTCATC GCCATGCGAA ACATCCTCAC CAAGACGGAC ACGGTGAAAA
ACTTCAATAA GAACAATGCG CTGCACGCCA TCTTGTTCGA GGCGATCAAT TTAGTTACTA
GCATGGACTA CGCGCACGAA CTGTTGGACC CGTGCGTGGA GATTCTCGGG AATTTTCTCG
ACATGAAGGA ACCGAATATT CGCTACTTGG CTCTCAACAC GCTCAACGCC CTCGCGGCGA
TGGCGGATTT GCGAGAAGCC ATAAAGGTGT ACCAAGAGCA AGTCGTGGCT GCGTTGCACG
ACGCGGACAT TTCCATTCGT CGCCGCGCGT TGACTTTATT GTTTTCTATG TGCGATGCTT
CCAACGTGCA CTCTGTCATC GAGGAGCTCA TCAAGTACTT CGTCACCGCT GATTTTGACA
TTCGCGAGGA ACTGGCGCTC AAAACGGCCA TCTTGGCCGA GCGCTACAGC GTGAACGATC
GCATGTGGTT CATTGAGATC GCGATGCAAA TGATAGACAA GGCGGGCGAT TTCATCAACG
ACGACTTGTG GCATCGCATG GTGCAAATCG CAACCAACGA CGCGTCGCTT CACGGTCGCA
CGGCGCAATT GATGTTCGTC AAGTTGCGCG ACGAGGGCGC GTCGAACGAA CTCATGCTTC
GCGCGATGTC GTACTGCATC GGAGAGTTTG GGTATTTGCT TCCCATTCCC GCGTCGCAGT
ACGTCGATCT CTTAGTGCCA CTGTTCCAGG ATACGGATGA GGTCACGCAG GGCATCATGC
TCACAGCCTT CGTCAAGGTT GCGATGCACA AGAATTGCGA TCAGGCGTCG ATGGGTAAGA
TCGTGAAGGT GTTCACCGAC ATGAGCTCAT CGTTTGACGT CGAGTTGCAG CAACGTGCAA
ACGAATATCT GAAGCTCTTG CGTCTCGGAC CGAACATGCG ACCGATTCTC GAGCCCATGC
CCGAGTACCC TGAACGTTCG AGCGTGTTGG AGAAGCACAT ACAAGTAGAA AACGTCGCCT
CGGACGTCGC CGCGGGAGTT CGTAAACTTG CCATGAGTGG TGTCGTGACG GCGAGAGAGC
AACCCCGCGC TCAGGCGCGG TCGGCGCCGG CGCTTCCTGC AGCCGCAGCA CCGCCGGTCG
ATGCCGTCAC AGATTTGCTC GGCAACTTGA TGGGCGACGG ATCTTCGGCG CCGGCGGCGC
TTCCGCCGTC GTCGACTGGG ATGAATCTCG ACGAGCTTCT CGGAAACGCC CCTCCCGCAC
TTCCAGCGGT AGAAGAGCGT CTCGCACTTC CGAGTTCCAC GTCACCTCCC GCGGGGCCGG
TGACGACCAC ATCGTCCGCA GACGCTTTAG ACGATTTACT AGGCTTAGGC GCGCTCGCGG
CGACGCAACC GCCGCCGGCG ACGCACGGCG ACGCCTTAGA CGCCTTTGGA GCTCTGGGTG
CGCCGGCGCC CGCGGCGCCG GCACCGACGC AACCCGTGGC ACCGGTACAA TCATTGACGT
CGAGCGACGG TATTCAACCC ACGGTGAACG TTCAAGACTG CGCGAAACGG TTCCTCATCG
CCGACAACGG CTTGCTGTAT GAAGACGCGA ACGTACAGAT TGGCGTGAAA TCGCAGTGGC
AAGGGTCTCA AGGTCGCGTG ATGTTCTACG TTGGGAACAA GTCCGCGAGC GCGGATCTGC
AAAACTTCAG AATGGTCATA CCGTCGATCG AAGGCTTGCG TCACAGTCTT CAACCCTTCC
CCGCGTCCAT CGGACCGAAG CGCCAGGTGC AGTTGATGTT GCAAGTAGCG ATTACGTCGG
CGTTTGCCTC GGCGCCAAAA CTCGAGTTTT CGTACACGTC CACCGCCGTC GCGGCGGCGT
GTGCCAGGTC TCTGGAGTTG CCCGTACGTG TGACCAAGTT TTTGAGCCCG ATGACCATCG
CTTCGCCGCA AGAGTTCATC GCCAAGTGGC ACCAGATGGC GTCCGCCGGG CAGCAACAGA
AAATTATGGA CGTGTCGCAG CAGTACGCGA CGAGCATCGA AAGCGTGTCA AACGCCTTCT
CGGGCATGCG GCTCGTCGTA CATAAAGGCT TAGATCCAAA CCCCGCAAAC TTAATCGCGG
GAAGCCGGTT CGTCGGCGAA CGATGCGGTG AAGTCTTTGT GGGCGTTCGC GTGGAGAGCG
ACGCGAACGT GCGCGGACGA TATAGATTCA CCGTCGCTTC GATGGAC
 
Protein sequence
GMRGLTVFVQ DVRNCSNKEQ ERARVEKELA NIRRKFNKTH RALTAYERKK YVLKLLYIYM 
LGYNVDFGHT EALKLISASS YAEKQVGYMT TSVILNERNE FLRMAINSIR TDVISSNETN
QCLGLSCIAN VGGREFADSL AGDVETIVMT PTIRPVVRKK AALCLLRLFR KNPEILLAET
FASKMTDLLD AERDLGVLMG VLGLLLGLVQ HDYRGYEACV PKVIALLERL TRNKDIPPEY
LYYGIPSPWL QVKCMKILQY FPTPDDQALL DSQLIAMRNI LTKTDTVKNF NKNNALHAIL
FEAINLVTSM DYAHELLDPC VEILGNFLDM KEPNIRYLAL NTLNALAAMA DLREAIKVYQ
EQVVAALHDA DISIRRRALT LLFSMCDASN VHSVIEELIK YFVTADFDIR EELALKTAIL
AERYSVNDRM WFIEIAMQMI DKAGDFINDD LWHRMVQIAT NDASLHGRTA QLMFVKLRDE
GASNELMLRA MSYCIGEFGY LLPIPASQYV DLLVPLFQDT DEVTQGIMLT AFVKVAMHKN
CDQASMGKIV KVFTDMSSSF DVELQQRANE YLKLLRLGPN MRPILEPMPE YPERSSVLEK
HIQVENVASD VAAGVRKLAM SGVVTAREQP RAQARSAPAL PAAAAPPLLG NAPPALPAVE
ERLALPSSTS PPAGPVTTTS SADALDDLLG LGALAATQPP PATHGDALDA FGALGAPAPA
APAPTQPVAP VQSLTSSDGI QPTVNVQDCA KRFLIADNGL LYEDANVQIG VKSQWQGSQG
RVMFYVGNKS ASADLQNFRM VIPSIEGLRH SLQPFPASIG PKRQVQLMLQ VAITSAFASA
PKLEFSYTST AVAAACARSL ELPVRVTKFL SPMTIASPQE FIAKWHQMAS AGQQQKIMDV
SQQYATSIES VSNAFSGMRL VVHKGLDPNP ANLIAGSRFV GERCGEVFVG VRVESDANVR
GRYRFTVASM D