Gene OSTLU_16541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16541 
Symbol 
ID5003110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp546961 
End bp549273 
Gene Length2313 bp 
Protein Length770 aa 
Translation table 
GC content59% 
IMG OID640418531 
Productpredicted protein 
Protein accessionXP_001419416 
Protein GI145350007 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5047] Vesicle coat complex COPII, subunit SEC23 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.700638 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.499516 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTGGA ACGTGGTGGC GAATTCGCGA GCCGAGATGA CGAAGTGCGT CGTGCCGTTC 
GGCGCGGTCG TCACGCCGCT GCGAGAGATC GCGTCGGAGG TCGCGCCGCG CGTGCCGTAC
GAGTGCGTGC GGTGCAAGGG GTGCGGGGGG GGGCTGAACC CGTACGCGAG GGTGGATTTC
GCGAGCAAGA TCTGGGTGTG CTGCCTGTGC CACGCGAGGA ATCACTTTCC GCCGCACTAC
AACGCGCTGA GCGAGACGAA TCTGCCGGCG GAGCTGTTCC CGAGCTACAC GACGATAGAG
TACGCGGTGC CGCGAGCGGG ACAGGGGACG GGATGCGGAC CGGGGTATCT GTTCGTCGTC
GACGCGTGCG CGGAGGAGGA GGAGTTGACG GCGCTGAAGC AGGCGCTGAC GCAGGCGCTG
AGTCTGTTGC CGGAGGACGC GCGGGTTGGG TTGGTGACGT TCGGGACGCA CGTGCACGTG
CACGAGTTGG GGTTCGCGGA TTGCCCGAAA TCGTACGTGT TTCGGGGAAA TAAGGAGTTT
ACGCAACAGC AGATTAAGGA CCAGTTGACG CTCGGGGGCG GGCACCGGCA GACGAACGGT
CGAGCTCCGG GGATGGCGGG CGCGCAGCCG GGGATGAACG GCGCACACCC GGGAGCGATG
CCGACGAGCA AGTTCTTGGT GCCGCTGAGC GAGTGTGAAT TTCAGTTTAC GGCGATTTTG
GAGGAATTGC AGCGAGACGC GTTCGCGCCG TTGCCGTCGT GCCGACGGTC GCGATGCACG
GGCACGGCGC TCATGGTGGC GTCTTGTCTG TTGTCCACGT CCGCGATCGG GATGAACGCG
AGGGCGATGT TATTCACCGG CGGCGCGGCT ACGGATGGAG GTGGAACGAT CGTGGCGAAA
GATATGGAGC AGGCGGTGCG ATCGCACAAG GACATAGTCA AGGGCGCGGC GCCGTTTTAT
CACAAGGCCA AAAAGTATTA CGAACAGGTG GCGATCAACC TGTGCGCGAA TGGACACGTG
TTGGACGTGT TCGCGTGCGC GCTCGATCAA GTCGGACTGG CGGAGATGAA AGTTTGCGTG
GAAAAGACTG GCGGGAACGT CGTGCTGGCT GAGTCGTTTT CGCACACCGT GTTCAAGACG
TCGTTTCAAA AACTTTTCGC CCCAGACGCT GAGGGTGGAT TGGGAATCGC GTACAACGGA
CAGTTCGAAG TAATCACGAG CCGCGACGTG AGGACAGCGG GGGTGATCGG ACCATGCGCT
GCGCTCGATC GAAAAGGTTT GCCAGGCGCG ACGAGCGACA CGCCCATCGG TAGCGGCGGA
ACGACGGCGT GGAAGCTTTG CACACTCACG AACGAAACGT CTCTGGCGGT GTACTTTGAT
GTTGCGAATC CGGGTGGGAA AGATCAACAG CCGATGGCCA TGCAAGGGCA GCAAGCGCAA
CAGTTTTACC TGCAATTTCT GTGCACGTAC ACGCTCCCCA GCGGCGAGAC GCGCATGCGC
GTCATCACGA CGAGTCGTCG TTGGACCGAA GGACAAAATT TGAACGACAT CGCCGCCGGG
TTCGATCAAG AAGCCGCAGC CGTGCTCGTC GCTAGGCAGT TGTCTTGGAA GATGGAGACT
GAGGAAGAAA CGGATTGCCC CGCGGCGACG AGATGGCTCG ATCGCAAGCT CATCCAGCTG
TGCCAGCGCT TTGGCGATTA CAGAAAGGAC GATCCGCATT CGTTTCAGCT CATGCCGCAA
TTTAGCATTT ACCCGCAGTT CATGTTCAAC TTGCGACGAT CGCAGTTCGT GCAAGTGTTC
AACAATTCGC CCGACGAGAC GGCGTATTTC CGCATGATTT TGATGCGTGA AAACGTCTTC
AATTCGTTGG TGATGATTCA ACCCACGTTG ACGGCGTACT CCTTCAACGG TCCTCCCGAG
CCCGTATTGC TGGACGTGTG CTCGATCGCC GCGGACAAAA TCTTGGTCCT CGACGCTTAC
TTCAGTGTTG TCGTTTTCCA CGGTATGACG ATCGCGCAGT GGCGAAAGGC GAACTATCAA
GATCAGCCAG AGCACGTGGC GTTCAAAGAC TTGCTCGCGG CGCCCAAGGC GGAGGCGGAA
CAAATAATCG CCACTAGATT CCCCGTGCCG CGTTTGGTTG ATTGCGATCA GCACGGCTCT
CAGGCGCGCT TTTTGTTGGC CAAATTGAAT CCGAGCGCGA CGTACAACTC GAGCGCGACG
ATGGGTGGCG GATCGGACAT CATTTTCACG GATGACGTGA GTCTGCAAGT GTTCATGGAT
CATTTGAAGC GCTTAGCGGT TACCACTAAC TGA
 
Protein sequence
MSWNVVANSR AEMTKCVVPF GAVVTPLREI ASEVAPRVPY ECVRCKGCGG GLNPYARVDF 
ASKIWVCCLC HARNHFPPHY NALSETNLPA ELFPSYTTIE YAVPRAGQGT GCGPGYLFVV
DACAEEEELT ALKQALTQAL SLLPEDARVG LVTFGTHVHV HELGFADCPK SYVFRGNKEF
TQQQIKDQLT LGGGHRQTNG RAPGMAGAQP GMNGAHPGAM PTSKFLVPLS ECEFQFTAIL
EELQRDAFAP LPSCRRSRCT GTALMVASCL LSTSAIGMNA RAMLFTGGAA TDGGGTIVAK
DMEQAVRSHK DIVKGAAPFY HKAKKYYEQV AINLCANGHV LDVFACALDQ VGLAEMKVCV
EKTGGNVVLA ESFSHTVFKT SFQKLFAPDA EGGLGIAYNG QFEVITSRDV RTAGVIGPCA
ALDRKGLPGA TSDTPIGSGG TTAWKLCTLT NETSLAVYFD VANPGGKDQQ PMAMQGQQAQ
QFYLQFLCTY TLPSGETRMR VITTSRRWTE GQNLNDIAAG FDQEAAAVLV ARQLSWKMET
EEETDCPAAT RWLDRKLIQL CQRFGDYRKD DPHSFQLMPQ FSIYPQFMFN LRRSQFVQVF
NNSPDETAYF RMILMRENVF NSLVMIQPTL TAYSFNGPPE PVLLDVCSIA ADKILVLDAY
FSVVVFHGMT IAQWRKANYQ DQPEHVAFKD LLAAPKAEAE QIIATRFPVP RLVDCDQHGS
QARFLLAKLN PSATYNSSAT MGGGSDIIFT DDVSLQVFMD HLKRLAVTTN