Gene OSTLU_33543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33543 
Symbol 
ID5003576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp384565 
End bp386921 
Gene Length2357 bp 
Protein Length771 aa 
Translation table 
GC content57% 
IMG OID640418997 
Productpredicted protein 
Protein accessionXP_001419581 
Protein GI145350370 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5047] Vesicle coat complex COPII, subunit SEC23 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0336143 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.164519 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTCA ACGAATTGGA ACGACGCGAG GGCGTGCGGT TGTCGTGGAA CGCGTGGCCG 
TGCTCGCGGA TCGAAGCCAC GCGCGTGGTG CTGCCGATCG GCGCGCTCGT GACGCCGGGA
CGCGATCTCG GGGACGCCGC GCCGACGCTG CCGTACGAAC CGGTGGTGTG CGAGGGGTGC
CAGGCGGCGT TGAATCCGTA CTGCTCGGTG GATTACTACG GCAAGACGTG GCGATGCTCG
CTGTGCGATG GGCTGAATAA ATTGCCGAGG AATTACGAAC AGATCAGCGA AAATAACCTG
CCGGCGGAGC TGTTTCCGAC GTACACGAGC GTGGAGTACA CGATGACGAA TAAGACGCCG
ACGGCGCCGT GCTTCGTGCT GTGCGTGGAT TGCGCGTGCG GGAGCGCGGA GGAGTTGCAG
GACGCGAAGG ACAGCGTGAT GCAGTTGGTG AGCTTGTTGC CGGAGGATGC GTTCGTGGGG
TTGGTGACGT TCGGGTCGAC GGTGCGCGTG CACGAGTTGG CGGAGACGAA CGGGGCGATG
CGGCGATCGT ACGTGTTTCG CGGGACGAAG GATTGCGAGC AGGAAGATTT GCGCAAGATG
TTAGGGCTGG ATTTCAATCG TCAGCGCGCG GGGATGAATG GAATGAATGG AATGAACGGC
GCTGCGGCGA TGATGCCGAA TGGCGTGGAC GCCAAGCCGG TGAGACGATT CGTCGCGCCC
ATCAGCGAGT GCGAGTTCAC GCTGCAGAGC GTCCTCGACG AGGTGACACT GGACGAGGAG
AAGACAGAGC GCGGGAAGCG CGCGCTACGG GCGATGGGTG CGGCTATCAG CATCGCCGCC
GGACTTTTAG CCGAATCGCA CTCGTCGCAA GGGGGTCGTG TGCTCACGTT CACCTCCGGG
CCGTGCACCG TAGGGCCTGG CGCCGTCGTG GGGACGGATA TGAGCGAGAA TTTGCGCTCG
CACCAAGACT TGGAAAAGAA CGCGGCTAAG CATTACAAGG AGGCGTGTAA AGTTTATAAC
GCGATCGGAA TTCGACTAGC AACGAATTCT CATACATTGG ACGTCTTCGC GTGCTCTCTG
GATCAAGTCG GACTGGCGGA GATGAAAATA GCGGTGGATC AAACAGGCGG GAACATGATT
CTCGCCGAGC AGTTCCGCGC AGAGACTTTC AGGCAATCAT TAGCGAAGAT GTTTGCGAGA
GATCCGAAAA CTGGTGCCTT GGAGATGAAG TTCAACGGAA CTTTTAGCGT GTTCTGCACG
CCGCAAATCA TGGTGTGCGG CGCCATCGGG CCAATTAGCG CTTTGGCGGT CAAGTCGCAG
CGAATTAGTG AAAACGAAAT CGGTTTGGGA CAAACCACAT CGTGGCGCAT GTGTTCGTTC
ACGCCCACTT CAACCATAGC GGTCTACTAC GAAGTCGTCA ACCAGCACAG CAATCCGATC
CCCAACGGGC AACCATTCTT TCTGCAGTTT TGCACGAGAT TTAAGACGAG TGACGGGCAG
ATTCGCCTGC GAGTCACCAC CGTGGCGAGA CGCTGGGTCG AAAGTAGTTT GGCGCCCGAA
ATCGTGGGCG GGTTCGACCA AGAGGCGTGC GCTGTACTCA TGGCGAGAAT TGCAACGTTT
AGAACTGAGA ATGAAGAGTC CTTCGACTTG TTGAGGTGGC TCGATCGCAC ACTCATTCGC
GTTGGCGCAA AGTTTGGGGA ATATCAGCGA GACGCGCCGG ATAGCTTCCG CATGCCACCG
AGCATGTCCA TCTATCCACA GTTCATTTTC CACTTGCGCC GCTCACAGTT CTTACAGACC
GCGAATAACT CACCGGATGA AACAGCGTTT TATCGCATCA TGCTCTCTCG CGAGACGGTG
ACAAACTCGT TGGTGATGAT TCAACCGACG CTGTTGAGTT ATTCCTTCAA CGGCCCACCC
CAACCGGTGC TTTTGGACGT CAGCGCAATC ACGCCCGATA CGATCTTGCT GTTAGACTCA
TACTTCTTGA TAGTCGCACA TAGAGGCAGC ACGATCGCGG CGTGGCACAA AGCCGGATAC
CAGGATCAAC CAGAACACGA GGCGTTCCGC GCCCTGCTCG CGGCGCCGGT ACGCGACGCG
AAAGCTTTGG CGGCAGATAG ATGTCCGACG CCGCGTCTCG TGGAGTGCAA TCAAGGGGGG
TCGCAGGCGA GGTTCTTGCT CGCCAAGTTG AACCCTTCGG CAACGCACAA CACCGACCTC
GGGTACGGTC AAAGTGGTGG AGAGATCATC TTCACCGATG ATATAAGCAT GAACGTATTC
GTCGACCATT TAGCCAAGCT CGCAGTTAGT TCGTAGATCA TATCATCAAT GAATGCGCAA
GTACATTAAT CACTACA
 
Protein sequence
MDFNELERRE GVRLSWNAWP CSRIEATRVV LPIGALVTPG RDLGDAAPTL PYEPVVCEGC 
QAALNPYCSV DYYGKTWRCS LCDGLNKLPR NYEQISENNL PAELFPTYTS VEYTMTNKTP
TAPCFVLCVD CACGSAEELQ DAKDSVMQLV SLLPEDAFVG LVTFGSTVRV HELAETNGAM
RRSYVFRGTK DCEQEDLRKM LGLDFNRQRA GMNGMNGMNG AAAMMPNGVD AKPVRRFVAP
ISECEFTLQS VLDEVTLDEE KTERGKRALR AMGAAISIAA GLLAESHSSQ GGRVLTFTSG
PCTVGPGAVV GTDMSENLRS HQDLEKNAAK HYKEACKVYN AIGIRLATNS HTLDVFACSL
DQVGLAEMKI AVDQTGGNMI LAEQFRAETF RQSLAKMFAR DPKTGALEMK FNGTFSVFCT
PQIMVCGAIG PISALAVKSQ RISENEIGLG QTTSWRMCSF TPTSTIAVYY EVVNQHSNPI
PNGQPFFLQF CTRFKTSDGQ IRLRVTTVAR RWVESSLAPE IVGGFDQEAC AVLMARIATF
RTENEESFDL LRWLDRTLIR VGAKFGEYQR DAPDSFRMPP SMSIYPQFIF HLRRSQFLQT
ANNSPDETAF YRIMLSRETV TNSLVMIQPT LLSYSFNGPP QPVLLDVSAI TPDTILLLDS
YFLIVAHRGS TIAAWHKAGY QDQPEHEAFR ALLAAPVRDA KALAADRCPT PRLVECNQGG
SQARFLLAKL NPSATHNTDL GYGQSGGEII FTDDISMNVF VDHLAKLAVS S