Gene OSTLU_39338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_39338 
Symbol 
ID5004851 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp95499 
End bp97505 
Gene Length2007 bp 
Protein Length630 aa 
Translation table 
GC content52% 
IMG OID640420272 
Productpredicted protein 
Protein accessionXP_001420751 
Protein GI145352857 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5096] Vesicle coat complex, various subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.109278 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0495163 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCAG CGCCCCCTCC GGCGGTGCAG AAGAACAAAT CCAAGGAATT TTACGATTTG 
GTTCGACGTA TAGGTGAGAG AACGCGACGG AGATGTCGCG AAAGCGAGTG AGATTGTCCC
GCTGCGTGCG ATTGTCCCGC GACAATGCAT CGACGACTGA CGCGCGCGCG CGCGCGAAAT
ATCCAAACAG GGGAGTGTAA GAGCAAGACA GATGAAGACG TCATCATGCA GCGCGAGTCG
ATGTACCTTC GAGCGCTGCT ACAGCAGCCC AAGATTGATA AAATGAAGAT CAAGGAAGTC
ATGCTGCGGT TGATGTATCT GGAAATGCTC GGTCACGACG CGTCGTTCGG ACACATACAC
GCGGTGAAAG CGTGCGTGGA GAGCGACATC GCGATAAAAC GAGCGGGGTA CCTGGCGACG
ACGTCGTTTT TAAACGAAGA TCACGATTTG ATCATTTTAA TCGTGAACAC GGTGCAGCAA
GATTTAAAGA GCGATGATTA TTTGGTCGTG TGCGCGGCGT TGACGGCCAT CATGCGGTTG
GTGAACGAGG ATACGGTGCC GGCGGTGTTG CCGCAAGTTA CATCATTGCT CATGCATCCC
GTGGCCCACG TGCGGAAGAA GGCGGTGATG GCACTCATGC GATTTTATCA AAAGAGTCCG
CAGAGTGTGA GTCATTTACA CGGCAAGTTT CGAGAGATGA TTTGCGATAA GGATCCAAGC
GTCATGTCCG CGGCTGTGTG TGCTTTACAC GAACTGGTGG CGCACGATCC CGAACCACAC
AAGAATTTGT CGTCGAGCTT CGTGAGCGTG CTCAAGCAAG TCATCGATCG AAGATTACCA
AAGTCGTACG AATACCACAG GACGCCGGCT CCGTTTGTGC AAATCAAGTT ATTGAAGATA
TTAGCAATCT TAGGCGCTCA TGACAAGACC ACGAGCAGCG AGATGTATAA TGTTTTGGAA
GACACGCTCG CGCGGGCGAC AGACTCTAAG AACCAAATAG GTAACGCTCT GGTGTACGAA
TCGGTGAGGA CAATCACAAG CATCTATCCA AACCCGCAAT TGTTGGCGCA GTGCGCGATG
GTGATATCTC GGTTCATCAA GAGCTCAAAC AACAATTTGA AATATGCTGG CTTGAATACA
CTGGCATGTA TAGTAAACGT CAATCCGCAG TACGCGGCAG AGCATCAGAT GGCGGTCGTG
GACTGCTTGG AAGACTCCGA CGAGACGTTG CGCAAAAAGA CGCTCGACTT GCTCTACAAG
ATGACAAAAC CAAACAACGT GGAGGTGATC GTCGAGCGTA TGTTGGCCTT TTTGAAACGG
GACGGCGACA AATATAGCGA TCAGTACGTG CGAGAGGAGA CGGCTTCACG TGTCGCAGAA
CTCGCGGAGA GATATGCCCC CGACGCAAAG TGGTACGTGG AAGTCATGAC GGAACTCTTT
GAGACGGCGG GCGACGTGGT AAAGCCATCC ATCGGTCAGG GTTTAATGCG TCTATTAGCT
GAAGGCACGG GAGATGATGC TATCGATGAT CTTTCGCGCA AATCTGCCGT TAATGCGTAC
GTGAATTTGC TTCACAAGCC AAAACTTCCT CTCGTCTTGT TGAAGACGAT GGTTTGGGTC
CTCGGCGAGC TCGGGGAACT GAGCGGTCGG AACGCCGAGA CGCTGATGGA CATGCTCGTT
GAAGTCACGG AGAAGCAAAT TCATGGCCCC GCAGTTGAGA CTTTAGTTTT GAGCGCCATA
GCGAAGATAG CACGTCGCGC CAGTGGTGGG TTGAGCCCAA ACGCGCGCGC ATTCGTCGAG
CAAAACGCGA AGAGCAAATT CGTAGAGAAG CAGCAACGTG CGCTCGAAGT CGATGTGCTC
GTGGGTGAGG AGACGCAGAT ACTTTCGGGT GTCATCGCAC CTTCCGCAGT AGATGTCAAC
GTGGATGCAT CGCTGAGTAT GCTGAATCAA TACGTCTCAA ATGCGCTCGC AAACGGTGCA
AAGCCGTACC AGGAAAAGGC GCAACGA
 
Protein sequence
MSSAPPPAVQ KNKSKEFYDL VRRIGECKSK TDEDVIMQRE SMYLRALLQQ PKIDKMKIKE 
VMLRLMYLEM LGHDASFGHI HAVKACVESD IAIKRAGYLA TTSFLNEDHD LIILIVNTVQ
QDLKSDDYLV VCAALTAIMR LVNEDTVPAV LPQVTSLLMH PVAHVRKKAV MALMRFYQKS
PQSVSHLHGK FREMICDKDP SVMSAAVCAL HELVAHDPEP HKNLSSSFVS VLKQVIDRRL
PKSYEYHRTP APFVQIKLLK ILAILGAHDK TTSSEMYNVL EDTLARATDS KNQIGNALVY
ESVRTITSIY PNPQLLAQCA MVISRFIKSS NNNLKYAGLN TLACIVNVNP QYAAEHQMAV
VDCLEDSDET LRKKTLDLLY KMTKPNNVEV IVERMLAFLK RDGDKYSDQY VREETASRVA
ELAERYAPDA KWYVEVMTEL FETAGDVVKP SIGQGLMRLL AEGTGDDAID DLSRKSAVNA
YVNLLHKPKL PLVLLKTMVW VLGELGELSG RNAETLMDML VEVTEKQIHG PAVETLVLSA
IAKIARRASG GLSPNARAFV EQNAKSKFVE KQQRALEVDV LVGEETQILS GVIAPSAVDV
NVDASLSMLN QYVSNALANG AKPYQEKAQR