Gene OSTLU_30347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_30347 
Symbol 
ID5000639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp41687 
End bp43063 
Gene Length1377 bp 
Protein Length458 aa 
Translation table 
GC content61% 
IMG OID640416060 
Productpredicted protein 
Protein accessionXP_001416534 
Protein GI145344014 
COG category[R] General function prediction only
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5391] Phox homology (PX) domain protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.192095 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTCGG AGGAGAACGA TGCGGGCGCG GAGGACGCGC CGCCGCCGGC GTACGAAAGC 
ATTCACTTCG CCGAGGATGC CGACGCGCTG CCGGAGGTGC CGCGGGATGG ATACGTGAAA
ACGAAGACGA TCGAGGTGAG CGTGACGAAT CCGACCAAAG TCGGCGACGG GTTGACGGCG
TACGCGGTGT ACACGGTGAG CACCAAGAAT AAAGATCCGG CGTATAAAAA GGATGAATCC
ATCGTGGTGA GAAGGTATTC CGACTTTCAG TGGTTGCGCG GACGCTTGTC GACGCTGTAC
CCAGGCATCG TGTTGTTTCC GCTGCCGGAG AAGACGGTGA CGACGAATCC GTTTCAGAGC
GATTTCTTGG AGCATCGTCG AAGCGGTTTG GAGGCGTTCA TGAAGAAAGT GGTGGAACAT
CCGGGACTCG GAACGTGCGA AGACGTGGTG ATGTTCTTAG AGGAACAGGG GGGGAGTTCG
TGGGACCAGC GCGCCCCGTG GTATTCGCGC GGCGCGGTTG GCACCGCGCT CGGAGCCGTG
GATTCTTGGT TTCAGTCCAT CGGTACGGCG ACGGAGACGT GGAGCACGGG CGCGGGGATG
GAAAGTGTCA TGATGGAGGA AGATCCCAAA TATCTCGAAG CTACGGAATA TTTGTTGCTC
CTCGAAGAGC GCCTGAAGCG AGCGCTGAAG AGCGGGGGTG AAATCGTCAA CGCGGTGCAG
AATCTCGGCA TTTTAACCGG TACGTTTGGA GAAAACGCGC ATCACTTAGG CGACTGCGAG
GAAAAAGGGG CGAAGATGTT GCTCGGCGAT GAGGCCGGGG GATTAGGGCA AGCGTTCAGA
CAGGTTGGTT CCGCCGCGTG CACGATGCGG GCGCCGACGG AGGCGCAAGC GGAGCGCTTG
GCGAAAGAGT TCCGCGCGCC GTTGAAACAA GCGCTGCAAT ACGTGCGCGC GGCGAAAGAA
TCAATAGACG CGCGCCTCGA CGCGTTGTTG AAGCTGCAAG CGTGCCGCGC CAAGGTGCAG
TCCAAGCGCG CCAAGCTTGA ACACGCGTTG CACGCGCCGC CGCCGCCGCC GCCGCCGCAA
CCGACGACAA TTTTTGAGCG CCTCTCCGCT GCCGTCACAT CCCCGACGCC CGTCACCGTG
GAAGAATTAC AGCGCGATGT CGGCCTCGCT GAATCCGCGG TGAACGACGC GCAAGCAAAG
TACGACGACA TCAAATCTCG CATGACCAAC GAGCTCCCGC GCGTGCACGC CGAATTAGAA
CAAGTCATCA ACGCCGCCTT CGCGAACTGC GCCGTCACCA TGAAGGCGCT CGCCGAGACG
CACGTCGAGG CTTGGGAATC CGTCTTCCCC GGGTGCACCG CCGACGCCGC GGCGTGA
 
Protein sequence
MMSEENDAGA EDAPPPAYES IHFAEDADAL PEVPRDGYVK TKTIEVSVTN PTKVGDGLTA 
YAVYTVSTKN KDPAYKKDES IVVRRYSDFQ WLRGRLSTLY PGIVLFPLPE KTVTTNPFQS
DFLEHRRSGL EAFMKKVVEH PGLGTCEDVV MFLEEQGGSS WDQRAPWYSR GAVGTALGAV
DSWFQSIGTA TETWSTGAGM ESVMMEEDPK YLEATEYLLL LEERLKRALK SGGEIVNAVQ
NLGILTGTFG ENAHHLGDCE EKGAKMLLGD EAGGLGQAFR QVGSAACTMR APTEAQAERL
AKEFRAPLKQ ALQYVRAAKE SIDARLDALL KLQACRAKVQ SKRAKLEHAL HAPPPPPPPQ
PTTIFERLSA AVTSPTPVTV EELQRDVGLA ESAVNDAQAK YDDIKSRMTN ELPRVHAELE
QVINAAFANC AVTMKALAET HVEAWESVFP GCTADAAA