Gene OSTLU_32784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32784 
Symbol 
ID5002838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp633362 
End bp634417 
Gene Length1056 bp 
Protein Length351 aa 
Translation table 
GC content62% 
IMG OID640418259 
Productpredicted protein 
Protein accessionXP_001418758 
Protein GI145348648 
COG category[R] General function prediction only 
COG ID[COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0879664 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCGG TCGCGCGAAG CGCGCAGGGC ACGCCGCTCG ACGGCGCGCC GCTCGCGACG 
ACGGCCGTCG TGATCGCGGT CGCGATATCT TTCGCGCTCG CGCTCGCCGC GAACGGCGAT
TTCGCGCGCG TGTGCGCGTC GCCGCGGTTG GCGTTTGAAC ATCCGCTGTC GAGCTATCAT
CGCGTCTGGA CGTCGACGTT CTCGCACGGG AGCTTCCCGC ACGCGCTGCT AAATTGTCTA
GCGTTCGTTC CGATGGCGTC GGCGCTCGAG CGATCGATCG GGACGACGCA CTTCGCGTGG
TTATTCGCGA CGTTCGCGCA CGCGGCGTAC GCGCTCTCGG CGAGCGCGGC GACGGCGCTT
TGGATGGCGC TCGGATATCG CGCGTCGTAC GAGAGCTGCG CCATAGGAAT GTCTGGGGTG
GTGTTCGCGC TGATCGTGTG CGAGACGAAC GTGAACGACG TGGAGCGGCG AAGCGTGTTC
GGGTTGTTCA CAGTGTCGAG CGAGTATTAC CCGATCGCTC TTCTGCTTTT CATTCAACTT
TTGATGCCTG GCGTCTCTTT CATCGGTCAC GCGGGTGGTA TCGCGGCTGG ATGGTTGTAC
GTTCGCGGAT ACTTGAACTT TTTGCTCCTG AAGGAGACGC ACGTGGAGTA TTTAGAGAAA
TTAGCGATTT GCGCGCCCGC GCGCGCCCTG GCGTCGTTCG TGCCGTCAAA CGCCGATCGG
GGCGCGCGGC CGAACGCGGA GGCGAGTTCG ACCGCATTTC CCGCGTTTTC GACGGTTAGA
GCGATCCCGA CTCGTATGAG TGAGGTGACG CGCAACGCGT TCGCGGGTAA TTTCCCGGGC
CAAGGACGGA AACTTGGCGG CGACGGGTCC ACGACGGGTG AGATGGCGAA TCTCGTTCGG
GTCGATCCTC GGGCGTTGGA TACGTTAGTA GAGCTGGGAT TCGCAGAACA CGCCGCGCGG
CGAGCGTTGC AAGAATGTGA CGGCGACTCG CAGCGTGCGA TCGAGTTGTT GACGGAATCA
GCGGCGCACG ACGCGAATAG CGACGAAATA GTTTAG
 
Protein sequence
MSAVARSAQG TPLDGAPLAT TAVVIAVAIS FALALAANGD FARVCASPRL AFEHPLSSYH 
RVWTSTFSHG SFPHALLNCL AFVPMASALE RSIGTTHFAW LFATFAHAAY ALSASAATAL
WMALGYRASY ESCAIGMSGV VFALIVCETN VNDVERRSVF GLFTVSSEYY PIALLLFIQL
LMPGVSFIGH AGGIAAGWLY VRGYLNFLLL KETHVEYLEK LAICAPARAL ASFVPSNADR
GARPNAEASS TAFPAFSTVR AIPTRMSEVT RNAFAGNFPG QGRKLGGDGS TTGEMANLVR
VDPRALDTLV ELGFAEHAAR RALQECDGDS QRAIELLTES AAHDANSDEI V