Gene OSTLU_33075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33075 
Symbol 
ID5003241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp353341 
End bp354433 
Gene Length1093 bp 
Protein Length338 aa 
Translation table 
GC content61% 
IMG OID640418662 
Productpredicted protein 
Protein accessionXP_001419353 
Protein GI145349877 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TCGTTCGCGA CCCTTCGCGC GCGCGCCGCG AATCCTCGAT CGCTCGGATC GCGAGGAAAA 
TCGCGAACCG CGACCGATGC CGCGCCGCGA GTCGTCGCGC GACGCGCGCC TTCGTCGCGC
GCGCGTCGCG TTCGCGCCAT CGCGCGTCGT CGTCATCGTC GTCATCGCCG CGCTGACGCT
GCTGCGTCAC GCGCGACGGG TCGATGCGGG CGGATTCGGA CGAAGCTCGC TGTACAGTGA
ACTGCAACGC ACGTACGGAC TCGGCGGTCG CGAAGCCGAC GCCGAACACG ACGTCGTCCC
GGACCCGAAC GCGCGCGAAG CCGAACCGGA ACCCGAGGAA GAGAAGCCGG CGAACCTGCT
GCACGTGGGC CATCGCATGG ATCACGAGGA GCTGCGAAAT CAATTTGAAA AATTTCCGGA
TCATCACGAC ATAAACGCGC CGTTGCACCG AAGTGGGTTG ACGGCGCTGA AACACGCGAT
ATTGGTCGGG AACGATACGT ACGTGCGGAC GGTGATCGAA CTCGGGGCGG ATACCAACGC
GAAACTGTAC GGAGGCAAGG CGATTCACTT CAACGCGGGG TCGTGCGGAC AGCGAGGAAG
CGTGCCGATT TTGCACGCGC TGTTGGAACA CGGGGCGGAT CCGATCGAGG AAACGGATCA
AGGGTACCAA CCGATTCACA TCGCGGCGCG AGGGTGGAAG AAGTATTGCA TTCCTTACAT
GCAGACGTTA CTCGAGGCGG AAGCCGTAGA TCCAAACGCG CGCCGGAGCA CGGGCAACTT
GACAACGCCG CTGCACGAGG CGGCGCATAA ATCCTCCTTC GAAATGGTGA AGGTGTTGAT
CGATGCCGGT GCGGACGTGA ACGCGCAAGA CGCGGACGGT GAAACGCCGC TCCACAAGGC
GATTCGCGGC GATCACATCC ACGCGGCGTG GGGACTGATG TCGAACGGCG CGGACATCTT
CATCAAGAAC AAGCGGGGTA TCGACGTGGA AGCCTTGGCA AATCTGATGC GCGCCGACTA
CGACACGAAG CAAATGTTCA AGCAGCACAA AGAAAGCCTG GAGAAGAAGA AGGCGGTGAA
AGACGAACTT TGA
 
Protein sequence
MPRRESSRDA RLRRARVAFA PSRVVVIVVI AALTLLRHAR RVDAGGFGRS SLYSELQRTY 
GLGGREADAE HDVVPDPNAR EAEPEPEEEK PANLLHVGHR MDHEELRNQF EKFPDHHDIN
APLHRSGLTA LKHAILVGND TYVRTVIELG ADTNAKLYGG KAIHFNAGSC GQRGSVPILH
ALLEHGADPI EETDQGYQPI HIAARGWKKY CIPYMQTLLE AEAVDPNARR STGNLTTPLH
EAAHKSSFEM VKVLIDAGAD VNAQDADGET PLHKAIRGDH IHAAWGLMSN GADIFIKNKR
GIDVEALANL MRADYDTKQM FKQHKESLEK KKAVKDEL