Gene OSTLU_33299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33299 
Symbol 
ID5003503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp2158 
End bp3621 
Gene Length1464 bp 
Protein Length486 aa 
Translation table 
GC content61% 
IMG OID640418924 
Productpredicted protein 
Protein accessionXP_001419670 
Protein GI145350558 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GCGATGCCAC CGCTTGAGGA TATCAAAAAT GTGTTACTCG ACGCGCTCAC GGAGATCGAG 
AGCACGGCGC GCGACGTTCT CGCGAGTCTC GACGAACGTC GTCTCGTTCG GGTGTCGAAG
AAGCTCCCCG GCGGCGACGT CGAATGCGGA GCCGCGAACG CGCTGTTTCA CGCCCTACGC
GCGCAGAGCG GTCGTTATGC GTCCACCACG GTGACGTGCG TCTCGCCCAT GGGCGTCGAA
CGTGAACACA CGGCCAAATC GCTCGCCACG GCGGCGACGG AGCTCATACC AAACGCGTTC
GCGTCAAAGC ATTTCGCCGC CGTGCGCGTG CACGACGGCG GTGACAGCGG GATGATCACC
ATATGCACGT TAGAGCGATA TGCCGAGCTG CGTGACGCGG GATCGGCGAT GTGCGATAAA
TGCGGAAAAT TTATCTCCGG CGGCGAACGC GGTTTGTGGT GGCATCGAAA GACGAGGCAT
AACGATTTAC ACCAGGAAGC CATGGACGCG GTGGAGAGAG AGCGAAACGC GCTCGTGGCG
ATGTCGACCT CGGGATCGAG GTCGGACCTC ACGAACGGAG ACGCGGCGTA TTTGGATAAT
AAGACGAAAA AAGCGTCGCG GGAGGACGAT TTGCGCGAGG CGATGGCGGC GGCCCGCCGC
GGCGACGCCA CTGTCATGGA TGCCCTGATC GCGGCGAAGC GCGTCAAAGC ACTGCCTTTG
CCCGGACTTG AAGCCGCGCG GCGAGGCGAT TTGAATCTCT TACGCTCGCT CGTTTCGCGC
GATGGATGGG ATCCACGCTC GAAGGACGCC GTCGATAAGC ACGGTTCCAA CGCGTTGCTT
TGGGCCGCGG GCGCCGGGCA CGTCGAGTGC GTCGAGTTTC TCGTCGAAAA ATGCTGTATG
AATCCTCAAA CCTCCGTCCA GAGCGGACGG CGCTCGTACG CCGGTCGAAG CGCCTTGCAC
TGGGCGGCGC GAAACGGCCA CGTCGAGGTG GTGGAATATC TGCTTTCGCG CGGCGTCGAT
CCGAACAGCA CCACTGAAGA CGGATCCACC GCTTTCGCGT GGGCTTGTTG GCAAGGTCAT
CTCGCCGTCA TGCGCCAGCT CGTTGAACGC GCCGAGTGCG ATTACAAGTC GTGCAACGAT
TACGGTTGTA ACGTCGCGTG CTGGACCGCC ATGGGCGCCG GTGGCGTCGA GTGTTGCGAA
TATCTCGCCT CACTCGGCGT GCGTTTCAAT TTGATCAACG CCAACGGTCA TAGCTGTTTA
CACAAAGCCG CACAGCGTGG AAATCGAGAC GTGTGCGAGT GGCTCTTAGA TACGCCGAGT
CTGGGTCTAA CGCGAGACCA CGCCCAACCC GACGCGGAGG GATACGATCC GGCGGGTTTA
GCTCTCGTGG AAGGCTTCAA CGACGTCGCC GACTGGCTCA AGGCGCGCCA GCTGGAGCTC
GAGTTCGCAA ACCATAAACC TTAG
 
Protein sequence
MPPLEDIKNV LLDALTEIES TARDVLASLD ERRLVRVSKK LPGGDVECGA ANALFHALRA 
QSGRYASTTV TCVSPMGVER EHTAKSLATA ATELIPNAFA SKHFAAVRVH DGGDSGMITI
CTLERYAELR DAGSAMCDKC GKFISGGERG LWWHRKTRHN DLHQEAMDAV ERERNALVAM
STSGSRSDLT NGDAAYLDNK TKKASREDDL REAMAAARRG DATVMDALIA AKRVKALPLP
GLEAARRGDL NLLRSLVSRD GWDPRSKDAV DKHGSNALLW AAGAGHVECV EFLVEKCCMN
PQTSVQSGRR SYAGRSALHW AARNGHVEVV EYLLSRGVDP NSTTEDGSTA FAWACWQGHL
AVMRQLVERA ECDYKSCNDY GCNVACWTAM GAGGVECCEY LASLGVRFNL INANGHSCLH
KAAQRGNRDV CEWLLDTPSL GLTRDHAQPD AEGYDPAGLA LVEGFNDVAD WLKARQLELE
FANHKP