Gene OSTLU_93535 details

Gene Information

Locus tag: OSTLU_93535
Symbol:
ID: 5004758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009366
Strand:
Start bp: 264291
End bp: 265598
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table:
GC content: 61%
IMG OID: 640420179
Product: predicted protein
Protein accession: XP_001420640
Protein GI: 145352625
COG category:
COG ID:
TIGRFAM ID:
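
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length; the record itself states neither convention. All variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 264291
end_bp = 265598
gene_length_bp = 1308
protein_length_aa = 435

# Assumption: start and end are 1-based, inclusive coordinates.
span_bp = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends with one stop codon that is not
# counted in the protein length.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinate span vs. Gene Length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop vs. Protein Length
```

Under these assumptions all three checks hold: 265598 - 264291 + 1 = 1308 bp, and 1308 / 3 - 1 = 435 aa.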


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.742776
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCGAC GCGAGCGCGC GTTGTGGTCG TTAGAACACG CGCTGTCGAG CCCCGAAATC 
CCGCTCGACG TCAAGGTGGA GGTGAAAAAC GCGTGCGCGG CGTCGCTCGA CGACGAGACG
CTGGCGTCGT GCGATTCGCT CGTCGTGCGA TTGTTTGACG TTTTATCCCG CGCGCGCGAT
GCGTTGATCG CCGGTGAGGG CGCGATGCGA GGAAACAGCG CGCGCGCGTG CGCGGACGCG
CTGCACGCGG TGACGACGCG ACGCGACGCG CCGGAAACGC TGAAAAGGAG AGGGAAAGAC
GTCGTCGAGG CGCTGCGCGG CGAGTGGTGC TTCGCGGCGG CGAGAGAGAC GCGGAACGAG
GGAGGGACGA GCCCGGTGCG AGGATGGAAA GCGATGGAGA ATTTTTGGAT GCGGTGGCGA
ACGCTGACGA TCGGTGACGG AGGGCTGGGC GATGGGAGCG CGTTGGGAAC GAAGGCGAAG
AGGACGTACG ACATGGTCGA GCGCGCGGAG AGCGATATGC GGTTGCCGAG CGACTGGGAG
AGTCGCGTGC CCACGGCGGA GGATATAGCG CAAGATTTAC AAACGTTGTT CACAGATCTG
TGCGAAAGCG GTGAATTGAT GAGTTTCTTG AGCGTCGTGA GGAACGATAT CTCGAGCGGG
AGGTACAAGG TGCAGGTGAG AGGAGACGGG CGCGTGGCGC GGCAATCGGG CGCGATGACG
GATTCGATTG GACTCGAGCC GCACCTCGAG GGATTGGGTC GAAAGCTCAG GGCGGAGGCG
ACGAGACACC GACAAGAACC GCCGTTAGGT GCTCTAACCG CCGTGCCGGA AAGCTTTTCC
TCGCCGCCGA AGCGCGTACA CATAGCGGCT CAACGAAAAA AGAACGCGCG AACCGTGGAG
TGGGATTCGC AAAACGACCA ACCAGACGAC AACGACAAAG AAGAAGAAGA CAAAGATGAA
CACGAAGATA TCGCGCCCAC ACAGCAAACC CCTCGCGCAA AGCCATCACC ATTAACCAAT
TTATCTTGGG AAGAGGTCGG ATACCGCACG CCACGCGCGA ATCTACCCGC CATTTCGCCG
GCGCCCAGGG CTGCGCTGAA ATCGCCGTCG ACGCGCAAGA AGCTTCACGT GAAGGTGAAG
TGGACGGACG CCGAAGTCAC GTGTTTGCAC CTCGGCGTGC AAAAGTACGG CATTGGGAAT
TGGGCGAAGA TTTTGAACGA TCCGACGCTC ACCAACGGCT TTCACACGTC TCGCACCGGC
GTGCATTTGA AGGACAAGTG GCGCACGATA CAACGACAAG CGCGTTGA
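
The gene sequence is displayed in space-separated blocks of 10 bases, so the blocks have to be joined before any length or composition calculation. The following is a minimal sketch of how a GC percentage could be computed from such text. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the method used to derive the record's GC content value is not stated in the source.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for the 10-base display blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGCAGCGAC GCGAGCGCGC GTTGTGGTCG TTAGAACACG CGCTGTCGAG CCCCGAAATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence text to `clean_sequence` and then to `gc_percent` would give a figure that can be compared with the GC content field.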
 
Protein sequence
MQRRERALWS LEHALSSPEI PLDVKVEVKN ACAASLDDET LASCDSLVVR LFDVLSRARD 
ALIAGEGAMR GNSARACADA LHAVTTRRDA PETLKRRGKD VVEALRGEWC FAAARETRNE
GGTSPVRGWK AMENFWMRWR TLTIGDGGLG DGSALGTKAK RTYDMVERAE SDMRLPSDWE
SRVPTAEDIA QDLQTLFTDL CESGELMSFL SVVRNDISSG RYKVQVRGDG RVARQSGAMT
DSIGLEPHLE GLGRKLRAEA TRHRQEPPLG ALTAVPESFS SPPKRVHIAA QRKKNARTVE
WDSQNDQPDD NDKEEEDKDE HEDIAPTQQT PRAKPSPLTN LSWEEVGYRT PRANLPAISP
APRAALKSPS TRKKLHVKVK WTDAEVTCLH LGVQKYGIGN WAKILNDPTL TNGFHTSRTG
VHLKDKWRTI QRQAR
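
The protein sequence can be related back to the gene sequence by codon-by-codon translation. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field in the record is blank; the `translate` helper is hypothetical and is not the database's own method.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1); the record
# leaves the "Translation table" field blank.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base display blocks) is removed first. Codons with
    characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three displayed blocks of the gene sequence from the record.
first_blocks = "ATGCAGCGAC GCGAGCGCGC GTTGTGGTCG"
print(translate(first_blocks))  # MQRRERALWS
```

The output matches the first 10 residues of the protein sequence listed above.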