Gene OSTLU_1302 details


Gene Information

Locus tag: OSTLU_1302
Symbol:
ID: 5006300
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009371
Strand:
Start bp: 243526
End bp: 245646
Gene Length: 2121 bp
Protein Length: 687 aa
Translation table:
GC content: 56%
IMG OID: 640421721
Product: predicted protein
Protein accession: XP_001422243
Protein GI: 145356026
COG category:
COG ID:
TIGRFAM ID:
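
The reported gene length follows from the start and end coordinates on the replicon. The sketch below assumes the coordinates are 1-based and inclusive; the function name `gene_length` is illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
length = gene_length(243526, 245646)
print(length)  # 2121

# Coding bases implied by the protein length (3 bases per residue).
coding_bp = 687 * 3
print(length - coding_bp)  # 60
```

The 60 bp difference between the gene length and the 2061 bp implied by a 687 aa protein is consistent with the record marking the gene as spliced.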


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.12515
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.146083
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGAAAATGCA CGGATATTTT GTTTCTGATC GTTTTCATCG CGTTTTGGAT CGCGATGCTC 
GTGCTGGGCT TTTACGGCGT CGGCACGGGG AAACCGAAGA TTTTGCTCTT CGGCACCGAT
TACCGCGGCG AAGTTTGCGA TCAAGGGAAG AACGATGGGT TCAAGACGCG GTATTTCATG
AACCCGGGCG AACTCGCTTG GGCGGCGGGT GTGCACACCG ATCAATCGGC GAGCGCGACG
CGTAAGTACA ACTTGCGCGA CGCGAAGAGC ATTTGCTTGA AGGAATGCCC GAAGCCGACA
ATGACGGCCA GCTCGGTTGC GTTTGTGTGC GATTACCCCG AAGACGTCAC TATCGCGTCC
CAACCGAACT ACGACAAGAA TCAATGGGTG GCGGACAACT ACGACTACTA CCAGTACTTG
AGCGGCCCCC AAAAAACGTC CTCGCTCAAG TGGATGGGGC CGTGCTACCC CGTGCTCTAC
GAGAGCGTCA ACACGTTCTA CACTTGCCAA CTCTTTGGTG ACGGCGATTC GCAAGGAGAA
ACGAAGGCGG AACGCGAGGA GCCCGAGACA GTAGGAGTAC GTCCGTACGT CGATCCTTTC
GGCACCGGTT ACAGCTCCGG CAATTTGGGG ACTTACACCA AGTTGCTCGG CGACGAAATC
GACAAGGCGC TCTCTGGTCC GCTCGCCACG GTGGAACGCT ACATCGATGA CTTCACCACT
GGTTGGAAGG TTGTCGTCGT CGCCGGTGGC GCCTGTCCGA TCGTTTTGAG CATCGCCTTC
TTGTTCTTCC TCCGATACTT CACCAGCGTC TTCGCGTACA CCACGCTCTT CTCGGTGAAC
GCGCTCGCCG TCGTGGTCAC CATTTACTTG TACCTCAAGG CCGGTGTCAT CGGTTCCGAT
CAAGTCAACG CGTACGTGAG CAAGGTTTCC GATAGTGCGT CGGCTTCTAT CACCAACTAC
GCCGATCCGG CTGAGAGCGG TCAAGACACG CTCAAGATTT TCGCGTACAT CTCGACGGCG
CTCACCTGCG TCATTTTCCT CTTCACCTTG TTGATGCTCC GTCGCGTCAA GGTTGCCGTC
GGCGTCATCA AGGTTGCCAC GGGTGCTCTC GGAAAGATGC CGCAGTTGAC TCTGTTCCCG
ATCTTGCCCG CCGTTGCCAT GGTGTTGCTC TTCGTCTACT GGCTCATCAC CTTCGTCTAC
TTGTTCGCGG CGGGTGAAGT CAAGCAACAA GATTGCACCT TGGCGGCGGG CGAACCGCCG
TACATGTACT GCGCCACTCC TTCGACGACG CCGACTGATG ACTGCCACTG CGGTTACGCG
ACCGTCTGGG ACCGCAACTT GCAAGGCGCT CTCGCGTACT ACGTCTTTGG CTTCCTCTGG
GGCTCGCAGT GGATCGTCGC CATGTGCTAC TTGATCATCG CGTGCGTCTT TGTGCAGTAC
TACTTCAAGG GCGGTAACTA CAACGGCCTT AAGAACAAAC CGATTATCAC CTCGACCAAA
AGGATGATGT GGTACCACAC CGGTACCGCC GCCGTCGGTT CTTTCTTCGT CGCCTTGTTG
CAGTTCATTC GACTCATCGT GCGCTTCATC GTGCACCGCA TGAAAAAGTT GTCCAAGGAC
AGCAAGATTA TCAAGTACGT CGGGTACTAC GTGGAATACT GCTTGTGGTA CTTGCAAAAG
ACGATCGAGT GGTTGAACCG CAACGCGTAC ATCATGACCG CCATCGAAGG CACGTCCTTC
TGCAACTCTG CCTGGAACGC CTTGGCGCTC ATGGTGAAGA ACGTCGCGGC CGTCGCCACA
GTCAACATCG TCGGCGACAT CATGCTCGTC CTCGGCAAGC TGGTCGTCGC CTTGGGCTCT
GGTACGATCG CCTTCTTGAT GCTCGACGCC GACACGTTCA ACTACGGTGA CGAAAAGGTT
TCTTCCCCGC TCTTCATCGT CATCGTCGTC GTCTTGTTTG CCTTCATCAT CGCGAACGTC
TTCATGTCCA TCGTCGAACT CGGTATCGAC ACCATCTTGC TCTGCTACTG CAAGGACTGC
GACGACAACA ACGGCGCTCC GGTCAACGCC CCGCCGGCGC TCGTCAAAAC TCTCGGCATG
TCGAGGAAGA TCGCCAAGAT G
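
The GC content listed in the gene information can be recomputed from a sequence printed in this layout. This is a minimal sketch, assuming the sequence is given as plain text in space-separated blocks of ten bases; `gc_content` is a hypothetical helper, not a function from any library.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in seq; whitespace and case are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence above, used only as sample input.
first_line = "CGAAAATGCA CGGATATTTT GTTTCTGATC GTTTTCATCG CGTTTTGGAT CGCGATGCTC"
print(f"{gc_content(first_line):.1%}")
```

Passing the full gene sequence instead of a single line gives the value to compare against the reported GC content.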
 
Protein sequence
RKCTDILFLI VFIAFWIAML VLGFYGVGTG KPKILLFGTD YRGEVCDQGK NDGFKTRYFM 
NPGELAWAAG VHTDQSASAT RKYNLRDAKS ICLKECPKPT MTASSVAFVC DYPEDVTIAS
QPNYDKNQWV ADNYDYYQYL SGPQKTSSLK WMGPCYPVLY ESVNTFYTCQ LFVRPYVDPF
GTGYSSGNLG TYTKLLGDEI DKALSGPLAT VERYIDDFTT GWKVVVVAGG ACPIVLSIAF
LFFLRYFTSV FAYTTLFSVN ALAVVVTIYL YLKAGVIGSD QVNAYVSKVS DSASASITNY
ADPAESGQDT LKIFAYISTA LTCVIFLFTL LMLRRVKVAV GVIKVATGAL GKMPQLTLFP
ILPAVAMVLL FVYWLITFVY LFAAGEVKQQ DCTLAAGEPP YMYCATPSTT PTDDCHCGYA
TVWDRNLQGA LAYYVFGFLW GSQWIVAMCY LIIACVFVQY YFKGGNYNGL KNKPIITSTK
RMMWYHTGTA AVGSFFVALL QFIRLIVRFI VHRMKKLSKD SKIIKYVGYY VEYCLWYLQK
TIEWLNRNAY IMTAIEGTSF CNSAWNALAL MVKNVAAVAT VNIVGDIMLV LGKLVVALGS
GTIAFLMLDA DTFNYGDEKV SSPLFIVIVV VLFAFIIANV FMSIVELGID TILLCYCKDC
DDNNGAPVNA PPALVKTLGM SRKIAKM