Gene OSTLU_32696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32696 
Symbol 
ID5003040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp492247 
End bp493878 
Gene Length1632 bp 
Protein Length531 aa 
Translation table 
GC content51% 
IMG OID640418461 
Productpredicted protein 
Protein accessionXP_001418951 
Protein GI145349045 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.144212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0707375 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAGT CTCTTAGCAT CCAAGTCCCC GCGCGAACGC CCCTGGGCGA ACGCTCAGGG 
AACGAAAACA TCAGCGCGGC GTCGAAATTC CCGTCCAGTA AGCTGAAGTT GGACGAGTTA
TTTTTAAACT GGTTATCCAT GGCGGAGAGC CAAAACTTGG TGTACGAACT GTTGAAGGAT
GCGAAGGCCG GGAAACCGCT GCGACAGCCG AAGAGCGGTG CGATGCACGC AAACGTCGCT
GCAGCGATAG GAACGCCACC GAGGAGCCCG CAGAAAGGGT CGAGGTACGG TACGAGCGCG
TTTTCGCCTA CGAGGCGACC GCTGTCGAGG CAGGCGTCTT TGTTCACTCG AGCGACGAAT
GAGCCACACT CGATTCCGAC GTTTTACAAG CCAGGCGGTG AAGGCTTGAG CGAAGAGGTG
AACGCAGGAA AGGTCGCGTT GGCGGAGCGC ATGTTTGAAA GACACTTGAC TGGGATGAAT
TTAGAGGCAT TCGCGGGAGT GGTTCGAGAC GTCGTCGGAT TGCCGCGATA TTTCGCCAAA
CGTGTGATGA AGCTCGTGGC CGGGGCGAAT GCGGATGTCG TCACGCGTGA GCAATGGTTT
AGTTATTGGA ATTCTACGCT TCGTCGGCAG AAGGATGTGA GTTCAGCCAT GTTTGAAATT
TTGCGACGAC CAAACGCTCG AGCGTTGGAA CACGCAGACT TTACCGAGGT GCTCACGGAG
ATGACGCAAA CGCATCCGGG CTTGGATTTT TTGAAGACCA CTCGCGAATT CCAGGAACGT
TACGTAGAAA CGGTGATTTA TCGTATATTT TACGAGTGCA ACACGACCTG GAACGGGCGT
TTGACGCTGC GAGAGTTGAG AAAATCAGAC TTGCTTGAGC ACATGTTGCT CGCCGAAGAA
GAGGAAGACA TCAACCGCGT GTTGAAGTAC TTTTCGTACG AGCATTTCTA CGTCATCTAC
TGCAAGTTTT GGGAACTCGA TACCGACCAT GACTTTTTCA TCAATCGTGA AGACTTATTG
CATTACGGTA ATCACGCGTT GACGTATAGA ATCGTGGCGA GGATATTTGA CCAGGCGGGG
AGGCCGTTCA AATCAGACGT GCCCGGGAAG ATGAGCTACG AAGACTTTGT TTGGTTCATT
TTGAGTGAAG AAAACAAGAA CCATCCGCTA GCGCTAGATT ACTGGTTCAA ATGCATCGAT
ACTCATCACG ACGGCGTCAT CACGCGAGAT GAGATATACT ACTTTTATGA GGAACAGATT
CAACGCATGG AATGCCTGGC GCAAGAACCC GTCCTGTTCG AGGATATTTT GTGTCAAATG
ATGGATATGC TCAAGCCCGA AGTCGACGCG AGAGTGACTC TGAACGACTT ACGATCGAGC
AAGATGAGTG GCAACTTCTT CAACGTTCTC TTCAACATGA ACAAGTTCAT CGCATTTGAA
ACGAGGGATC CGTTCTTGAT GCGACAAGAG CGCGAAGAAC CGCACTTGAC GGAATGGGAC
CGCTTCGCTC GCGGAGAGTA CCTCCGGCTG AGCATGGAGG AAGACGATGA GATGGACCAC
GCGAGCGATG TGGTGTGGGA AGAATCCCCT ATATGAATAA TGATTAGAGA TACTGTACAC
TGTAACGATA GC
 
Protein sequence
MSKSLSIQVP ARTPLGERSG NENISAASKF PSSKLKLDEL FLNWLSMAES QNLVYELLKD 
AKAGKPLRQP KSGAMHANVA AAIGTPPRSP QKGSRYGTSA FSPTRRPLSR QASLFTRATN
EPHSIPTFYK PGGEGLSEEV NAGKVALAER MFERHLTGMN LEAFAGVVRD VVGLPRYFAK
RVMKLVAGAN ADVVTREQWF SYWNSTLRRQ KDVSSAMFEI LRRPNARALE HADFTEVLTE
MTQTHPGLDF LKTTREFQER YVETVIYRIF YECNTTWNGR LTLRELRKSD LLEHMLLAEE
EEDINRVLKY FSYEHFYVIY CKFWELDTDH DFFINREDLL HYGNHALTYR IVARIFDQAG
RPFKSDVPGK MSYEDFVWFI LSEENKNHPL ALDYWFKCID THHDGVITRD EIYYFYEEQI
QRMECLAQEP VLFEDILCQM MDMLKPEVDA RVTLNDLRSS KMSGNFFNVL FNMNKFIAFE
TRDPFLMRQE REEPHLTEWD RFARGEYLRL SMEEDDEMDH ASDVVWEESP I