Gene OSTLU_32797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32797 
Symbol 
ID5002791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp654709 
End bp659177 
Gene Length4469 bp 
Protein Length1479 aa 
Translation table 
GC content52% 
IMG OID640418212 
Productpredicted protein 
Protein accessionXP_001418766 
Protein GI145348666 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.480119 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.167383 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCCG CGAGGTACCG TCGCGCGCGT CGACGCGTCG ACGACCGGGA CAGCGGCCGC 
GGCCGTGGTG TCGTCGCGCT GTGCGCGGTC GTCGTCGCGC TCGCGCGGGT GAGAACGAGC
GGTGGTGAAG CGATATGGGG GTTAGAGACC GTGTACGATG GTTTGGGGTA TAATCCAGAG
TTCCCGACGC AGGTTTGCGG CCCGTCCAAG GATGGGAACT CTTTTCGCGC CGCGGGTTGC
AAACCGGCGA TTACAGAGAA GGATTCGTAC GGCACCGCGG GTGATCCGGG ATTTAGTTTC
GCGAGGTACG CGTATAAAGG AAAATCATAC GGGCCACCAT ATAATAATGT AGCGGACGTG
GCGGCGTGTA CGTACTCGTT GACGTACGCC GCGGCGGAGG CGGCGGGCCT GGAGCACTGC
TTTTACGCTC CGTACACGGG TGGCGTGCCG ACGACGACGG TGCAGAAGAC GACGACGAAC
TTGGGATACT CCACTCTAGG TTGGCGTCTA CCCTTGGCTG AATGGCGAGA TAAGTTGCCG
GATTACGTGG TGGAACGATA CGAATTACCG CCGCTTCGGT TGGGCTTCGA GGAGGGGACG
TTTTGGCCGA ATTGGAAGCT ACTCTCGAGC GGTTTGCCAA ACTTTGCCGT GACGAAGACA
TGTTTTCTTG AAGCGCACAG CGGAGACTAT CAGCTGTGTA GCGCCTATCC GACGGAGCGT
CGCGAAGATA TTCACGGCGT CCTTCAAGTG CAAAGCATCT CATTCGCACT CGGGCGTGGG
AGCTTGTTCT TTGAGGCAAA CGGCGGAGAT AAAAACGTCC CAGACATACC TCAAGACCCG
TATCCCATAG CTTCGCACGG GAAAGGTGCG CTTGGAGTTG CGTTAACGAG GATTAGCGAT
GGGTATCGAG TGCTGACAAA ACCTGTGCGC GCGAGAAGAT ATTGGGAAAC CTTACGTTGG
GAGGAGTCTT TACTCGCACA GTATTACGGG GAGGTTTTTA GGTTGGAGAT TTTCGATTAT
CGTTCGGGTT CGTTCGGTTG GCTCGCGGTG GATTCTTTCG TAATTCCGCA GGCGCCTGTT
ATCATCACGG AGGTGACTCC ATCGATTGGA CCTCGAGTCG GTGGCACGAG AATCACTATC
AGCGGGCAAA ACTTTGGAAG TTCCGTTGAC GACAAAACCG TGTTCATCGG CGATAAAGAA
TGCATCGACT TACGCATGAG TTATTCTCGT TGTTCGTCCG ATACCACCGT TTGTGCTGGT
GCGTTGAGCT GTACTACACC AGCAGGTTCC GGGGTAGGTC TCACAGTAGC GGTGGTGATT
GGAGACCCAA CCGTCGTCCG CATCGCAGGG CCGCGAGGTG GGTTCCAAGC GGGTTTTTGC
GGCGATGAAT CGACCGAGCA TCCCTTCTCC GAATGTGCAA TAGATGCAAT CGATGCTGTC
GCTGGAGCGC GAAAGCGTGG CTTTACTTAC GCTGATCCTC CGACGGTGAC AAGTACGCCA
GTCACGAGTG CGATCCAAGA CATTCAGTAC GAGTACACGC TCATTGCTAC GGACCCAGAC
GAGAACGACT TATTGACGTA TACGGCAATT ACGTTACCAA CGTTTTTGCA GTTTGATCCA
ACGACTCGTC TACTTACTGG CGTGCCATTA CGGTCAGATG TACAATGTAG GAGCAACAAT
TGGCACCCAG CTTCTGAGCG CTGCGCGTTG GGAGCAACGT ATCAAGTTGA GTTTGAGATC
TCGGACTCAA TATACAGAGT TCAGCACAAC TTTGAAGTGA AAGTGACGAC GAATGAGACC
TCACTTCTTG TTGATAAGTC CTTTCACTGG GAAAACGCGT TACAAATCTT TGAAAAATAC
GAAGCCGTCG TTGCTTTGGG GTCGGTTACA GAATCGCTCA AAGCCTTGGA AAATAGGAAC
GTGACGAGTC CGACAGAAAT CGTTGGCTTC AATTATAGAA ATCCTTTGAA TGATGAGGAT
AAAACGATCC GTAATGCCCT CATCGCTATT GCAGAGAAGC CGAATATCGA TGAAGTCGAC
GTCGATGCGG CCATTCAGTG GCTGCAAACG AACAACTCTG ACGCCGTAAA CCTTGATATG
ATCCAGGCGC TCAAAGAGCA AGTTGCGGTA CAGAAGCGAG GAGACGTATC CAGCGGTGCT
TCGGCACCAG CGTTGCAAGG CGTCAGCTGG AAAGGCTGGT TAAATTATTT TAAAAAACTA
GGAACGTCGC TCGGCGCTGG ATCTGCCGTT TACACAAATG TTGCTGGGCT CCCCGTTCCA
GCCGGCGCAA CGAAAATGCT TACGCGGGAG AAGATAATTG TCATCAAACT ATCGACTCCT
TGCAACTCGA CCGCGTTGAA CACGGACAGA TATATTCCAT CATGTGACGC GGTTGTGGGT
ATGTCATCAC AGCACGAGGT CTGGCTCGGA TTCAGTGACT ACGGAACAAC GTTGGAGGAT
GTCATCGTGT TCCCCGAGGA TTTTGAGTTT GCGGATCTTG TTACGTCGGC CGTCTTTACA
CAAGACGTTC CAGGTTTTCT CGGTCAATTA ACTAGACGAA TGGTGGCGCA ATCAGCAAAG
CTACAGGAAA TCGCAACATT GGGCGAAACA CACCTTGTAT GGTTCGATAC ATCCACGTCT
ATGCTAACCG TGAACATGAC CATCAAGTCT TCACCGGTGA AAGCAGTCAT GCAACTTGCA
ACTTCGTATC CTTATCCCGA ACCTGGAAAG TATGGATTAC CCGAGATTAG CGTTGTGAGC
TGCGATAACG TGGCGTACGA CTATGAAAAC GCGCTCATTT CATTCAGCAA TACCATGACG
AGAACTGGTG GATCTTTGAC CAGTCGTCTC GCAGCGTTGG CGAACGATGT CTCAACATTT
GCGTCACGGT GCAAAAATTT ATGCAACGGT CAGGGTCTGT GCCAAGATTC GCAACTGCCA
CCCGTTTGTC AGTGTTATCA GGACTACTTC GGCGATGATT GCTCGCAAGT AACGTGTCCG
AATGATTGCA GTAATAATGG CGCCTGCGAT TCGAGGCAAG TATGTACGCA CTTCCCGGAA
AGCGGCGAAA CAGACTGCTC CGGTGGAACT GGAAAGTGTG CGTGTAATTA CCCTTACTTT
GGCAAGGATT GCTCGTTAAA AACATGTGCG AAGAATTTCA TCGTTTTTGA AGGTAATTCG
ACGCACAAGC TCGACGTCAC GGCCGGAACG CAACCTATCA AGGACAAATA CACCGCGCTC
GGTCTCGATG TGTATGACGT GAAGATTAGC GAAACCAATG GTGACCAACC TGCAGCTTGG
GCAATTCAAG GAGACGAGCT TAGCTTGGCG CATCTTGTTG GATCGGCTTA CGTGGAATTT
ACCAGCTCTG AAGACGGAGA AACGGCGAGA TCACTGGAGC CATTTTACGT TGACTCTATC
CCCAGTCGTG GCGTCGCAAT CGAGGCACAA TATTTTGCGC AGGAGTACCG CACTTTCGTA
AAGCGTGAGC AACAGCTACT AGATTTGTTC GGCAAAAGCT CATCAGAGTG CAACGCCGCC
GGTGCTTGTG CCTTTGAAAC CGGTACGTGC TTCTGTGCGA CGCAGTATTA TGGGGCCGGT
TGTGAATTTC AGTACTGCGC AAATGATTGT GCAGGTCACG GTACGTGCAA TAAGCTTACC
GGTGTGTGTA CGTGCGAAAC ACACTACGTG ACAGATGAAA CATATGGATG CGCTTTGAAG
GACTATAGTT TGATTTCAAC CACGTGCACT GACGAAGCAC TTGATAGAGA GGTTGATGAC
GCTGATCTGC GCGTGCGTCC ATTACACGCA TCATGTCTCT TTGGTACAAG ATTGGGTAGC
CCTGTGACTG GCACGACGAC GACTTATATC ACTGACTTAC AAGGTAATGT GTGTTTGGAT
TGCTCTGGTT ATACTCTAGA GTCGAACTCC AGGATTTATT TTTACGAAGA CGAACCATGC
ACTTCGTCTG CTGGGAACCG CGAAGACTTT TGCGACAACA CGAAAGAGTC CTACGACACC
ATCGTTCGCG GCATTGGCAT GATACCCGCT GGAGACAGTG GGTCGACTCT TTCATTCAAC
CTCGCCGCCA TGCGTACATC GGATATTCAG TTTTCTATTT TCAAGGCAAA GGCGGGGATC
GTAAGAGAGC ATTTAGCTGA CATCGATGCG GGCGGTTGTG GTGCATGTAC GGACGATAAT
CCCCAGTGCG GCGCCATATT CACTATCGCC GTCGATGGTG TTACCGTATG GACTTCGTTG
GTGAACAACA AGTACTCCGA GATCAACATC GATATCAGCG CTGCCAGTAC CATGACACTC
TCCACAGCAA AGTACACCCC GCCGTATTGG CGTTCCAACG TCTTGCCTGG TCTCGCAGGA
CAAGACGCAC CTGCCGTTTG GTGCGATGGC GCCGCTTGGG CGGACGCAGA GTTCTATTAG
CTTTTAGCCG AGTATTCACA CGAATGCCA
 
Protein sequence
MRAARYRRAR RRVDDRDSGR GRGVVALCAV VVALARVRTS GGEAIWGLET VYDGLGYNPE 
FPTQVCGPSK DGNSFRAAGC KPAITEKDSY GTAGDPGFSF ARYAYKGKSY GPPYNNVADV
AACTYSLTYA AAEAAGLEHC FYAPYTGGVP TTTVQKTTTN LGYSTLGWRL PLAEWRDKLP
DYVVERYELP PLRLGFEEGT FWPNWKLLSS GLPNFAVTKT CFLEAHSGDY QLCSAYPTER
REDIHGVLQV QSISFALGRG SLFFEANGGD KNVPDIPQDP YPIASHGKGA LGVALTRISD
GYRVLTKPVR ARRYWETLRW EESLLAQYYG EVFRLEIFDY RSGSFGWLAV DSFVIPQAPV
IITEVTPSIG PRVGGTRITI SGQNFGSSVD DKTVFIGDKE CIDLRMSYSR CSSDTTVCAG
ALSCTTPAGS GVGLTVAVVI GDPTVVRIAG PRGGFQAGFC GDESTEHPFS ECAIDAIDAV
AGARKRGFTY ADPPTVTSTP VTSAIQDIQY EYTLIATDPD ENDLLTYTAI TLPTFLQFDP
TTRLLTGVPL RSDVQCRSNN WHPASERCAL GATYQVEFEI SDSIYRVQHN FEVKVTTNET
SLLVDKSFHW ENALQIFEKY EAVVALGSVT ESLKALENRN VTSPTEIVGF NYRNPLNDED
KTIRNALIAI AEKPNIDEVD VDAAIQWLQT NNSDAVNLDM IQALKEQVAV QKRGDVSSGA
SAPALQGVSW KGWLNYFKKL GTSLGAGSAV YTNVAGLPVP AGATKMLTRE KIIVIKLSTP
CNSTALNTDR YIPSCDAVVG MSSQHEVWLG FSDYGTTLED VIVFPEDFEF ADLVTSAVFT
QDVPGFLGQL TRRMVAQSAK LQEIATLGET HLVWFDTSTS MLTVNMTIKS SPVKAVMQLA
TSYPYPEPGK YGLPEISVVS CDNVAYDYEN ALISFSNTMT RTGGSLTSRL AALANDVSTF
ASRCKNLCNG QGLCQDSQLP PVCQCYQDYF GDDCSQVTCP NDCSNNGACD SRQVCTHFPE
SGETDCSGGT GKCACNYPYF GKDCSLKTCA KNFIVFEGNS THKLDVTAGT QPIKDKYTAL
GLDVYDVKIS ETNGDQPAAW AIQGDELSLA HLVGSAYVEF TSSEDGETAR SLEPFYVDSI
PSRGVAIEAQ YFAQEYRTFV KREQQLLDLF GKSSSECNAA GACAFETGTC FCATQYYGAG
CEFQYCANDC AGHGTCNKLT GVCTCETHYV TDETYGCALK DYSLISTTCT DEALDREVDD
ADLRVRPLHA SCLFGTRLGS PVTGTTTTYI TDLQGNVCLD CSGYTLESNS RIYFYEDEPC
TSSAGNREDF CDNTKESYDT IVRGIGMIPA GDSGSTLSFN LAAMRTSDIQ FSIFKAKAGI
VREHLADIDA GGCGACTDDN PQCGAIFTIA VDGVTVWTSL VNNKYSEINI DISAASTMTL
STAKYTPPYW RSNVLPGLAG QDAPAVWCDG AAWADAEFY