Gene OSTLU_32968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32968 
Symbol 
ID5003352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp163505 
End bp166733 
Gene Length3229 bp 
Protein Length1031 aa 
Translation table 
GC content62% 
IMG OID640418773 
Productpredicted protein 
Protein accessionXP_001419089 
Protein GI145349330 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.809445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TCCACGGCGC GTGCGCGTCC GCGCGCGCGC ATCGCGACGC TCGGGACGAT CGGTCGCGTT 
CGCGAGCGCG TTCGCGAGCG CGCGCGGGGC GCGATTCGGT CGAGGCGCGA CGCGAACGCT
TCGGGACGTC GACATGCGGG CGGATGAAGT CGCGCGGTGC CTGGCCGAGA CGCTCTCGCC
CGACGCCGTC GCGCGCGCCG AGGCGCAGCG CGCGATCGAG CGCATGGGAG GCGAACCGGG
GTTCGCGGAG ACGCTGGCGT CGATCGCGCT GCGCGGCGTC GAGGGCGCGG TGGTGGACAT
CAGCACGAGA CAGCTGAGCG CGGTGCTGCT GAAGAAACAC GTGCGAGAAC ACTGGAACGC
GCTGGATGAA AGGTTCGTCG CGCCGGAATT GACGGAAGCG GAGAAGCAAG GGTTGAAAAC
AGTGCTCCCG AAGGGACTGG CGGATGAGTC GAGCAAGATG CGCACGGCGT TCGCGGCGGG
GATCGCGCAG GCGGCGGCGA GCGACGGCGC GATTTGGGAC GAGCTGACGA CGACGTTGGT
GGAAGGAATA CGAGCGAAAC GATCGAGGTC GGAGGTTTTG GGGTGCTTGA AGTGTTATGA
GATTATTGCG GGGGAGATTG ACGCGAAGGA TGTCGCCACG GTGGGGCCGA CGTTGTTTCC
AGAGTTGTTG ACGCTGGCGA GGCACGGAGA GGACGGCGCG CTACGGAGGC GAGCGGAAAC
GGCGTTTTCG TCGACAGTGA GCGCGTTGAC CACGCTCACG GGGACGGAGC AAAAGGAAAT
GCGAGACATG TTGTTACCGT ATTTGCCCAC GTGGTTGGAG ACCGTAGCGA TCGCGTTGGA
GGGAATGCCG AATCCGAACA ATTTCGACGC GTTGGCTTCG ACCATGGCGG CGCTCACGAG
TCTCGCGCTC GCGGTGCAGT ACTTTACCAA GCCCGCGGGC GAGGCGCTGA TGCCGGCTTT
ATCCCGTGGG GCGATAATGT TTCACACCAT AGCGCCAGTT TGGGCGAAAT ACTCAGAGGA
GACGGATCAC TTGGATCCGG GCATGGATAG CGACGGCGAC ACGGTGAGTT TCGAGGCGGT
GGTCACAGAA CTCCTGGAGC TCGTCATCAA CATCGCGGAG CAGCCTAAGT TGAATAAGCT
GCTCGAGCCG AATTTAGCGG ATACGTTGTA CGTCACGATG GGTTACATGA CGATGAGCTC
ATCGCAAGAG GAGATGTGGA TGGATGATCC GAACCAGTTC GTCGCGGACG AGGACGACGA
TTTCGGCAAT GTGCGTGCCG CGTGCGGTCT CATGCTTGAT TCTCTCGGAG AAAGATTCGG
CGTAAAGGCG GTCGCTGCGC TCTGGAACGC ATCAAATAGA CGATTGGCGG AGTCCATATC
GGCGCAACAA ACCGGTGACT CGATGTGGTG GCGACCGCGA GAAGCGGCGC TCCTCGCCGT
GGGCACGATG AACGAGGTCG TCTTGTCGAG CCTGGAGCGC GCGCAAGAAA AGGGCAAACC
CGCACCTTTC GACATCGCCG CTTTCATGAA AACGGTGATT GAGAACGATT TGCACGAGAG
CACGGCGGCG TCGGCGCCGT TCTTGCGTGG CCGAGCGTTG TGGGTCACGG CTCGCCTATC
GAGCGGCGTG CCGACGGAGA TGGCGGATGC CATTCTCAGA GCTTCGGTAA GTTCGCTGGC
GCCCGGTTTA GCGCCACCAC TGCGTATTGG TGCGTGCCGC GCGATCGCTG AGTTTTTACC
GATCGCGAAG AAGGAAGTGA CGACTCCGTA CATTGGTGAA ATTTACAAAG GCTTGGGCAA
TTTGCTCGTC GACGCCGGCG AAGAGACTCT GCATCTCATT TTAGAAGCCA TGCTCGTGCT
CATTAAGGCT GACTCCGACG CTGCAGCGGC GTGGTTGAGC GCACTCGCGC CGGCGGTGGT
GAAAATATGG GCGGAGTACG TCCGAGATCC CTTGGTGAGC GCAGATACCA CCGAGGTGTT
TGAGGCGCTC GCGGAGATTC CTGCGTGCCA GGCGCAGTTG CACACCATGC TCGTGCCGAC
GCTTTCGCAC ATACTCGCAT CGCCGAGCGA ACAACCTGAA ATGTTAGTGG AGGCGACGCT
CGATCTATTG ACAATCATTC TTCGCCCGGC GTCGCCTGAG ACGGCCAAGG CTACGCACGA
CGTGTGCTTC AAGTACGTGT GCGGTCTCAT CATGCAGAGC GACGACGCCG GCGTGATGCA
GGGTGCGTCC GAGACGTTAC GCGCATTTCT TCGCGCGGGA AAAGAGAACA TGCTCGAGTG
GGGTAGTGGT GACCCGACCG TGGGCGGTGG CGACGTGTTG CGCGCGATGT TTGAAGCCGC
GTCGCGCTTG TTGGATCCAA ATTTGGAGGA CAGCGCAAGT CTGTACGCGG CACCTTTGCT
GTGTCAAATG CTTCGTCGTT TGCCGACCAA GGTGGGCCCG GTGCTTCGCG ATATCACGGC
TGCGGTCGTG GCGCGCTTGC GCTCTTCAAA GCAGCCCAAT CTGTCGGCGT CGTTGTTGAC
GGTGTTCGCG CGCATCGTGC ACGTGGACGC CAACGCGTTC ATCGAGCTCT TGATGTCGCT
TCCGAGCGGC GGTGACGAGC CGAACGCGTT CGATTTCGTC ATGCGACAGT GGTCAGAGAA
GCAATGCGAT GTACACGGTT CGTTTGACAT CAAGTTGACC ACGACCGCGC TCGGTTTGCT
TCTCAACACG CAGAGTCCAG CGTTGCACGC CGTCGTCGTC AAGGGCCAGC TCGTGGAGAC
GCCCGCGGAG AGCGGCCGCA TTCGCACGCG CGCGCGCGCT CAAGCCAACG GCCCAGAAGT
CTGGACCCAA ATCCCACTCT CCGCCAAAAT AGTCGAACTC TTAGCCGACG TCCTCATCGA
GTACGCCGAA GGCATGGCCG GCGCCGAAGA CGACGAAGAC GAATGGGAGG AGGAGCGCGA
CGACGACGAA GACGACCCCG ACGACGCCGC AGACGACGAC GACTTCACCG GCGAGGAAAA
GGAATTCACC GGCGACTTAT TCGAGCGTCT TCTCATGCGC GGCGGTCTCG ACGCGTTCGA
TCCCGACGAC GCCGACGAGG CCGAGGATCC CGTGAACGAC ATCGACGTTC GCGCCTTCGT
CGTCGGCGGC TTTCGCGCGC TCCACGCATC CGGCGTCCTC GCCCCGCTCG CGCAGTCCAT
CGCCACCAGG CACCAGCGCG CCATTCACGA CGCGCTCACG CATCAGTAA
 
Protein sequence
MRADEVARCL AETLSPDAVA RAEAQRAIER MGGEPGFAET LASIALRGVE GAVVDISTRQ 
LSAVLLKKHV REHWNALDER FVAPELTEAE KQGLKTVLPK GLADESSKMR TAFAAGIAQA
AASDGAIWDE LTTTLVEGIR AKRSRSEVLG CLKCYEIIAG EIDAKDVATV GPTLFPELLT
LARHGEDGAL RRRAETAFSS TVSALTTLTG TEQKEMRDML LPYLPTWLET VAIALEGMPN
PNNFDALAST MAALTSLALA VQYFTKPAGE ALMPALSRGA IMFHTIAPVW AKYSEETDHL
DPGMDSDGDT VSFEAVVTEL LELVINIAEQ PKLNKLLEPN LADTLYVTMG YMTMSSSQEE
MWMDDPNQFV ADEDDDFGNV RAACGLMLDS LGERFGVKAV AALWNASNRR LAESISAQQT
GDSMWWRPRE AALLAVGTMN EVVLSSLERA QEKGKPAPFD IAAFMKTVIE NDLHESTAAS
APFLRGRALW VTARLSSGVP TEMADAILRA SVSSLAPGLA PPLRIGACRA IAEFLPIAKK
EVTTPYIGEI YKGLGNLLVD AGEETLHLIL EAMLVLIKAD SDAAAAWLSA LAPAVVKIWA
EYVRDPLVSA DTTEVFEALA EIPACQAQLH TMLVPTLSHI LASPSEQPEM LVEATLDLLT
IILRPASPET AKATHDVCFK YVCGLIMQSD DAGVMQGASE TLRAFLRAGK ENMLEWGSGD
PTVGGGDVLR AMFEAASRLL DPNLEDSASL YAAPLLCQML RRLPTKVGPV LRDITAAVVA
RLRSSKQPNL SASLLTVFAR IVHVDANAFI ELLMSLPSGG DEPNAFDFVM RQWSEKQCDV
HGSFDIKLTT TALGLLLNTQ SPALHAVVVK GQLVETPAES GRIRTRARAQ ANGPEVWTQI
PLSAKIVELL ADVLIEYAEG MAGAEDDEDE WEEERDDDED DPDDAADDDD FTGEEKEFTG
DLFERLLMRG GLDAFDPDDA DEAEDPVNDI DVRAFVVGGF RALHASGVLA PLAQSIATRH
QRAIHDALTH Q