Gene OSTLU_31875 details

Gene Information

Locus tag: OSTLU_31875
Symbol:
ID: 5001794
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 688571
End bp: 690313
Gene Length: 1743 bp
Protein Length: 526 aa
Translation table:
GC content: 59%
IMG OID: 640417215
Product: predicted protein
Protein accession: XP_001417843
Protein GI: 145346743
COG category:
COG ID:
TIGRFAM ID:
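
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive (which matches the reported gene length) and that the gene span includes the stop codon and no untranslated regions. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 688571
END_BP = 690313
PROTEIN_LENGTH_AA = 526


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


span = span_length(START_BP, END_BP)    # genomic span of the gene
cds = coding_length(PROTEIN_LENGTH_AA)  # coding bases, stop codon included (assumption)
remainder = span - cds                  # bases in the span that are not coding

print(span, cds, remainder)  # prints: 1743 1581 162
```

Under these assumptions, the 162 bp remainder is consistent with the record marking the gene as spliced, although the record itself does not list exon boundaries.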


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCGC GACGCGAGAC GACGACGGAC GACGCGAGTC GCGCGTCGGA CGCGCGCGAC 
GACGCGTCGA CGCCGGAGAC GTCGGCGAAA CGCGACGCGC GATACGAATA TGAGATCGCG
TTGGGGAGCA GTGGATGGTT GTTCGTGTAT TACGTCGGCG TCGTCAAGGC GATGCGCGAA
CGAGGGATGG CGAGGTGCGC GTTGAAGATG TTTGCGCGGT GCGCGCGCGG TTCGGAACGC
GACGGCGCGC GAGAGGGTGG TTCGGTGGCG TAAGATGGGG ACGCCGACGC CGGACGACGC
GTCTCGCGCG ACGACGGATG ACTGACGAAA CGGCGGCGAT TTTACGATGC GCGCAGTAAA
ACGAAGGTGT ACGGGACCTC TGGTGGGGCA CTTTCGGGGG CGTTGTTGTT CATGGACTGC
GATTTGGACG CGTTGGCGCA GTATGTGTAC ATTTGCGCGG CGCGAGCGCG GCGGTCGGTG
TTGGGGGCGT TTCAACTGCG CGCGTACTGT CGAGGCGCGA TGACGGAGTT TTGTGATCCC
AAGGCGCATG AGTTACTGTC CGGGAGGTTT GAAGTGTCGA TCACGAGAAT ACTGCCGTCG
TGGAAGAATC TACGAATCAG TTCGTTTCCG ACGTATGACT TTTTGATTCA GGCGTTGTTG
TGCTCGGCGT GCATCGTGCC GCTGAGCGGG TTGCCGATGT GGTTGCGAGG TTTCGGTTTG
TGCCTGGACG GTGCTGTGAC GGACATGCAG GTTTGGAAAG GGTTTAAGAA AGATGGCACC
TTCAGCAAGT TGCATTGCAA GGAGGCGAAC CCGAACATCG TCATCGTCAA TCCATTTTAT
TCGTCGCGGG CGGACATCAA GCCGAGCAAG TACATCCCGG TTTGGTGGTG CTTTTACCCG
CCAGAGCCGT ACAAATTGAA GCAGCTCTTC GAGATGGGAT ACGCAGACGC GCACGATTGG
TACGAGCGCA CGCACGGCTT AGCGAAAACA TTGAAGTCGT CGCCCAAAAC GCGAGCGGTG
GGCAGCGACG ACGGCGCTTC CGCGGAAGAC GAAGAACCGC CGCGAAGCTG GGCGACAGAC
TGTCAAGAGT GGGCGAGCCA GAGCGCAGAG ACGGCGGCTC GCCGCGCGGC CGCTGCGGCT
GAATCCGCGC GTCGTGCGGC GGCAAAACAC GCCGACGAGT TTGGATCTTT CATGAAGCAC
GAAGCCGAGC TCGTGACGCG CATGAGTTTC AAAGCAGCCG AGGTTGCGTC CGCGGCGGCG
CGAAGCGTGA GCGAGATGAA GGACGACAGA AATTTCGCAG AGCGCGCCGA CGCCGCGGCG
CACGCCACCT TAGACGCGTT TGTCGTCGTC GGCAAGGGTA CCATCATACC CTTCAGATTG
GTTCTCAAAA TTGCCGCCTG TTTGCTCGTT TACATGGAGC TCGTAGTTCA AGCGTTCGTC
GCCTTTGTCG GCGCCACGGT GGCGTGCATC TTTCCGGGTA AAGTCGGTCG ACAGGGCACT
GAGTTGTGGC ACAGGTGCCG GGCATTTTCC CAACCGTTGC CGCGTGTGTT GCTCCAAGCC
GTACCAGGAA TCAAAATTGA GGCGAAAATC AACGAACGTA CCGCTAAACA GCTCGGAGAG
CTGTCCGCGT TGTACCGTTT GCTGTGTTAT CTCGTGCACA TGGAGGAGCG CGAGTTCGAG
ATTCTTCGAA AACGACTTTC GCAAGGCAAT CTAAGCGCGC TCGTAGACGA CGTTAAAACG
TAG
 
Protein sequence
MRARRETTTD DASRASDARD DASTPETSAK RDARYEYEIA LGSSGWLFVY YVGVVKAMRE 
RGMASKTKVY GTSGGALSGA LLFMDCDLDA LAQYVYICAA RARRSVLGAF QLRAYCRGAM
TEFCDPKAHE LLSGRFEVSI TRILPSWKNL RISSFPTYDF LIQALLCSAC IVPLSGLPMW
LRGFGLCLDG AVTDMQVWKG FKKDGTFSKL HCKEANPNIV IVNPFYSSRA DIKPSKYIPV
WWCFYPPEPY KLKQLFEMGY ADAHDWYERT HGLAKTLKSS PKTRAVGSDD GASAEDEEPP
RSWATDCQEW ASQSAETAAR RAAAAAESAR RAAAKHADEF GSFMKHEAEL VTRMSFKAAE
VASAAARSVS EMKDDRNFAE RADAAAHATL DAFVVVGKGT IIPFRLVLKI AACLLVYMEL
VVQAFVAFVG ATVACIFPGK VGRQGTELWH RCRAFSQPLP RVLLQAVPGI KIEAKINERT
AKQLGELSAL YRLLCYLVHM EEREFEILRK RLSQGNLSAL VDDVKT
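
The sequences above are printed in space-separated groups of ten, so they need to be joined before any length or composition check. The sketch below is an illustration only. The record does not state how its GC content figure was computed, so the usual definition, (G + C) / total length, is assumed here, and the helper names are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record, as printed.
first_line = "ATGCGCGCGC GACGCGAGAC GACGACGGAC GACGCGAGTC GCGCGTCGGA CGCGCGCGAC"
seq = clean_sequence(first_line)

print(len(seq))  # prints: 60
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Passing the full gene sequence block to `clean_sequence` and comparing `len(...)` against the Gene Length field, and `gc_fraction(...)` against the GC content field, gives a quick check that the text was copied intact.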