Gene OSTLU_41587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_41587 
Symbol 
ID5005149 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp234346 
End bp235446 
Gene Length1101 bp 
Protein Length339 aa 
Translation table 
GC content57% 
IMG OID640420570 
Productpredicted protein 
Protein accessionXP_001420943 
Protein GI145353273 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.0182638 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0311879 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTCG GCGTGTGCGT CATCACCGGT TTTCTCGGCA GTGGCAAGAC GACGCTGGTG 
AATTACATCT TAAAAGCTGA TCATGGGTAC AGGATAGCCG TCATTCTCAA CGATTTCGGC
GCCGAACTCG GCGTGGAGAA GATGCTCGTC CAACAGGATG GCGTCGACGG GGACAACGCC
TCGCGCACGC TCGTCGAGGA TTGGGTCGAA CTTAACAACG GATGCGTGTG TTGCACGGTT
AAGGGGAGTT TGGTGCAAAC CATCGAGGGT TTGCTGGAGA AGCGGAAGGA GATGGGGGAA
AAGTTTGACT TCATCCTCCT CGAAACCACC GGACTCGCTG ACCCGGGGCC AGTGGCGCGC
GAGCTGTGGG TAGACGACGA ACTCGTGGAG GAAGACGGGG CAGTTTTAGA CTCCATCGTG
ACTTTAGTCG ACGCATCGAA CATCGAGAAG CAGATCGAGG AGAACAAAGA GGCGACGCTG
CAGGTGGCGT ATGCGGATAC GATTTTGTTA AATAAAGCCG ACTTGGTGAA GGCGGAAGAC
TTAGAGCGCG TGAAAGCGCG AATAGCAGCG ATTAACGCCG AAGCCGAGAT AGTGGTGACG
ACGAGATCGA GCGTGGATTT AGGCATCGTG CTCAATCAAG GCACGGTCGC CGGTGGAGGT
TGCGGTAGGA AGCCGGTTTT AGGCGAATTC GCGTCGGCGC CTCCGAGCGC GGTGGTCGCC
TCGGGGGGCG GTTTCTGGGC CAAGGGTGTC GAAAAGTACG CACCTGCGTC GAGCGTGCAC
AATTCAGACA TCAGAACCGT GTGCATCGCC ACGAGTGGGT TCTTGGATGG CGAATCCTTT
CAAAACTGGC TCGAAGACTT GCTGTGGGAA CGACGAAACG CCGAGGGCGG CGCGGACATT
TTACGCGCCA AAGGTCTCGT TTACACCGCA GGATCGGACA AGCGCCGAGT GTTACAAGCG
GTTCGCGAAG TGTACGAAAT CACGGACGGG CCCGTCGAAG AGAACTCGGA CGAAGTGATG
AACAAGCTGG TGTTCATCGG ACGTAACTTG GACGAAGACG CCTTGGCAGT CGGCCTGAAA
TCGTGCGTCG CGCGCGACTG A
 
Protein sequence
MPVGVCVITG FLGSGKTTLV NYILKADHGY RIAVILNDFG AELGVEKMLV QQDGVDGDNA 
SRTLVEDWVE LNNGCVCCTV KGSLVQTIEG LLEKRKEMGE KFDFILLETT GLADPGPVAR
ELWVDDELVE EDGAVLDSIV TLVDASNIEK QIEENKEATL QVAYADTILL NKADLVKAED
LERVKARIAA INAEAEIVVT TRSSVDLGIV LNQGTVAGGG CGRKPYAPAS SVHNSDIRTV
CIATSGFLDG ESFQNWLEDL LWERRNAEGG ADILRAKGLV YTAGSDKRRV LQAVREVYEI
TDGPVEENSD EVMNKLVFIG RNLDEDALAV GLKSCVARD