Gene OSTLU_52043 details


Gene Information

Locus tag: OSTLU_52043
Symbol:
ID: 5006971
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009375
Strand:
Start bp: 173552
End bp: 175800
Gene Length: 2249 bp
Protein Length: 634 aa
Translation table:
GC content: 64%
IMG OID: 640422392
Product: predicted protein
Protein accession: XP_001422913
Protein GI: 145357412
COG category: [K] Transcription
COG ID: [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP
TIGRFAM ID:
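
The gene length listed above agrees with the start and end coordinates if both are read as 1-based and inclusive. A minimal Python sketch of that arithmetic follows. The function name gene_length is hypothetical, and the coordinate convention is an assumption inferred from the numbers, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the span length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


if __name__ == "__main__":
    # Coordinates taken from the record above.
    print(gene_length(173552, 175800))  # 2249
```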


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value: 0.493636
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0345389
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACCGCG CGTCACGCAT CCCCGCGCGC GCGCGTTGGC GCATCGCGTC GCTCGCGTCG 
CCGCGGCGCG CGCCGTCGCC GCGTCACCGC CGCTCGCTCG TCGTCGTTCG CGCCGCGCGC
GCGACGCCCG CGACGTTCGA TTACACCGCC CTCGTCGCGA GCGTGCGGGA GATAAACGCG
CTGTCGACGC CGGTGAAGAT CGACGCGTGC GCGCAGGTGG ACGCGCACTC GATGACGCTG
GACGCGCGCG CGGCGGATGG ACGGCGGAGC GTGCGAGTGA GTTGGCACCC GGTGACCGCG
CACGTCGCGA TGAGCGCGCG CGGGGCCAAG GTGGAGAAGG GGATGAATCT TAGTTTTGGC
GAGAGCGCGA ACGCGCTGCT CAAGGGGAAG GTGCTGCTGG ATGCGCGCGT GGGGACGGCG
TGGGAACGCG CGTGTCGGTT GCGATTCGGC GATAGGCCGG GAGGTGAGGC AGAGTACGAG
CTGTTGTGTG AGGTCATGGG GCGGTACAGC AACGCGTTCT TGCTTGATAG TAAAAATAAT
GATGAAGTTT TAGCGTGTGG GTATCAGGTT GGGGAGAAAC AGACGAGTGT GCGGCGATTG
GCGATCGGGT ACGCGTACGA GCCGCCGCCC TCGGCGCCCG GGATCGATCC CACGACCGGG
GTTGAAGGGG GCGTGGACGG ATGGACCGCG GTGTTGCGCG ACGTCGCACG CGGTCGCAGT
GGTGATATGG GCGAAGCGCG TTTGGATGAG TGCATGGTGC GCGCGTTTCG AGGGGTGTCA
CCGGCTTTGG CCAAGGCGTT GATTCGCGGC GCGGGCGTCG CCGATGGCGA GCGCGTCGAA
ACCGCGAGTG CGAGCGATTG GGTCGCCGTG TACGATGAGT TCATGCGATG GATTGCGAGC
GTTTCAAGTG ATGCCGACAG TGCGGGTACG CTCGCAAACG CCACGTGGTG CGAAGAGTTC
GGGCAGTTGT TATTGCACCC GTCGTCCGCG GGCGTCGTCT TGCCGCCCGC CGCCGACGAT
TTGAGCGTTG CCGGCGGGCC GATCGGCGCT CTTTTCGGCG CTGTGTACGG CGAAGCGGGC
GAAACCGACG TGTTCGAGCG CGAACGCTCG CGCATTTTAC AAGCCGTGCG CGCGCGTCTG
AAAAAGTTGG CGAGCAAAGA CGTTGGGTTC CGCAAGCAAT TAGAGGCGGC GAGCGGGCAC
GAAGAAATTC AACTGCTCGC CGACGCGTTG ATGGCGTACA GTTACAACTA TAAGCCTGGT
TCGAGCGAGC TTGAGGGTCA AGACTTCACC ACGGGCGAGA CAGTGATGTT TAAAGTAGAT
CCCGAGAAAG GTCCTGTGGG CACCGCCGAG GCGCTGTACA AAAAGACGAG AAAGCTGCGT
CGCACCGCCG ACGCCGTCGA GCCGCTCATC GAACAAAATG CGAGCGAGAT TGAGTATTTA
GAGGGCGTCG AATTTAGTAT TCTCGAAGTC AGCGAGTTCA AATCGCGCGA CGATTTATTG
TTCATCGAGG AAATTGGCGC CGAACTCGTG GACGGTGGAT TCTTAAAGCC CACGGGCAAG
GGCGCCGACG CGAAGACGCG CGCGATGGCG GCGGCGAAGA AGGGCGGCGG CGCGAAGCCG
AAAACCGGTA AGCAAGCGCG TAAGCAAGAG ATGATGAACG CCATTCGCAT CTATACCGCG
CCGAGTGGAA AGGAGGTGTA CTGCGGAAGA AATAGCCGCG GAAACGAAGC GGTTTCGCTT
TTCTTCGGAC AAGACCAAGA CGTGTGGTTT CACGTTCGCG GCGCCCCGGG CGCGCACGTC
ATCCTTCGCC AACAACCAGG CGAAACCGCC TCAGACGACG ACATCCAATT CGCCGCCAAC
GTCGCGAGCT TTCATTCGAA AGTGAGAACG GGCGGCAAAG TCAACGTTTC GTACACGTCT
CCTAAGTACG TGAAAAAGCC CAAGGGCGCC AGGCTCGGCA TGGTCACCAT CGACCGCGAA
GACGTCATCG TCGGACGACC GGACGACGTC GCGGACGTCT GCGACGGGCG ATGAATCGCG
CGCATCGCTC AAACTTGTAA CATTCGCCCC CTCGTCCCGA GGACCCATAA AGTCATAAAG
CGTCCATCGC GTCGTCGCGC GCACCGTCGC GTCTCGCCGC GCCCTTCGCC GCGCGCCCCG
CGGGCTTCGC GCGTCGCGCG CGACGCCGAC GCCGACGCCT CGCCTCCGAC CGCGCGCGCG
TCGACCGCGT CGACCGCGGC GACCGCGCG
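
The GC content field can in principle be recomputed from the gene sequence above. The sketch below assumes that GC content means the fraction of G and C among the A, C, G and T bases, with the spaces and line breaks used for display ignored. The helper name gc_content is hypothetical, and how the database rounds the value is not stated in this record.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G/C bases, ignoring whitespace and non-ACGT characters."""
    bases = [ch for ch in sequence.upper() if ch in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for ch in bases if ch in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # Small illustrative input in the same space-separated block layout.
    print(gc_content("ATGC GGCC"))  # 0.75
```

To check the record, the full gene sequence can be pasted in as one string; the whitespace between the 10-base blocks does not affect the result.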
 
Protein sequence
MYRASRIPAR ARWRIASLAS PRRAPSPRHR RSLVVVRAAR ATPATFDYTA LVASVREINA 
LSTPVKIDAC AQVDAHSMTL DARAADGRRS VRVSWHPVTA HVAMSARGAK VEKGMNLSFG
ESANALLKGK VLLDARVGTA WERACRLRFG DRPGGEAEYE LLCEVMGRYS NAFLLDSKNN
DEVLACGYQV GEKQTSVRRL AIGYAYEPPP SAPGIDPTTG VEGGVDGWTA VLRDVARGRS
GDMGEARLDE CMVRAFRGVS PALAKALIRG AGVADGERVE TASASDWVAV YDEFMRWIAS
VSSDADSAGT LANATWCEEF GQFVAGGPIG ALFGAVYGEA GETDVFERER SRILQAVRAR
LKKLASKDVG FRKQLEAASG HEEIQLLADA LMAYSYNYKP GSSELEGQDF TTGETVMFKV
DPEKGPVGTA EALYKKTRKL RRTADAVEPL IEQNASEIEY LEGVEFSILE VSEFKSRDDL
LFIEEIGAEL KGGGAKPKTG KQARKQEMMN AIRIYTAPSG KEVYCGRNSR GNEAVSLFFG
QDQDVWFHVR GAPGAHVILR QQPGETASDD DIQFAANVAS FHSKVRTGGK VNVSYTSPKY
VKKPKGARLG MVTIDREDVI VGRPDDVADV CDGR
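
The first ten residues of the protein (MYRASRIPAR) can be reproduced by translating the first 30 bases of the gene sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), since the Translation table field in this record is empty. The function name translate is hypothetical. Because the gene is marked as spliced, translating the whole genomic sequence directly is not expected to reproduce the full protein.

```python
from itertools import product

# Standard genetic code (NCBI table 1, an assumption here), codons ordered over T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace and any trailing partial codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    # Codons with characters outside T/C/A/G are rendered as 'X'.
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGTACCGCG CGTCACGCAT CCCCGCGCGC"))  # MYRASRIPAR
```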