Gene OSTLU_88476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_88476 
Symbol 
ID5003965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp452667 
End bp455876 
Gene Length3210 bp 
Protein Length1069 aa 
Translation table 
GC content62% 
IMG OID640419386 
Productpredicted protein 
Protein accessionXP_001420008 
Protein GI145351275 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.289922 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.626674 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAGC AAAAGTACAC CGCGTTCGAC GTCGCCGCCG TCGTCGCCGC GCTGCGCCGC 
GCCGCGCTCG GGTGCTGGCT CGCGAACGCG TACGACGTCG ACGCGACGTC GGGAAACAAA
AAGTTTCTGT TAAAGCTGAA TAAACCGTCC GGCGCGGTGG CGCGAGACGC GCGCGCGGAC
GCGACGACGG CGGAGAGCGA GAAAATACTG GTGTTCATCG AGTCGGGGAC GCGGGTGCAC
ACGACGAGGT ACGAGCGAGG AAAGACGACG GCGCCGACGG CGTTCACGGC GAAGCTGCGA
GCGCGCGCGA AGGGGAAACG CTTGACGGAC GCGAGGCAGC TGGGAAGGGA TCGAGCGATC
GATTTCACGT TCGGGGGGGG AGGGGAGAAT GAGTGTCATC TGATCGTGGA GTTGTACTCG
CAAGGGAACG TGATTTTGTG CGATGGGAAT TACACCGTGG TGGCGCTGTT GCGATCGTAT
CGAGACGGCG GCGACGTCAA CATTTTGCCG AATCATCAGT ACCCGTTGGA GCGGCTGAAG
GGATTTCAGC TCGGTGGGTA CACCCGGGAG GACGTGGTGA GCGCGTTGGC GCGCGGGGTG
TTGGCGACAG AGGAGGAGAC GATGGGTGGG GACGCGCGGC GGGCGCCGGC GACGCTTCGC
GAGGCGTTGT GTCGAGCGTT CGGGTACTCA CCCGCCATCG CGGATCACGT GGCGTTGACG
GCGTCGATCG AGCACGGCTC GAACGCATCG CTACCGCTGA GTGAGGCGTG CGTCGATCGG
CTGACGGCGG CTGTGCGAGA TTTAGAGAGT TGGTTCGAGG GCGTCACGAC GGGCGACGTC
GTCGCCGTGC CCAACGTGTG CACGAAGATG GACGCCAACG CCGACGGCAC GGACGAGATC
GAGATCTTCG ATGATTTCTC CCCGTTCTCG TTGAAACAAA ATGAAGGCCG ACCGACGAGG
AAGTTCGAGC TTCCCAAGGG GTTAGACCCG GTGTGTGCGT TCGACCACGC AGTCGACGAG
TACTTCATCG CTCTCGAGGC GCAGTCGCAA ATCTTAGCGC GACGTAAAGC TGAGGCGCAA
GCGTTGGCTA AATTAGAAAA ATCACTCAAG GATCAGAAAA GTCGCGTCGA GCAGCTCGAA
CGCGAGCGCG AGAAAGAGGA GCAACGCGCG GTTCTCATCG AGTACAATCA CGAAGCCGTG
GACACGGCGA TTGACGCCGT GAATTCGGCG TTGGCGAGCG GAATGTCCTG GCCCGAGCTC
GAGGCCATGA TCAACGAAGA GCGCCGGCTC GGGAATCCGG TGGCGGGGAT GATCAAGTCA
CTCGATCTGG CGAATAATCA AATCACTATC ACGCTCGCGA ATCATCTCGA CGAAGTCGAC
GAAGTCGACG CGGCGAGCGG TAAGCGCAAA CGAGTCGCCG TGGGCGTGGA TTTGGGATTG
AGCGCGCACG CCAACGCGTC CATGCGCTTC GCAGCGAAGA AGAAACACGC GGAAAAGTTT
AGCAAGACAG TGGATGCGCA GTCCAAAGCC GTGGCCGCGG CTGAAGCCAA GGCGAAGGCG
GCGATGGAAA AGGCTGCGAA CGGATCGTCC ATCGCGCGCG CCAGACAACC GCTTTGGTTT
GAAAAGTTCA ACTGGTTCAT CACGAGCGAA AATTGTTTGG TGCTTCAGGC GAAAGACGCG
ACGCAGGCGG AGATGCTCAT CACACGGTAC ATGCTCCCAG GCGACGCGTT CGTGCACGCA
GAGGTACCGC AGGCGCCCGT CACCTTGGTG AAACCGCCGC CTGGCGTCGA CGTGCGAGCG
GTGCCGGCGT ATTCGCTCGT ACAGGCGGGC GCAGCGGTGA TGTGTCGTAG CAGCGCTTGG
AATTCACGCG CGGTTAAATC GGCGTGGTGG ACGAGCTCTG AGCGCGTGAG CAAGATTTCC
CCGGTCGCCG GCGACGCGCT TCCGCCAGGC GTCACGCACG TCGCGCACGC GGACAAGCAA
TTCCTGCCGC ACGCGCAACT CGTCATGGGG TTCGGATTAA TGTTCGTCGT GAGTGAGAAG
AACGCGGAAG CGCACAAAAA CGAACGATTG GTGCGAAGCG ATTTCAACAT TTTAGAAGAA
GAGGGCGACG AGACGTTCGA CGAGGACGAC GAGCAAGAAG ACGAGCGAGA CGACGCGAAT
GGCGGCGCCG AGATTAGAGG AGAAGTCAAC AAACTCGCGG CGTTCTTAGA TGGTGCCGTG
GGTTTCGCAG GAGAAGATGA GCGATCGAGC GTAGACGACG GCGACGAGAA CGACGATGAC
GACGGCGACA CCGTCGCCGC CGCGCCGTCG CCCGCGAAGC CGGCGACTCC GCGCATGTCG
GCCAAGGAGC GCAAGACGCT GAAGAAGAAG AAGGGCGGCA AGGGAGGGAA TGATTCGGAC
GGCGACGGCG ACGACGATTT CATCGACCCA TTAGCCGAGA TGAAGAAGAA ATCATCGTCG
AAAACCGTGC CGATCGAGAC GAAGAAGGCA CCTCGCGGCA AAGCGGCGAA ACTCAAGCGC
GCCAAGGCCA AGTACGCGGA TCAAGACGAG GAGGATCGGG CGCTTGCGAT GGAATTTCTC
GGTGCGAGCG GTGGCAGCGG CGGGAAGAAA GTCACGGGCG CGTCCAAGAA AGCGGCCAAG
GCGGCGGAAA AGTTTGAAGA GCGCAAACAG GAGAAACCGA CGGCGCCGAG CGCGCCCGAA
CCGGCACCTG CGCCGTTCGT CAAGCGTCGC GAAGCCGCGG CGGCCGCGTT CGCGCGCAAG
GCGGCCGCTG ACGACGATTT TCCGGCAAAA TATCAGGACG ACGAGGACGA CGACGAGTCA
AAGGCGTCTC TCGTCCCAGA CGAAGCGTCC ATCGAAGAGC GCCTAAAGCT CGACGCCGAA
CGTCTCGAAA TCGTCAACCG AATCGTCTCT GCGCCGTTCA AAGACGACGA CATCGAGTAC
TGCCTTCCCG TGTGCGCCCC GATCACCGCC ACCAACGCGC TCAAATACCG CATGAAAGTC
ACCCCTGGCT CGCAGAAGAA AGGTAAAGCC GCGAAACTCG CGATGGAAAT CCTTTCCCGC
GCGCCCTTCG CCACGCCTCG CGAGCTCGCG TGCGTCAAAG CCGTCGCCGA CGTCGACGCC
GCCGTCGCGC TCCCCGCGGG GTGTAAGATT AGCTTACCAC CCGGGGCGGC GAAATCCATG
TCCAAGGGCG GTAAGAAGAA GCGCCGTTGA
 
Protein sequence
MPKQKYTAFD VAAVVAALRR AALGCWLANA YDVDATSGNK KFLLKLNKPS GAVARDARAD 
ATTAESEKIL VFIESGTRVH TTRYERGKTT APTAFTAKLR ARAKGKRLTD ARQLGRDRAI
DFTFGGGGEN ECHLIVELYS QGNVILCDGN YTVVALLRSY RDGGDVNILP NHQYPLERLK
GFQLGGYTRE DVVSALARGV LATEEETMGG DARRAPATLR EALCRAFGYS PAIADHVALT
ASIEHGSNAS LPLSEACVDR LTAAVRDLES WFEGVTTGDV VAVPNVCTKM DANADGTDEI
EIFDDFSPFS LKQNEGRPTR KFELPKGLDP VCAFDHAVDE YFIALEAQSQ ILARRKAEAQ
ALAKLEKSLK DQKSRVEQLE REREKEEQRA VLIEYNHEAV DTAIDAVNSA LASGMSWPEL
EAMINEERRL GNPVAGMIKS LDLANNQITI TLANHLDEVD EVDAASGKRK RVAVGVDLGL
SAHANASMRF AAKKKHAEKF SKTVDAQSKA VAAAEAKAKA AMEKAANGSS IARARQPLWF
EKFNWFITSE NCLVLQAKDA TQAEMLITRY MLPGDAFVHA EVPQAPVTLV KPPPGVDVRA
VPAYSLVQAG AAVMCRSSAW NSRAVKSAWW TSSERVSKIS PVAGDALPPG VTHVAHADKQ
FLPHAQLVMG FGLMFVVSEK NAEAHKNERL VRSDFNILEE EGDETFDEDD EQEDERDDAN
GGAEIRGEVN KLAAFLDGAV GFAGEDERSS VDDGDENDDD DGDTVAAAPS PAKPATPRMS
AKERKTLKKK KGGKGGNDSD GDGDDDFIDP LAEMKKKSSS KTVPIETKKA PRGKAAKLKR
AKAKYADQDE EDRALAMEFL GASGGSGGKK VTGASKKAAK AAEKFEERKQ EKPTAPSAPE
PAPAPFVKRR EAAAAAFARK AAADDDFPAK YQDDEDDDES KASLVPDEAS IEERLKLDAE
RLEIVNRIVS APFKDDDIEY CLPVCAPITA TNALKYRMKV TPGSQKKGKA AKLAMEILSR
APFATPRELA CVKAVADVDA AVALPAGCKI SLPPGAAKSM SKGGKKKRR