Gene OSTLU_18123 details

Gene Information

Locus tag: OSTLU_18123
Symbol:
ID: 5005595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 421910
End bp: 423706
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table:
GC content: 55%
IMG OID: 640421016
Product: predicted protein
Protein accession: XP_001421465
Protein GI: 145354383
COG category: [A] RNA processing and modification
COG ID: [COG5186] Poly(A) polymerase
TIGRFAM ID:
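
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(421910, 423706)
print(length, protein_length_aa(length))  # 1797 598
```

The result matches the Gene Length (1797 bp) and Protein Length (598 aa) fields of this record.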


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.622688
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGGCG CGCAGACGCC CTCCGCACCG TCGGTCATGA CCCAGTCGGC GTCCGCGGAG 
GCGCTGAATC GCGTGATGTC GAACGAGATG CCGACGGCGT TCGACGAAAA GTTGAGCAAG
GCGCTCGATG ACAAACTGCG GGAAGAGGAC GTGTACGAAG ACGCGGACGA GTGCGTGCGA
CGCGAGGAGG TGCTGGGGGA GATAAACGCG TTGTTGCAAG ACTGGGTGCT GGCGGCGAGC
GAGCGAAAGG GGATCACAGA GGATATGCGG CCGTCGTGTA ACTTGTACAC GTTTGGGAGC
TATAGATTGG GCGTGCACGG ACCGGCTGCG GATATCGATA CGCTGTGCCT CGGGCCGAGA
CATCTGAGCC GAGAGGAGGA TTTCTTTGGA TGGGATGAGA ATGATTACGA AGGGTCGTTT
TATGACGTGA TGCGGAAACA CCCTGGAACG GAGAGCATCG TGCCCGTGCG CGACGCCATC
GTGCCAGAAA TCAAACTCGT GTTTAGAGGT TTTGAAATAG ATATGGCGTA CACGAGCTTA
CCGTCGTACA CGCACGTGCC AGAGGACTTA GACGTGTGTC AGACGTCGGT GATGATGAAT
TTGGACGACC CAGGGGTGAA ATCCCTGAAT GGTTGTCGCG TGGCGGATCA GTTGTTGCGC
GTCGTGCCAA ACCACGACGC GTTTCGAGTG GCGCTTCGCA CGTTGCGTCT GTGGGCGCAG
CGGCGCGGGG TGTATAGCAA CGTCGTCGGG TTTTTCGGTG GCGTAAACCT GGCGATTTTA
GTAGCGCGCG TGTGTCAGTT GTACCCCAAC GCCGCACCTT CCATGCTCGT GTACAGCTTT
TTCCAACTAT GGTCGGCGTG GCAATGGACG ACGCCGGTGA TGCTCGTACC CATCGTCGAC
GAAGGCTTAC CCGGGATGCG AGTGTGGGAC GAGCGTGTGA ACAAGGCAGA GCGATATCAG
TTGATGAAAA TCTTAACTCC GGCGTATCCG GCGCAAAACT CTACGTTCAA CGTGAGCGTC
AGCACGCTCG AGGTGCTCAA AGCCGAGTTC AAGCGCGGCA AGGAAGTGAC CAAGATGATT
CTTTTAAACA CCGCGAAGTG GGAGCAGCTG TGGTGTTCGT TGAACTTTTT AGAAAAATAC
AAGCATTATT TGATGGTTAC CATCTCGGCC AAGAACGAAG ATGACTTTAA GAAGTGGGAA
GGTTGGGTCG GCTCGCGAAT CAAGCTTTTA ATCCAAGGCA TCGAAAACGC CACGGGAGGG
CAAATGCTTG CGCATCCCGG CACGTCGAGG TACAAAGATC CGGAGAAGGA CGAAAACTCG
CACGTCACGC TGTTCTTGGG GTTATTTCCT TCGAGCCTGA AGAAGAAGGA GGAGAAGGTT
TCGCTCAACT TGAATCCCGC GGTGGAGCAG TTCCAAATGA CTGTGACGTC GTGGATGGAT
CGCGCGACTG GGGAATCGAA CTGGGTACCC GGGATGCAGG CGAACGTCAA ATATTTGAAG
CGCAAGGATT TACCGAGCTT CGTGAAAGAA GAAATCGACG GCTACGTCAA GGACATCTAC
GCGCCGGAGA AGGAGAAGAA AGCGGCGAGC GCGGAGGACG AGGAAAAGAA AATCTCCGAA
GAGAAACCTT TGCCCCTGGG AGACGATACG AAGGCGGCTC GCAAACGCAA AGAAATGCAA
GAAGACGATT CAATCACGCA ATTGGACTCG CTCAACGATG ATTTACACGC GGGGACGGAG
ACCGCGGCGG CGAAGAAGGT CAAAGTGAGC TTCGCGCAAG TCGTCGCGAA GAAATAG
 
Protein sequence
MIGAQTPSAP SVMTQSASAE ALNRVMSNEM PTAFDEKLSK ALDDKLREED VYEDADECVR 
REEVLGEINA LLQDWVLAAS ERKGITEDMR PSCNLYTFGS YRLGVHGPAA DIDTLCLGPR
HLSREEDFFG WDENDYEGSF YDVMRKHPGT ESIVPVRDAI VPEIKLVFRG FEIDMAYTSL
PSYTHVPEDL DVCQTSVMMN LDDPGVKSLN GCRVADQLLR VVPNHDAFRV ALRTLRLWAQ
RRGVYSNVVG FFGGVNLAIL VARVCQLYPN AAPSMLVYSF FQLWSAWQWT TPVMLVPIVD
EGLPGMRVWD ERVNKAERYQ LMKILTPAYP AQNSTFNVSV STLEVLKAEF KRGKEVTKMI
LLNTAKWEQL WCSLNFLEKY KHYLMVTISA KNEDDFKKWE GWVGSRIKLL IQGIENATGG
QMLAHPGTSR YKDPEKDENS HVTLFLGLFP SSLKKKEEKV SLNLNPAVEQ FQMTVTSWMD
RATGESNWVP GMQANVKYLK RKDLPSFVKE EIDGYVKDIY APEKEKKAAS AEDEEKKISE
EKPLPLGDDT KAARKRKEMQ EDDSITQLDS LNDDLHAGTE TAAAKKVKVS FAQVVAKK
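
The protein sequence can be related back to the gene sequence by translating codons in frame from the first base. The record leaves the Translation table field blank, so the sketch below assumes the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical. It ignores the spaces and line breaks used to group the sequence into blocks of ten, and it raises `KeyError` on any character other than A, C, G or T.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and newlines
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 18 bases and last 12 bases of the gene sequence above.
print(translate("ATGATCGGCG CGCAGACG"))  # MIGAQT
print(translate("GCGAAGAAATAG"))         # AKK
```

Both fragments agree with the start (MIGAQT...) and end (...AKK) of the listed protein sequence; passing the full gene sequence to the same function gives the complete translation under the stated assumption.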