Gene OSTLU_16663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16663 
Symbol 
ID5003642 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp89074 
End bp92392 
Gene Length3319 bp 
Protein Length1048 aa 
Translation table 
GC content60% 
IMG OID640419063 
Productpredicted protein 
Protein accessionXP_001419704 
Protein GI145350630 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.207195 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCGC TGTCGAGCGT CGCGCGCGCG CGCGCGTCGG GAACGACGGT GCGCACGCGC 
CGTCGCTCGC GGCGCGCGCG CGGGACGCGA AGGGCGACGA ACGAGGACGA CGCGGACGCT
GGGTTCGACG TGGAAATCGC GGAAAAACCG ACGCACTGGC GCGTCGTCGA TGGCTTCGTG
AGCGATTTCG CGAACGATTT GCGCGCCGTG TACGATGGAC GCACGCGCGA TCCGCTGCGC
GTGGATCCGG AGCGGTTCGT GTGGGATTAC TGGCACGTGC CGAATCAGTA CACGCTGCTG
CGAACGCCCG CGGAGGACTA TTTCGGCGCC GAGCGGTACG GCGAGTTGGA AAAGGCGCTG
TTGGAGTACG GACGGAGAGA GTTGGGATGC TCGGCGATCA CGCCGGTGTG GATGTCGTGT
TACGTGCACG GGATGCGGCA AGAGTTGCAC GCCGATGTGC CGCACGGACC GTGGGCCTTC
GTGCTCAGTT TGACAAAGAA CGACGCGGAG GATGGGTCGT ACGGCGCACA TTTTAGGGGC
GGTGAAACGC AAATAATGCG CCCCGAGCGA TTGAATTATT GGAAGAATTT TGACGCCGGC
GAGGTCGTCG AGCGGTCGCA AATCATGGAG ACCATCGCGC CGACGTTTGG GCGTCTCGTG
GCGTTCGACC CGCGCCTACC GCACGGCGTC ACTGAAGTTT TCGGGACTCA AGATCCGAGG
CACGGAAGGT TGGTACTACA CGGATGGTTT AAAGACCCCG AACCGTCGTT TTCTGGCGCA
TTGACGGAGG AGGACGCGTC GCCGACGCTC GAGGGCGCGC TCGGGGAACT GTACGGACGT
CTCGTCGAGC TTCCCCGAGC GAGCGGCATG GTTTGCGCCG CGCTCACCGT AGCTCCAGAT
GGTGCGGTGA CCAATCTACG ATGGACGTGC GATACCCTGA CGCCGATTCC TTTGCCCGAG
CTCCCGAGTG AGACGGACAT TCGCGACGCC ATCATGCTCG ACATCGCGAG CGCGCTGTTG
GAGTTGAAAT TTCCGGCTCG CGATGCCGAG TCCGTCATCA CCTTACCGTT TTGTTTCGAA
TGAGCGTCAC AGACGCACTT TTTTCACGAT CGCACGTAGA TTTTAGGCGA TAGTTTTGTA
TCACAAACGT AGAGTAGCCA CTCGCCGCCG AGCGTCCAAC CGCCGAGGCA CAGCGCGCCT
CCCGCGCCGA CTCCATGACA GCACAGAAAT GACCCCGACC GCGTCCGACG GCGCCGACGA
TGATATTGTC GACGTGCACG CCTTCAAAAC CCCTCGCGGA GTCGTGCGCG TGCGCACGCG
GCGTCCGAAA CTCTTTCCAG AAGACACTTC AACCGCCGAG TGCGACGAGA CTGGACGCGT
CGCCTGGCGC GCTCTCCCAG CGCTCTGCGC GTATTTAGCC AGCGACGAAG GATACTCGAT
CGTGGTCGAG CGCTCGCGCG TGCTCGAGCT CGGCGCGGGG CTGGGGACGC CTGGGATGCT
GTGCTGGCTG AGCGGCGCCG CGCGCGAGAC GACGCTGACG GACGGAAACG CCGACGTCGC
GAGCGATCTA CGGCGATCGA TCGAGATGAA TCGAAACTTT GTAGATGACA ATGTATTGAC
GTCGCTCGGG AGCGCGACGG CGGGGACGCT GGAGTGGGGG CGAGGGGACG CAGATGATTT
CGCGAGAAAA ACGTTTCCAC TCGTCGTCGC GAGCGACGTG GTGTACAGCG AGGCATCGGC
GCGCGATGTG TTAGACGTCG TGCACACAAA GCTCGAGCGT GATAATGGGA TGCTTTTACT
CGCGTACGTG TCGCGTTGGG CGCACGTGGA TCGGGCGCTG TACGAGGCAA TCGTAGCCGG
TGGGTGGACC GCAACGCTGA TTCCCGTGGC GGATTTCGAC GAGCTCGCTC GCAAAGAATC
GTTGGAAGGT CGACCTTGTT TGTTCGAAAT CACACGCGGT GGGTGCGGCG CGCGCGGGCA
AATTCGAGCA AGAATGCCGA AACGCGCTCG TGAGTGGTTT GATGCTGAAT TAAACGTGCT
CAAGATCACA CCCGCGGACG CGTTTACGGA GCGCTTTGCT GATGACTTGT GTTCAACACT
CGAGACGCAC GGAGGCGCGA TCGAGAGTCT GGAGATTAAC GCGAAAGGGC CGTTTAGAAT
ACATCAAGAC ACGCTTCGTT GTTTGCTCGA CGTGTTTGCG AAACACGCGT CACGAGTGAA
GCTAATTAGG TTGAAGTTGT GCGAAACTTG GCTCGATGTC GACGGATGGC GCGCCTTGGG
AGATTTTCTC TGCGAATCGG ACGTGCGCGA TTTGGAGATT ATCGGTGAGG ATGTCGACGA
GGAAATTTTG CACGCGATGA CGAATTCGAG TGGAAACGTT GCAAACTCGT GGGTCAGTCG
ATTGAAGAGC TTAAAGTTCA GCCGTTGCGA CCGATTGAAC GCTCAAGCCA TTGTCGCGCT
TCGACAGTCG TGGTTTCCTC CGCCCGCTCT CGCACGCGAG TTGGAATGTT TCGAAGTAAG
CTTCTGCCCA ATCGGCGACG AAGGCATAAA ATCAATCTGT GACATCGAAT TTGTTGGCTT
GCGCGAGCTT CGTCTGGCGA ACGTTGGCGT GTCCGCGATT GGCGGCGCCG ACGTCGCTTC
AAGACTCGTG GCGCGGTGCG AACAATTACG AAACCTCGAT CTGAGCGGCA ACGATTGTTT
TGACGCCAAC GGCGCGGCCG AGCTTTCTAC ATATCTCGAA ACCATAGGGA AATCACTCGT
GATCTTGGAT CTGAGTGGTT GTTCTCTAGG AGATAACGGA TTGATTTGGT TGTGCGAAGC
GCCTCGAGGG CTGAAAACGC TGCGTCGCCT CGAGACTCTT CGTCTTGGTT CGAACGGAAT
CGGCGACTCG GCGATGCCAG CGCTGGCCAA TCTCTTCCAA AATGGTCACT TTCCTATGCT
AAAGTGTTGC GATCTGTCCA TGAACGTCAT CTCATGGCGC GGCACGTACG ATTTTACCGA
AGCCTTCGAC GTCGCCGCCG CCGCCGCCGG CGCGTCGCCT ACTCCCCTCG AGATCCTCAG
CCTCCGCGGA AACCACATCG GCGATGACGG CATCGACGCC ATCACCGATA TTCTCCCCAA
TCTCCCTCAC TTGCACACGC TCGACGTCTC CGACTGTGAC CTCACCGCCG TCGCCGTCCG
CCGTCTCGCG GAATCGTCCT CGTCTCGCAC GTTCGCCGCG ACGCTCTCCC GCAATCCCGG
CATCGACCGA ACCGCCGTGG CCGCTCTCGC GCGCGAATTC GATGATTTAG CTTTTGCTCG
CGGACTAGAT GGATTTTAA
 
Protein sequence
MSALSSVARA RASGTTVRTR RRSRRARGTR RATNEDDADA GFDVEIAEKP THWRVVDGFV 
SDFANDLRAV YDGRTRDPLR VDPERFVWDY WHVPNQYTLL RTPAEDYFGA ERYGELEKAL
LEYGRRELGC SAITPVWMSC YVHGMRQELH ADVPHGPWAF VLSLTKNDAE DGSYGAHFRG
GETQIMRPER LNYWKNFDAG EVVERSQIME TIAPTFGRLV AFDPRLPHGV TEVFGTQDPR
HGRLVLHGWF KDPEPSFSGA LTEEDASPTL EGALGELYGR LVELPRASGM VCAALTVAPD
GAVTNLRWTC DTLTPIPLPE LPSETDIRDA IMLDIASALL ELKFPARDAD TEMTPTASDG
ADDDIVDVHA FKTPRGVVRV RTRRPKLFPE DTSTAECDET GRVAWRALPA LCAYLASDEG
YSIVVERSRV LELGAGLGTP GMLCWLSGAA RETTLTDGNA DVASDLRRSI EMNRNFVDDN
VLTSLGSATA GTLEWGRGDA DDFARKTFPL VVASDVVYSE ASARDVLDVV HTKLERDNGM
LLLAYVSRWA HVDRALYEAI VAGGWTATLI PVADFDELAR KESLEGRPCL FEITRGGCGA
RGQIRARMPK RAREWFDAEL NVLKITPADA FTERFADDLC STLETHGGAI ESLEINAKGP
FRIHQDTLRC LLDVFAKHAS RVKLIRLKLC ETWLDVDGWR ALGDFLCESD VRDLEIIGED
VDEEILHAMT NSSGNVANSW VSRLKSLKFS RCDRLNAQAI VALRQSWFPP PALARELECF
EVSFCPIGDE GIKSICDIEF VGLRELRLAN VGVSAIGGAD VASRLVARCE QLRNLDLSGN
DCFDANGAAE LSTYLETIGK SLVILDLSGC SLGDNGLIWL CEAPRGLKTL RRLETLRLGS
NGIGDSAMPA LANLFQNGHF PMLKCCDLSM NVISWRGTYD FTEAFDVAAA AAGASPTPLE
ILSLRGNHIG DDGIDAITDI LPNLPHLHTL DVSDCDLTAV AVRRLAESSS SRTFAATLSR
NPGIDRTAVA ALAREFDDLA FARGLDGF