Gene OSTLU_88208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_88208 
Symbol 
ID5003656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp262185 
End bp263615 
Gene Length1431 bp 
Protein Length476 aa 
Translation table 
GC content65% 
IMG OID640419077 
Productpredicted protein 
Protein accessionXP_001419539 
Protein GI145350277 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0321312 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.112837 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATCCG CGTCCGCGCG CGTCGCGCGA TCCGCGACGC GCGGCGCGCG GTGTCGACGC 
CGACGCGCGC GCGTCGCGCG ATCGACGCGC GCCGCGAGGC CGACGGCGCC GTCGATCGCG
TTCGTCGGGT GGGAGTGGAT CGAACCGTCG TCCTCGGCGG CGGGCGCGCG GTCCGTCGCG
CTCGTCGCGG AGGCGCTGCG GCGAGGGTGG CGCGTGACGT GCGTCGCCAG CGCGGAGGCG
CGCGCGGGGC AGGAGGACGC GCTCCGCGCG ATGGGCGCCG AGACGCGCCG GATGAAGGCG
AATCGAGGGG ATGAATTGCG CGCGCTCGTC GAGGCGGCGA GACCGGATTG CGTCGTGTTC
GATAGGTTCC TCGCGGAGGA GGCGTACGCG GGGAGGCTGC GGGAGATCGC GCCGAGCGTG
GTGCGAGTGC TCGACATGCA GGACGCGCAC GCGCTGCGAA GGGCGAGGGA ACGCGCGGCG
AAGGCGGCGA AGACGCGCGG CGAAGACGCG GCGAAGGAGG TTTTGAAGAG CGCGCCGAAC
GCGAGGGACG AGGATTTGAT GCGCGAGATC GCGAGCGTGC AGAGGAGCGA TTTAACTCTG
GTGTGCTCGC CCGTGGAAAA GGAGTGGTTG ACGCGCGAGT GCGGCGTGGC GGAGAGGAAA
TTACGCTTGG CGTCCTTCTT CGTCGATTCC GTCGACGCCG AGGCGATGAA GAGGGATTTT
AGCGCGCGGA AGGATTTCGC CACCATCGGT ACGTTCATGC ATAAACCAAA CGTAGATTCT
GTAGAGTGGT TGTGCGAAGA GGTGTGGCCG TTGGTGCGCA AACAGCTCCC CAACGCGACG
ATGCGCGTGT ACGGATCTTA CGCCACCGAA AACCATCGTC GACGATTTCA CAAACCAGCG
CAAGGATTCT TGTTCGAAGG TTTCGCGGAG GACTTGGGCG AGACGTTGCG CGCGCATCGC
GTGTTGCTCG CCCCGCTTCG GTTCGGCGCC GGTATCAAAG GCAAGATTCT CGACGCGTGG
CGATACGGAT TACCCGCGTG CACGACGCCC ATCGGCTCCG AGGGATGCGT GCCGGACGTC
GTCGAGTTTT GGTCGCCCAC TTCGAGCGCG CCGATAGACC CCGAGCACGG GTGGGGCGGG
TTTGGCGATT TCACAAACCC GCAAGATATC GCCGACGCCG CCGTTCGTCT TCACGAAGAC
GAAAATTTGT GGCGTCTCGC TCGCGGAAAC GGCGCCGACC TCCTCGATCG CCTCTTCTCC
GCCCGCGTCA ACTTACCCGA CGTCTTCGAC GCCATCGAGC GCGTCATCGA CGATGTCGAT
GCCGTTCGAG ACGCGGATTA TTTCGGCCAG TGCTTATGGC GAGAAGACGT GCGTTCGACG
ACGTACTTTT CCAAATGGAT CGAAGCGAAA GAAACCGGCG CGACGAATTA G
 
Protein sequence
MRSASARVAR SATRGARCRR RRARVARSTR AARPTAPSIA FVGWEWIEPS SSAAGARSVA 
LVAEALRRGW RVTCVASAEA RAGQEDALRA MGAETRRMKA NRGDELRALV EAARPDCVVF
DRFLAEEAYA GRLREIAPSV VRVLDMQDAH ALRRARERAA KAAKTRGEDA AKEVLKSAPN
ARDEDLMREI ASVQRSDLTL VCSPVEKEWL TRECGVAERK LRLASFFVDS VDAEAMKRDF
SARKDFATIG TFMHKPNVDS VEWLCEEVWP LVRKQLPNAT MRVYGSYATE NHRRRFHKPA
QGFLFEGFAE DLGETLRAHR VLLAPLRFGA GIKGKILDAW RYGLPACTTP IGSEGCVPDV
VEFWSPTSSA PIDPEHGWGG FGDFTNPQDI ADAAVRLHED ENLWRLARGN GADLLDRLFS
ARVNLPDVFD AIERVIDDVD AVRDADYFGQ CLWREDVRST TYFSKWIEAK ETGATN