Gene OSTLU_93816 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_93816 
Symbol 
ID5005793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp394048 
End bp396090 
Gene Length2043 bp 
Protein Length680 aa 
Translation table 
GC content61% 
IMG OID640421214 
Productpredicted protein 
Protein accessionXP_001421648 
Protein GI145354767 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.145875 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACGG CGATCGGGGC GGGCGGGAGG ACGTGGGCGC GGGCGACGAG CGCGCTGTGT 
CGAACGGCGG CGCGGTTGGA CGCGCGCGAT ACGGAGGATG GGAAACGGTA CCTCGACGAA
TCGTTGCGGC TGCTCGAGAC GAGGGAGAGC GAGCGCGGGA GTGGGGACGC GGACGCGCGC
GCGAGAGCGG CGATGTGCGC GCGATTGAGC GGCGCGCGGG TGTTGGGATC GATGGGCTCG
GCGGCGAAGG AGAGCGCGAG AGGCGCGTTC GACGAGACGG GCGCGCTTCG AGGCGAGTTC
TTCGCGCGCG CCGTTCCCGC GCTGTTGGAA CATTCGTATA AAGATAAAGC GGGAAGCGTG
CGGCGGGCGT GCGCGGAGGC GCTCGGGCGC GTGGGAGGGT TGAAGACGAC GGACGTTCGC
CAGCTCGGAC GCGCGTTGGG CGATAAGCGC GACGACGTGC GAACGGCGGC GACGCGGTCA
GTCGGCGCGC AGGGTGGAGT GGCGAAGTTT TTAGTGCCCA AACTCGTCGA GAACGCGAGG
CGCGATGATC CAGAGTTGCG TCAAACCACG TGCGAGGCGC TGGAGAGCGT GTGGAAAGGA
CGCCAAGGCG ATCCCGCGCA AGACGGTTCG GAGGACGTTC TTTTGTTTCA AACCGCGGAG
TGTATGGGCG AACTCACGAA TGATGCAGAC GCTGGCGTGC GCGCGGCGGC GACGAGGTGC
TTAGGTCACC TTCGAGGCGC CGCGGCGGCG AAAGTGAGCT TGATTAACAA GCGCGTAGAC
GATAGCGATG AAGGCGTGCG AGACGCCGCG GCGGAGACTT TAATCAAGCT CGGATTCATC
AATCCGAGCA CGGGCACGCT ACGAAACAAG GGCAACTACC GCACGTCAAA GTTTTCGGCG
AATAAAAACT TATTCGCCGT CCTGACTGGC AAGCAAGATT CCCTCGCGAC GACGTCGCGC
GTCAACGTTC CCGCGGGAGC CATCGTGCGC GTGTGGTGGC CTCTGGATGA GTGCTACTAC
GAAGCAAAGG TCAAGGGATA CGACAAGTCA ACGCGCAAGT ACAGGCTTCT ATACCTAGAC
GACAACGTAG AAGAAGAGGT GAACTTTAGG AAGGAAAAGG TTGACTTGAA GCACAAGCCG
ACGAAGAACG CCAGGGCGAC GTGGATCCCG TGCGGCGCGG TGCAGCGCAA GCCAAAGTCG
AAGGACAAGG ATCAACCAAA ACACAAAATA GCGCGAAAGC GATCGTCGGC GACGAGCGCG
TGGGTGGATT ACGACGCCGA AGAGCAAAAA CGGATCGGTC GCGAAGCGCT CGTCGGTCGT
CGAATGAAAG TTTGGTGGCC AGACGACAAG GCGTGGTACG CCGGCGAGAT TCGTCGCTTC
AGCGCCGACA CGGGCAAGTA TACGGTGTTT TACTTCGAGG ACGGCGAAGA GGAAGATTTA
GATTTCGACA AAGAAGAGCG CGCGCCAAAG ATTTACGAAC CTGTTCAAGA CGAATCCGCG
ACGCCTACGA TTCACGGTTT GGAACCGCGG CGACTGCCCG TTCGCTCTGG CGAAAAGGAC
GCGCTGTACG AACCGGGATG TTTCCGCGGG AGCGAGTGCT GCAGCTTCGT GAACGAAGAG
GGCGAGAAAA CAACAATGTT GTGCTCGGAC TTTGACAAAA TGTACGGCGG CGGAAGTCGC
TGGCGACGTT CCATCGTCGT GACTTCCGAA ACCGGCGCCG AGCCGATTGA CACCTTCTTC
CGCGTCAACG GCGAGCGCTG GGGGAACGCC GTCTTAGGTT ATGAGTTCGA TATGGACGTC
GCCGTCGACA TGCGAGACTT TCCCGTCGAT CGACAGCCCG TCGAACCGTC TTGGCAACGC
GTGAAAATCA TCTCGTACAA CCCGACGAGC GGCGAGCACC AATGCGTCGA CATCAAGGAC
GACGGCGAGC CCGATCCCAG TCGCGCCGTG TGGCTTCCGC TCTGCATGCA ACGCACGCGC
GCGCGCGCCG CCGCCGACGA CGACGACGGT GAAGATTCGA TTTCTGACAT CATTAGTTCA
TAG
 
Protein sequence
METAIGAGGR TWARATSALC RTAARLDARD TEDGKRYLDE SLRLLETRES ERGSGDADAR 
ARAAMCARLS GARVLGSMGS AAKESARGAF DETGALRGEF FARAVPALLE HSYKDKAGSV
RRACAEALGR VGGLKTTDVR QLGRALGDKR DDVRTAATRS VGAQGGVAKF LVPKLVENAR
RDDPELRQTT CEALESVWKG RQGDPAQDGS EDVLLFQTAE CMGELTNDAD AGVRAAATRC
LGHLRGAAAA KVSLINKRVD DSDEGVRDAA AETLIKLGFI NPSTGTLRNK GNYRTSKFSA
NKNLFAVLTG KQDSLATTSR VNVPAGAIVR VWWPLDECYY EAKVKGYDKS TRKYRLLYLD
DNVEEEVNFR KEKVDLKHKP TKNARATWIP CGAVQRKPKS KDKDQPKHKI ARKRSSATSA
WVDYDAEEQK RIGREALVGR RMKVWWPDDK AWYAGEIRRF SADTGKYTVF YFEDGEEEDL
DFDKEERAPK IYEPVQDESA TPTIHGLEPR RLPVRSGEKD ALYEPGCFRG SECCSFVNEE
GEKTTMLCSD FDKMYGGGSR WRRSIVVTSE TGAEPIDTFF RVNGERWGNA VLGYEFDMDV
AVDMRDFPVD RQPVEPSWQR VKIISYNPTS GEHQCVDIKD DGEPDPSRAV WLPLCMQRTR
ARAAADDDDG EDSISDIISS