Gene OSTLU_31950 details


Gene Information

Locus tag: OSTLU_31950
Symbol:
ID: 5002597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 16303
End bp: 19902
Gene Length: 3600 bp
Protein Length: 1199 aa
Translation table:
GC content: 55%
IMG OID: 640418018
Product: predicted protein
Protein accession: XP_001418349
Protein GI: 145347799
COG category: [O] Posttranslational modification, protein turnover, chaperones; [Z] Cytoskeleton
COG ID: [COG5234] Beta-tubulin folding cofactor D
TIGRFAM ID:
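
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the stop codon. The record states neither convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Residues encoded by an unspliced CDS; a stop codon adds no residue."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


length = gene_length_bp(16303, 19902)  # Start bp / End bp from the record
residues = protein_length_aa(length)   # the gene is listed as not spliced
print(length, residues)  # 3600 1199
```

Under these assumptions the result agrees with the listed Gene Length (3600 bp) and Protein Length (1199 aa).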


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGCCA TCGTTCGCGC GTTGACTCGG CGAAGCGCGC GTGGACGCGA CGAAGAAGAC 
GACGCCGCGG ACGAAGACGA CGTGCGGGCG TTCGTGGGAG TGATTGAAAA GTATAGAGAA
CAGCCGACGG TGCTGGATCC GATGCTGGGG GGCGTGATCG AGCCTTTGAT GGACGCGGTG
GCGCGAGCGT CGACGGAGGC GAACGAAAAC GAAAACGAGA ACGCGAACGC GAAGGCGAAC
GCGAACGCGT GCTGCAGGGC GCTGGATGCG CTGTCGAGCG TGCGCGGATG GAAGACGTGC
GTGCGGTTCT ATCCGAACGC GGCGAAGTAT TTAGAGCCGG CGGTGCGTCT GCTGCGGGAA
GCGCGAGTGC GTGGAGACAA TACGTGGGAA ACGCAACGCG TGCTCACGAG TTGGTTGTCG
ATTTTGGCGT TGGCGCCGTT TGACTTGGTG TCGATCGATA GCGCGATCGA TCCGCATTCG
TCGCGTTCGA AAATTCCGAG CGTCGTGAGT GATTTGATGC GAGAGTGCAA GCACTTTTTA
GGCGATCCGT CCGCGGTGCG CGACGTGGCG GCGCAAACGC TCGCCAAGCT TCTCACGCGA
CCGGACATGA GCGAGGCGTT GCGCGAGTTC ATGACCTGGT CGAGCGCGAC GCTTCGTGGA
GACGTCAATG ATGAAAAGGA AAGAGAAATG ATTTTCTTAG TACCGGGTGT ACTGCGGGCG
CTCGCGGCGA TATACAAGAT TGGATCTCGA GAGCAGTTGC TGCCGTACGC GGAGGGAAAC
TGGGACGACG CGCAGTATTG CGCGACGCGG CTAAGTTTGG CGAAACGCTC GACCATGGTG
AGACAACTGA GTATCAAGCT CGCCAGTCGC GTGGGTTTGG TTTTTATGAA ACCTCGAGTG
GTGTCGTGGC GATATGATCG TGGTGCGCGG TGTTTGCAAG ATAACTTGAG CGGGGCGATG
CAAAAGCCGC CGACGAAGCA ACTCACCACG GCGGCGGACG AAGATGATAA ATGCGACGTG
CACATGGCTG TTGATGATAT TGTTGAAATA TGTCTCGTCG GTTTGCGAGA TGCGGAGACT
ATCGTACGCT GGACATCGGC GAAAGCGTTG GGGAGAATTA GCTCTCGACT CCCGCGTGAT
TTTGGTGACG AAGTCGTTGG AGCGGTTTTG GCGTGCTTAT CGGTTATCGA AAGCGATTCA
ACTTGGCACG GCGCATGTTT AGCTCTGGCT GAGCTCGCTC GACGTGGATT GTTGTTGCCG
AATAGATTGG TGGAGGCCGT ACCGCGATGC ATGGACGCTC TCATCTACGA CGTTCGACGA
GGAGCGCACT CAATCGGTGC GCACGTGCGA GATGCGGCAG CATATGTATG TTGGGCGTTT
GCGCGCGCAT ATGAACCGGG CGTTTTCGAA CCTTTTGTCG ACCAACTTGC ACCGAGGCTT
CTCATGATAT CGTGTTTCGA TCGTGAAGTT AATTGTCGCC GAGCCGCATC CGCTGCCTTT
CAAGAAGCCG TCGGACGGCT CGGCAAGTTT CCTCACGGCA TCGACATTGT CACCGTGGCG
GATTACTTTT CGCTTGGATC GCGAACCCGA GCTGCGTTGA CGGTGGCACC ATTCATCTGT
CAGTTTGAAG AATATAGGCG TTCGTTACTC GAGCACGTGT TGGACACGAA GCTCACGCAC
TGGGAACTTG CCACGCGGCA ACTCGCGACG AAAACAATCA GAGCTCTGGG TAATTTAGAC
CCGCAGTGGA TCGGTGACGT AGGCATAAAA ACAGTTCTGT CGCGCGCGAC GAGTTCTGAC
TTGTCGACGC GCCACGGCGC TGTGCTTTCG ATCGGTGAGA TGTTACTCGT GACACAGCGT
GCAAAAACAA AACTCGAAGA CGACTGTTTC GAACGAGTAG CCGATTTAGT TCAAAGTATG
GAAAGGGAGA AGATGTACAA AGGAAAAGGC GGCGAAATAA TGCGTGGCGC GACGTGTAGG
CTCATTGAGT GCGTGTTTTT GTGCTGCGAC GAGAATCACA AGATTGACTC GAAGGCGACA
GATGCCTTTG TCTACTTTGC GGAAGAGAGT CTGCAGTGCT GCAACGGAGA CGTACAGGCT
GCCGCCTCAG ACGCCATCGC CGCATTTACA GAGACGAATT ACGCTTCGCG TGGATCTCAC
CGTGCTCATT GCTTGCTATT GAGGCACGCT GAAATCGTCG TCAACGATCT CGTAGGTGTC
GTGCGGCGTG GATCGGCGCT CGTATTAGGC GGATTTCCTG TGACAAGTCT TCTCGCCGCG
AAAAATAGTG AAGATAAAAG TGCAACTCTG CGCGCGGTCA TCACGGCGCT GTCGGTAGCC
ACGAAACCGG AAGAAGATGT AGAAATGAGA GACGCCGAGA CGCGGGTGAA CGCAACGATC
AGTCTCTCGG AGCTGAGTGT GAAATTAATG TGTGCAGAGT GCCATGATAT CGATGACGAC
GACATCGCCT TTGTATCGGA CACTGCGATC GCCACTTTAC TCGGATGTTT GTGCGACTAC
AGCGTTGACA ACCGTGGCGA CGTCGGCTCG TGGGTGCGAG AATCGGCGAT GAAATGCTTC
CCTGTGCTTG TCGCCGCTTT GCAAATGCGC AATGCTTTAG CTGCAGATCA GTCGCAAAAC
ATCATGACAG CACTTCTCAA GCAAGCATTC GAAAAAATCG ACCGCATTCG ATGTCAAGCG
CTTGTGACGC TCGTGCAGCT CGTGCGTGGT GGTGACGCGA TTCGAGTTAG AATGCGAGTG
CAGGCCAAAC TAACAGTACA CGCGCTCTCT GGTGTGCCAG ACTACGATGT TTTACAATGT
TGCTTGCCAG CGACTGTCGA AACAGCGCCA GATGCTTCCC ATGTCTCGAC AATTTTCGCG
ACGTTAACTC CCGTTCTCGG CGCAGAGGCG TACGTCAACG CCGCGTTGAG CGGATGGTTT
CTGAGTTGCG GAAGTGTAGG CGACAGTCTG GTGCGCTTTT CGACCGATGC GTTGTTACGA
GCGATTAGGC GGTTTGAGGG CCTTCCAGAT ATTGTTGTGG CATCGATTAT ACAAGATCTG
TGTCAAAACA AGCACGTTGA TCGCGTTACG GTACCGGCTT TACGAGTGTG CGATGCCCTA
ATTTCGCACG GCGCGTTGGA TCAGGCGCAC ACGCACGCGA TTCAACTCAT CGAAGCTATT
CGTTGTGAGT GTTTCTCGAG CAGAGATATT TCAAAGCTCG TCACTGGGAG CGCATGCCTG
GCTCACTTCG TCGGTGCTGC TGATAGCGTC GTTCACGAAT CAGCATCGAT GGGGCTACTC
GCACTCATGG CGAACCGCTT CCCTCGCGTG CGTTGCGCAG CGGCGGAGCA TTTGTACATT
GCCCTCCTTG CTGTCGCCGA ACCGAGTCGG GGAACTGAAA ACGCAGCTGA AACCCTGTCG
TTGAATTCGT GGGACGCACC ACCGAGTGTC ATGAAAGAAA CACGCAAAAT AATTTATAGT
TTACTAGGAC TAGACCTCCC AGCTTTCATG CTGAAAGCTT CGGGAAAACT TCGGGACCGA
CGAGCGGATG AAAGAGAGAA CTCGACGTAT GCGTCGCTCG TTGGAGATAC CGGTTATTAG
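
The GC content reported for this gene can in principle be recomputed from the sequence as printed. The following is a minimal sketch that assumes the sequence is pasted as text in the 10-base groups shown above. How the database rounds its percentage is not stated, so an exact match with the listed value is not guaranteed. `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence only; paste the full sequence to
# compare against the GC content field of the record.
first_line = "ATGGATGCCA TCGTTCGCGC GTTGACTCGG CGAAGCGCGC GTGGACGCGA CGAAGAAGAC"
print(round(gc_percent(first_line), 1))
```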
 
Protein sequence
MDAIVRALTR RSARGRDEED DAADEDDVRA FVGVIEKYRE QPTVLDPMLG GVIEPLMDAV 
ARASTEANEN ENENANAKAN ANACCRALDA LSSVRGWKTC VRFYPNAAKY LEPAVRLLRE
ARVRGDNTWE TQRVLTSWLS ILALAPFDLV SIDSAIDPHS SRSKIPSVVS DLMRECKHFL
GDPSAVRDVA AQTLAKLLTR PDMSEALREF MTWSSATLRG DVNDEKEREM IFLVPGVLRA
LAAIYKIGSR EQLLPYAEGN WDDAQYCATR LSLAKRSTMV RQLSIKLASR VGLVFMKPRV
VSWRYDRGAR CLQDNLSGAM QKPPTKQLTT AADEDDKCDV HMAVDDIVEI CLVGLRDAET
IVRWTSAKAL GRISSRLPRD FGDEVVGAVL ACLSVIESDS TWHGACLALA ELARRGLLLP
NRLVEAVPRC MDALIYDVRR GAHSIGAHVR DAAAYVCWAF ARAYEPGVFE PFVDQLAPRL
LMISCFDREV NCRRAASAAF QEAVGRLGKF PHGIDIVTVA DYFSLGSRTR AALTVAPFIC
QFEEYRRSLL EHVLDTKLTH WELATRQLAT KTIRALGNLD PQWIGDVGIK TVLSRATSSD
LSTRHGAVLS IGEMLLVTQR AKTKLEDDCF ERVADLVQSM EREKMYKGKG GEIMRGATCR
LIECVFLCCD ENHKIDSKAT DAFVYFAEES LQCCNGDVQA AASDAIAAFT ETNYASRGSH
RAHCLLLRHA EIVVNDLVGV VRRGSALVLG GFPVTSLLAA KNSEDKSATL RAVITALSVA
TKPEEDVEMR DAETRVNATI SLSELSVKLM CAECHDIDDD DIAFVSDTAI ATLLGCLCDY
SVDNRGDVGS WVRESAMKCF PVLVAALQMR NALAADQSQN IMTALLKQAF EKIDRIRCQA
LVTLVQLVRG GDAIRVRMRV QAKLTVHALS GVPDYDVLQC CLPATVETAP DASHVSTIFA
TLTPVLGAEA YVNAALSGWF LSCGSVGDSL VRFSTDALLR AIRRFEGLPD IVVASIIQDL
CQNKHVDRVT VPALRVCDAL ISHGALDQAH THAIQLIEAI RCECFSSRDI SKLVTGSACL
AHFVGAADSV VHESASMGLL ALMANRFPRV RCAAAEHLYI ALLAVAEPSR GTENAAETLS
LNSWDAPPSV MKETRKIIYS LLGLDLPAFM LKASGKLRDR RADERENSTY ASLVGDTGY
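
The protein sequence can be checked against the gene sequence by translating codon by codon. The record leaves the Translation table field empty, so the sketch below assumes the standard genetic code, which may not be what the database used. `translate` is an illustrative helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons in TCAG order (an assumption, see above).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGGATGCCA TCGTTCGCGC G"))  # MDAIVRA (start of the gene)
print(translate("GGTTATTAG"))                # GY (last two codons, then stop)
```

Both fragments agree with the first and last residues of the listed protein sequence. Translating the full gene sequence the same way allows a complete comparison.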