Gene OSTLU_49043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_49043 
Symbol 
ID5000865 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp255025 
End bp257014 
Gene Length1990 bp 
Protein Length536 aa 
Translation table 
GC content55% 
IMG OID640416286 
Productpredicted protein 
Protein accessionXP_001416598 
Protein GI145344145 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02343] T-complex protein 1, epsilon subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0141345 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGGCGGCCCG CGACGGAACC ATGTCGCTGG CGTTCGATGA GTTCGGGCGA CCGTTCATCA 
TCATCAAGGT GCGAGGGCGA CGAACGAGCC GTCGCGGTAT CCGCGCGCGC GGCGGACGAA
CGAACGGACG CGCGCGCGCG ACGTGCGCGC GCGATGGTGA AGGACTCGCG CGATGCGATC
GGGACACTGA CGCTGGAACG CGTTTGTCGT TTACGCGATG CAGGAACAAG GAGCGAAAAC
GCGCGTGCGC GGGATCGATG CTCAGAAAGC GAACATCGCG GCGGCGAAGA GCGTCGCTCG
AACGCTTCGG TCTTCGCTCG GGCCAAAGGT GAGATGAAGA CGCGGCGACG CGCGCGACGT
TTGACTTTTT CGTAAAGCGA AGCGAGACGC GATTCATTCG CGCGGCGACT GACGAGAGGC
GACGCGATGT CGCGAAATAT CGTAGGGTAT GGATAAGATT TTACAGTCTG GCGACGGCGA
CATCACGATA AGTGCGTGTG CGATTTCGAA ACTCGAGGCG AAGCGCTCGC GAGGTTTGAC
TTTCGAACGC GATGACTGAC GGTCATTTTA TCTCGCAGCA AATGATGGAG CGACTATTTT
GGATCAAATG GAAGTCGAAC ATGAGATCGG TAAGCTCATG GTTGAGCTGT CCAAGTCGCA
GGACTACGAG ATCGGTGATG GGACGACCGG CGTGGTGGTT TTAGCGGGCG CGCTGTTGGA
GCAAGCCGAG TCACTTCTTG ACCGCGGTAT CCATCCCCTT CGAATCGCGG AAGGCTACGA
GATGGCGTCC AAGGTGGCGA CAAAGGAGCT GGCAAGAATC AGCGAAAAGT TCGAATTCGA
TGCGGAGAAT ATCGAACCGT TAATTCAAAC GTGTATGACG ACGCTGAGTA GTAAGATTGT
GAACCGGTGC AAGCGCGAAA TGGCAGAGAT TTGCGTGAAG GCTGTGATGG CGGTGGCCGA
CTTAGAACGT AAGGATGTGA ACTTAGACTT GATTAAGGTT GAAGGCAAGG TGGGCGGCAA
GCTAGAAGAT ACCATGTTGG TGAACGGTAT CGTGTTGGAC AAGGACATCA GCCACCCGCA
GATGGCGAAG GAGATTAAGG ACGCAAAGAT CGCCATCTTG ACTTGCCCTT TTGAGCCGCC
AAAGCCGAAG ACGAAGCACA AGATTGAAAT CGACACGGCG GAAAAGTACG AGGAGCTCCG
TCAGCAAGAA GAAAAGTACT TTAACGACAT GGTGAAGCAA TGCAAGGACT GTGGCGCGAC
GCTCGTCATT TGCCAGTGGG GTTTTGATGA CGAGGCAAAT TCGATGCTCA TGCAGCAAAA
GCTTCCCGCC ATTCGCTGGG TCGGTGGTGT CGAGCTTGAG CTTTTGGCTA TCGCCACTGG
CGGTCGTATC GTACCGCGAT TCACGGAGTT AACGCCAGAG AAACTCGGTT CAGCGGGCAT
GGTGAAGGAG GTTTCGTTTG GAACGACTAA GGAACGCATG GTAATCATTG AAGACTGCGC
CGCGAGCAAA GCGGTCACAG TCTTCGTGCG CGGCGGCAAC AAAATGATGG TTGATGAAAC
GAAGCGTTCT CTGCATGACG CGATCTGTGT TGCTCGCAAC TTGGTTCGAT CGAACAACAT
TGTGTACGGC GGCGGTTCTG CTGAAGTTGC GTGCGCAATC GCCGTCGAGG AGGAAGCCGA
CAAGATTCCA AGCGTCGAGC AATACGCCAT GCGCGCCTTC GCGGACGCTC TTGACGCTGT
TCCGAACGCT TTGGCCGAAA ACAGCGGCCT TCCTCCGATC GAGAGCGTGG CCACGATCAA
GGCGCAGCAA TTGAAGGACA AGAATCCGTT CCTCGGCGTC GACTGCAAAG AAATTGGCAC
CAACGACATG AAGTCTCAGG GCGTGTTCGA GACTCTCATC GGCAAGCAGC AACAAATTTT
GCTCGCCACG CAAGTCGTCA AGCTCATTCT CAAGATTGAC GACGTAATCT TAGCGGGAGA
AGGGCAGTAG
 
Protein sequence
MSLAFDEFGR PFIIIKEQGA KTRVRGIDAQ KANIAAAKSV ARTLRSSLGP KGMDKILQSG 
DGDITITNDG ATILDQMEVE HEIGKLMVEL SKSQDYEIGD GTTGVVVLAG ALLEQAESLL
DRGIHPLRIA EGYEMASKVA TKELARISEK FEFDAENIEP LIQTCMTTLS SKIVNRCKRE
MAEICVKAVM AVADLERKDV NLDLIKVEGK VGGKLEDTML VNGIVLDKDI SHPQMAKEIK
DAKIAILTCP FEPPKPKTKH KIEIDTAEKY EELRQQEEKY FNDMVKQCKD CGATLVICQW
GFDDEANSML MQQKLPAIRW VGGVELELLA IATGGRIVPR FTELTPEKLG SAGMVKEVSF
GTTKERMVII EDCAASKAVT VFVRGGNKMM VDETKRSLHD AICVARNLVR SNNIVYGGGS
AEVACAIAVE EEADKIPSVE QYAMRAFADA LDAVPNALAE NSGLPPIESV ATIKAQQLKD
KNPFLGVDCK EIGTNDMKSQ GVFETLIGKQ QQILLATQVV KLILKIDDVI LAGEGQ