Gene OSTLU_34049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_34049 
Symbol 
ID5000580 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp507159 
End bp509088 
Gene Length1930 bp 
Protein Length542 aa 
Translation table 
GC content57% 
IMG OID640416001 
Productpredicted protein 
Protein accessionXP_001416968 
Protein GI145344912 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02340] T-complex protein 1, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.013996 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.104894 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGAC AATCCGACGT CTTCTTCGGC GAGCGCGAGA GCGGACAGGA CGTGCGACAG 
ACGAACGGTG CGGCGCGCGG ACGACGACGA GCGACGAAGC GCGAGCGAAC GAGGGACGCG
ATCTCGAACG CGATGGGACG AGGACGACGA TCGACGATCG ACGATCGACG GGACGCGCGA
CTTCGAGCGG GGAGACGGAG ACGGAGACGG AGACGGAGAC TGACGACGAC GCGCGCGCGA
CGCGGGATGC GAACAGCGAC GGCGGTCATG GCGGTGGCGA ACATCGTGAA GACGTCGCTC
GGGCCGGTGG GACTGGACAA GGCGCGTGAC GAAGACGCGA AGGCGTTGAC TTTGAGGACG
AGACGCGGAG AATCGACGAA ACGATGCGCG CGCGCGGTGA GGGACGAGGA CTGACGGAAT
GAAAGGGTTT GACGCGTAGA TGCTCGTGGA TGATATCGGT GACGTGACGA TCACGAACGA
TGGGGCGACG ATTTTGAAAC TTTTAGAGAT TGAACACCCG GCGGCGAAGA TCTTGGTCGA
GCTCGCCGAG TTGCAGGATC AGGAAGTGGG AGATGGAACG ACTTCGGTGG TGATTCTCGC
CGCGGAGCTT TTGAAGCGGG CGAATGAGTT GGTGAGGAAT AAGATTCACC CGACGAACAT
CATCGCTGGG TTCAGGTTGG CGATGCGAGA AAGCGTCAAG TACGTCGAAG GTAAGTTGGC
GAGAGACGTG GAGACGCTCG GGAAGGAAGC CTTGTTGCAG TGCGCAAAGA CGAGCATGAG
CTCGAAAATC ATCGGTGCCG AAGAAGATTT CTTTGCGGAT TTGGTCGTCG ATGCGTGCAC
GAGCATCAAG ACGTACAACG ACATGGGCGA CGTCAGGTAT CCCATCAAGG CGATCAACAT
TTTGAAGGCG CACGGGAAGA GCTTGAAGGA GTCATCGGTG TTGCACGGAT ACGCCCTTAA
CCTCGGTCGT GCGGCGGAAG GGATGCCAAA GTTAGTCAAG AATGCAAAGA TTGCGTGCAT
CGACTTCAAC TTGCAGAAGA CAAAGATGTT GATGGGGATT CAAGTGCTGG TGAACGACCC
GAAGGAACTC GAAAAGATTC GCGAGCAAGA GTTTGAAATC ACCGCCAATC GCATCAAGAT
GATCTTAGCC GCCGGTGCCA ATGTCGTGCT CTGTTCTAAG GGCATCGATG ATATGGCGCT
CAAGTACTTC GTCGAGGCTG GGGCTATCGC CTGTCGTCGC GTCAATCGTG ATGATTTGCG
CCGCATCGCC AAGGCGACGG GGGCGCAAGT GATGCTGTCT CTGTCCGACA TGGATGGTGG
AGAAACTTTC GACGAGTCCA TGCTTGGCAC TGCGGGCGAA GTGGTGGAGC AACGCGTGGC
TGATGATGAT ATGGTCGTCA TCAAGGACTG CGCGAGCACC CAATCGTGTA CAATTCTCTT
GCGAGGCGCA AACGATTATA TGCTCGACGA GATCGACCGC TCGGTGCACG ACGCACTGTG
CATCGTGAAG AGGACGTTGG AAAGTGGCAA GGTTGTCGCT GGTGGTGGCG CCGTCGAAGC
TGCGTTGAGC ATTTATCTGG AGAATATGGC GACTACTCTG GGTAGCCGGG AGCAGCTCGC
CATCGCCGAG TTTGCCAACG CGCTCTTGGT CATCCCAAAG GTACTCTCTG TCAACGCTGC
GAAGGATTCC ACCGATCTCG TGGCCAAGCT TCGAGCTATT CATCATCAAG CACAGAGTCA
AGGTAACGAA GAGCTCGCCG GGATGGGCTT GGATCTCGTC AAGGGCGAAC TTCGCGACAA
CATCGCCAGC GGTGTCCTCG AGCCGGCGTT GAGCAAGGTG AAGAGCATCC AGTTTGCCAC
TGAAGCTGCG ATTACGATTC TCCGCATCGA CGACTTGATT CAACTCGAAC CCGAACAGGA
GGGTCAGTAG
 
Protein sequence
MRRQSDVFFG ERESGQDVRQ TNATAVMAVA NIVKTSLGPV GLDKARMLVD DIGDVTITND 
GATILKLLEI EHPAAKILVE LAELQDQEVG DGTTSVVILA AELLKRANEL VRNKIHPTNI
IAGFRLAMRE SVKYVEGKLA RDVETLGKEA LLQCAKTSMS SKIIGAEEDF FADLVVDACT
SIKTYNDMGD VRYPIKAINI LKAHGKSLKE SSVLHGYALN LGRAAEGMPK LVKNAKIACI
DFNLQKTKML MGIQVLVNDP KELEKIREQE FEITANRIKM ILAAGANVVL CSKGIDDMAL
KYFVEAGAIA CRRVNRDDLR RIAKATGAQV MLSLSDMDGG ETFDESMLGT AGEVVEQRVA
DDDMVVIKDC ASTQSCTILL RGANDYMLDE IDRSVHDALC IVKRTLESGK VVAGGGAVEA
ALSIYLENMA TTLGSREQLA IAEFANALLV IPKVLSVNAA KDSTDLVAKL RAIHHQAQSQ
GNEELAGMGL DLVKGELRDN IASGVLEPAL SKVKSIQFAT EAAITILRID DLIQLEPEQE
GQ