Gene OSTLU_19811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_19811 
Symbol 
ID5004940 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp509495 
End bp511516 
Gene Length2022 bp 
Protein Length553 aa 
Translation table 
GC content57% 
IMG OID640420361 
Productpredicted protein 
Protein accessionXP_001421169 
Protein GI145353753 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02344] T-complex protein 1, gamma subunit 


Plasmid Coverage information

Num covering plasmid clones56 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCAAC AGGGCGGTCA AGGGCAACAA ATCATGGTGA TGAACACCAA CGCCAAGCGC 
GAGACGGGTT CGATCGCGCG CGCGAACAAC ATCGCGGCGG CCAAGGCGGT GAGCGACATC
ATCCGCACGA CGCTCGGGCC GAGGTCGATG TTGAAGATGA TTTTAGACGC TTCGGGAGGT
ACGACGACGA CGACGACGAC GACGACGACG ACGACGCATA TCGCGTCTTC GCGCGAGTCC
ACGCGTCTCG CGCGCGGCGC GCGCGCGATC GATGAAACGC GGAGCGACAG AGTCAACGCG
GCGAAAAGTC AACGCGAGCG AACGAACGAA CGAACGCGGT TCGACGGTGA CGGGACGCAC
GAACGCGGCT GAACGAGGAC GCGAGCGTTG ACGCGAGCGT GATCAGCGCT CGCTGATGGT
CGAACGTCCG AGCGCGATGG CGATCGGCGT CGGATGGTGA TTGGATGTTT TGGACGGGTG
ACTGACCTCG CGCGCGTTTT GCTCAAACGC AGGCATCGTG TTGACGAATG ACGGGAATGC
GATTTTGAGA GAGATCGACG TCACGCATCC GGCGGCGAAG AGTATGATTG AACTGTCGCG
CACGCAAGAC GAAGAAACCG GGGACGGGAC GACGTCGGTG ATTATTCTGG CTGGTGAAAT
CCTTCATCTG TCGCAACAGT TTTTGGAAAA GAACATTCAC CCGACAGTCA TCGTGAGGGC
GTACATGAAG GCGTTGGACG CCGCATTGAA AGTGATTGAT TCAATTAGTT TTCCCATCGA
TGTCACGAAT CGTGATGAGA TGATGAAGAT CGTCAAGAGC TCTGTCGGCA CCAAGTTTAC
CGCGCGCATG GGCGAACTCA TTCCAAACTT GGCGCTCGAC GCGGTCATGT GCGTGGCTCG
CAAAAACGCC GACGGCACGA ACGACATAGA CATTAAAAAG TACGCCAAGG TGGAGAAGAT
TGCGGGTGGT TCTATCGATG ACTGCACGGT GCTACGAGGA GTGATGATGA ACAAAGACGT
CGTCGCGCCT GGACGAATGA AGCGACGAAT TGAAAACCCG AGAATCATGC TTCTCGACTG
CCCATTGGAG TACAAAAAGG GCGAGAACCA AACCAATGTG GAAATCACCA AGGAAGAAGA
TTGGGCGGTG TTGCTGAAGA TGGAGGAGGA TTGGATCAAA GAGACGTGCG CGAAGATCGC
GGCTTTCAAG CCCGATGTCG TCGTCACCGA AAAGGGATGC AGTGATTTGG CTTGCCACTA
CCTTTCCAAG GCGGGCATCA CCGCATTGCG ACGCGTCCGC AAGACCGATA ATAATCGCAT
CGCGCGAGCC GCGGGGGCCA CCGTCGTTTC TCGCGTGGAT GAGCTTCGTG AATCAGACAT
CGGCACCGGT GCTGGTTTGT TCAACGTCGA GAAAATCGGT GACGAATATT TCACCTTCGT
CGTCGACTGC AAAGAGCCCA AGGCGTGCAC CGTCGTCCTT CGCGGCGCGA GCAAGGATAT
TTTGAATGAA ATTGAACGCA ACTTGATTGA CGCGATGGGC GTAGCGAGGA ATGTCGTCCA
AGACCCGCGA CTGTTGCCCG GTGGCGGTGC GGTCGAAATG GCTGTCTCTC GCGCGATCGC
GGAAGAAGCG ACGAAGATCG AGGGTGTGGA GCAATGGCCG TTCCGCGCCA TCGGCGCGGC
ACTCGAGGTC ATTCCGCGAA CGCTCGCGCA AAACTGCGGC GCCAACGTCA TTCGCACTCT
CACCAAACTT CGCGCCAAAC ACGCCGAGGG CGAAGAAGCG CGAACGTTCG GTATTGATGG
CGATAAGGGC ACGATCGTAG ACATGAAAGA GCTCGGTGTG TGGGATCCGT ACGCGGTCAA
GGTGCAATCC ATCAAAACCG CCGTCGAGAG CGCCACGATG CTATTGCGCA TTGATGACAT
CGTCTCCGGG CTTTCCCAAA AGAACTCCGA CGCGGCGGGC ACGGGCACGG GCGTCAGCGC
GGGCGACGAC GACTAATCGT CGAGCGTAGT ACACTGCGGC GG
 
Protein sequence
MMQQGGQGQQ IMVMNTNAKR ETGSIARANN IAAAKAVSDI IRTTLGPRSM LKMILDASGG 
IVLTNDGNAI LREIDVTHPA AKSMIELSRT QDEETGDGTT SVIILAGEIL HLSQQFLEKN
IHPTVIVRAY MKALDAALKV IDSISFPIDV TNRDEMMKIV KSSVGTKFTA RMGELIPNLA
LDAVMCVARK NADGTNDIDI KKYAKVEKIA GGSIDDCTVL RGVMMNKDVV APGRMKRRIE
NPRIMLLDCP LEYKKGENQT NVEITKEEDW AVLLKMEEDW IKETCAKIAA FKPDVVVTEK
GCSDLACHYL SKAGITALRR VRKTDNNRIA RAAGATVVSR VDELRESDIG TGAGLFNVEK
IGDEYFTFVV DCKEPKACTV VLRGASKDIL NEIERNLIDA MGVARNVVQD PRLLPGGGAV
EMAVSRAIAE EATKIEGVEQ WPFRAIGAAL EVIPRTLAQN CGANVIRTLT KLRAKHAEGE
EARTFGIDGD KGTIVDMKEL GVWDPYAVKV QSIKTAVESA TMLLRIDDIV SGLSQKNSDA
AGTGTGVSAG DDD