Gene OSTLU_19801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_19801 
Symbol 
ID5005192 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp361119 
End bp362950 
Gene Length1832 bp 
Protein Length534 aa 
Translation table 
GC content56% 
IMG OID640420613 
Productpredicted protein 
Protein accessionXP_001421126 
Protein GI145353663 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02347] T-complex protein 1, zeta subunit 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.289192 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0151262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTCA AGTCCATCAA CCCCAACGCC GAGCTCATGA ACGCCGATGC CGCGCTGTTC 
ATGAACATCA ACGCCGCCAA GGGTCTCCAA GACGTGATGA AGACGAATTT AGGCCCGAAA
GGGACGATCA AGATGCTCGT CGACGGTGCT GGAGGTGCGT GTACGATCGT TTCGTCGATA
CATCGCGGTT TCGCGCGTCA CGGCGGCGCG TGAGAGATGC GCGGTGACGG TGATCTGTTG
GATTTCGCGC GAACGCGGCG CGGCGTTGAC TCTGTTGACC GAACGACTGA CGGCGATGTT
GAAATCGATG TGCGTAGGTT TGAAGCTCAC GAAGGATGGG AACGTGCTGC TGAGAGAGAT
GCAGATTCAA AACCCGACGG CGATCATGAT CGCGAGGACG GCGGTGGCGC AGGATGATAT
CACCGGGGAC GGGACGACGA CGACGGTGCT CGTCATCGGA GAGTTGTTGA AGCAAGCGGA
GCGGTATTTG AATGAGGGTT TGCATCCGAG AGTGATCGTG GAGGGATTCG ATGTGGCGAA
ACGCGAGTCG CTGAAGTTTT TGGAAACGTT CAAGCGCGCG GCGCCGGGTG TGGAAGCGCC
GGATCGAGAA ATGTTGTTGT GCGTGGCGCG CACGGCGTTG CGAACAAAGT TGCGGGAAGA
GTTGGCTGAT AAGTTGACGA CGATCGTCGT CGACGCTGTG TTGTGCATCG CCAAACCGGA
AGAGTCGATC GATTTGCACA TGGTGGAGAT CATGACCATG AAGCATCAGA CGGATGACGA
AACCAAATTG ATTCAGGGTT TGGTCCTCGA CCACGGGGCG AGACATCCGG ATATGAAGAG
ATACGTGGAG GACGCGTTCG TGTTGACGTG TAACATCAGC CTGGAATACG AACGATCGGA
GGTGAACTCG ACGTTCATGT ACACCGATGC GGAACAGCGC GAGAAGATGG TCGCCGCCGA
GCGCGCGTAC ACCGATGAAA CCGTGCGCAA GGTGATCGCG CTCAAGAAGC AGGTGTGCGA
CGGCAACGAC AAGGGTTTCG TCGTCATCAC ACAAAAGGGG ATCGACCCAA TCTCTCTGGA
CATGCTCTGT AAGGAAGGCA TTATGGGACT TCGTCGCGCC AAGCGCCGAA ACATGGAGCG
TCTCGTCCTC GCATGTGGTG GTCAATGCAT CAACTCTGTT GAGGAACTCT CGCCGGAAAT
TTTGGGCCAC GCCGGCGAAG TGTACGAGTA CGTCTTGGGC GAGGAAAAGT ACACTTTTGT
GGAAAAAGTC GTCAACCCGA CGTCGTGCAC GGTACTTTTA AAAGGCTCGA ACGATCACAC
CATCGCACAG CTCAAGGATG CGGTGCGAGA TGGCTTGCGG GCGGTGAAGA ATGTGTTGAC
CGACAAGGCC GTGGTTCCGG GCGCGGGTGC GTTTGAGATG GCCTTGAACA AGCACTTGAA
GGAGAACGTC ACCAAGATGG TCGAAGGTCG CGCGAAACGC GGCGTCGAGG CGTTTGCGGA
AGCCATGCTC GTGGTCCCGA AGACACTCGC GGAGAACAGC GGTTACGATC CGCAAGACGC
CATCATCGAC ATGCAAGAGG AGCACGACAG AGGCAACGTC GTCGGTTTCG ATATCAGCAT
CGGCGAGCCC TTCGACCCCA CCATGAGTGG TATCTACGAC AACTTTCTCG TCAAACAGCA
AATTCTGCAC TCTGCGCCCA TCATCGCCAC GCAGTTACTC TGCACGGATG AGGTTCTCAG
AGCGGGCGTG AACATGCGCA AGCGATGATC GACGCGAGTC AGAAACTTAT AGAATTTAGT
GGTGGTGCGT ACGCACGTAC GTAGTAGCAG CG
 
Protein sequence
MSLKSINPNA ELMNADAALF MNINAAKGLQ DVMKTNLGPK GTIKMLVDGA GGLKLTKDGN 
VLLREMQIQN PTAIMIARTA VAQDDITGDG TTTTVLVIGE LLKQAERYLN EGLHPRVIVE
GFDVAKRESL KFLETFKRAA PGVEAPDREM LLCVARTALR TKLREELADK LTTIVVDAVL
CIAKPEESID LHMVEIMTMK HQTDDETKLI QGLVLDHGAR HPDMKRYVED AFVLTCNISL
EYERSEVNST FMYTDAEQRE KMVAAERAYT DETVRKVIAL KKQVCDGNDK GFVVITQKGI
DPISLDMLCK EGIMGLRRAK RRNMERLVLA CGGQCINSVE ELSPEILGHA GEVYEYVLGE
EKYTFVEKVV NPTSCTVLLK GSNDHTIAQL KDAVRDGLRA VKNVLTDKAV VPGAGAFEMA
LNKHLKENVT KMVEGRAKRG VEAFAEAMLV VPKTLAENSG YDPQDAIIDM QEEHDRGNVV
GFDISIGEPF DPTMSGIYDN FLVKQQILHS APIIATQLLC TDEVLRAGVN MRKR