Gene OSTLU_38574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_38574 
Symbol 
ID5002020 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp754792 
End bp755859 
Gene Length1068 bp 
Protein Length336 aa 
Translation table 
GC content60% 
IMG OID640417441 
Productpredicted protein 
Protein accessionXP_001417862 
Protein GI145346783 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0398973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAAAG ACTTTTACGA GACGCTGGGG GTGTCGCGCG CGGACGCGGA CGATCAGGAA 
AAATTGAAAA AGGCGTACAA GAAGGCGGCG CTGAAATCGC ACCCGGATCG ACCGGGAGGC
GACGCCGAAA AGTTCAAGGC GGTGGGTTTG GCGTACGATG CGCTGAGCGA CGCGAACAAG
CGGACGATAT ACGACCGATA CGGTGAGGAG GGGTTGAAGC AAGGGTTCGT GCCGCCGGAA
GCGAGGGGCG AGGCGAGCGG TGCGAGCGCG GGTGGGTTTC CGGGAGGAGG ATTTTCGGGA
AGCGCGCCCG GGAGTGGATT TCGCGCGTCG AGCGGCGGCG GCGGTTTCGG GTTCCCCGGC
GGCGGCGGGT TTCATGAATT CACCGGTGCA GACGCGGAAG ATTTGTTCGC CAGGTTTTTC
GGCGGTGGCG GCGGCGGCGG CGCGGGGTCA CCGTTTGGAG GAGGAATGGG CGACGCGTTC
GGCGCGGGCG TGGGGAGCAA ACGACGTCGT CCCGAGTGCG TGTTGAATCT CGAGTGCACG
CTCGAGGAGC TGTTTAGAGG CGGACGCCGG GACATCAACT ACGTTCGAAA CGTGCGTGCG
GGAACGAGCG GTCAGATGGC TCAAAGTAAT GAGTGCATCT CGATCGATTT CAAACCCGGT
TGGAAAACCG GCACGAAAAT TACATTTGCC GGAAAAGGGA ACGAAGACGC GCAAACCGGC
GAAGCGGCGG ATCTGGTCGT GGTGATCAAG GAAACGCCGC ACAAATTCTT ACGACGAGAT
GGAGATGACT TGGTGTACGA AGTTCCTCAA ATCTCACTTC GCAGCGCGTT GATTGGTTGG
AAGGTTGAAT TCGTCAACGT AGACGGCGAG AAGGTGCGTC TATCGTTCGA CGATCCTACG
GCTCCAGGAT CGGCGCGCGC GGTTCGAGGA AAAGGAATGC CGAATCAGAA GACCGGGCGG
AGAGGCGACC TCATCGTCAC CGTAAAAACC GTCAAGTTTC CCTCGCATCT CAACTCGAAA
CAAAAAACAT TGCTACGCGA AGCCTTCGCT CCAGGTGCCG CGGCGTGA
 
Protein sequence
MGKDFYETLG VSRADADDQE KLKKAYKKAA LKSHPDRPGG DAEKFKAVGL AYDALSDANK 
RTIYDRYGEE GLKQGFVPPE ARGEASGASA GGGGGGFGFP GGGGFHEFTG ADAEDLFARF
FGGGGGGGAG SPFGGGMGDA FGAGVGSKRR RPECVLNLEC TLEELFRGGR RDINYVRNVR
AGTSGQMAQS NECISIDFKP GWKTGTKITF AGKGNEDAQT GEAADLVVVI KETPHKFLRR
DGDDLVYEVP QISLRSALIG WKVEFVNVDG EKVRLSFDDP TAPGSARAVR GKGMPNQKTG
RRGDLIVTVK TVKFPSHLNS KQKTLLREAF APGAAA