Gene OSTLU_2484 details

Gene Information

Locus tag: OSTLU_2484
Symbol:
ID: 5001810
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 437438
End bp: 438766
Gene Length: 1329 bp
Protein Length: 443 aa
Translation table:
GC content: 53%
IMG OID: 640417231
Product: predicted protein
Protein accession: XP_001417770
Protein GI: 145346592
COG category: [C] Energy production and conversion
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4232] Thiol:disulfide interchange protein
TIGRFAM ID: [TIGR01126] protein disulfide-isomerase domain
            [TIGR01130] protein disulfide isomerases, eukaryotic
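
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it is the reading that reproduces the listed gene length); the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 437438
END_BP = 438766
GENE_LENGTH_BP = 1329
PROTEIN_LENGTH_AA = 443


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end], assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def codon_count(length_bp: int) -> int:
    """Number of complete codons in a coding sequence of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))  # 1329
    print(codon_count(GENE_LENGTH_BP))    # 443
```

Under that assumption the span is 1329 bp, which is exactly 443 codons. Every codon is therefore accounted for by the 443 aa protein, so the listed gene length does not appear to include a stop codon.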


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.632802
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GAAGCGCCGA CGGATGATCA CGTGTTGAAG CTCGATGCGA GCATCTTCGA CAACGAGTTG 
AAAAAGTCGA AATATAACTT CGTGATGTTT TACGCGCCGT GGGATGGGCA CTCAAAGGCG
TTCATGCCGC GTTGGATGTC TTACGCGCAG TCGCATAAAA TGGCGGGCAC GGAGATGACG
TTTTCGCTCG TGGACGCGAC CAAGGAACGC GATTTGGATA AGCGATTCGA AATCGAGGAA
TACCCGACGC TCATATTGTT CCGTGATGGT GTGCCGAAGA GGTACGTGGG CGATCGATCG
CCGCAACACT TGGATAAGTT TGTTCGAAGA AACTTGCTCA AGCCGGCGCG TTGGCTGGAA
GGCACGGACG ACGTCGAAGT TTTCTTGATG GGTCGCGACG TCACCGTTAT CGGGTTCTTC
GATAACAAGG ATGATTTGGA CGTGTACCAC CACGCCGCGG CTGAGTTTGA TCTCGACTTT
GGCGAGACGA AGAGCAAAAT CGCCACTGAA GACTGGAAAG CGCCGTTCCC GACCATCAAG
ATGTGGCGCG ACTTTGACAA AGAACCCGTT AGGTACCCCG GCGACGTGCG CGATTTGGAT
GCTATCAAGT CCTGGATCGC CACTGAAATG GTCCCACCGA TCGTGAAGTT CGAAAACAAG
AAGCAACTCG AGCGCCTTTT CATGGGTCCG ATCGCTGCGA ACATCTTCGT ATTCTTACCC
GAAGACGCGA CCGAAGCCGA GAAGATGTCG AAATCTTTAG AAAGTGCGGC CAGACAACTT
CGTGGTAAGG TGCACATCAT CACCGTCGAT GCCAAAGAAA CTGTCATGCA TGACTACTTC
TCTCTCCGCG AGAGCGACGG GCCGACGATT CGCCTTCTCT CGCATGACTT GAAGTATCAA
TACAAGGGCT CATTGGAGGC CGCCGAGATC TCAAACGATG TCGTGCACTT TTTCAAGGAA
TTCGAGGCGA AAAAGCTCGT GCCGTTGCTC AAGTCGCAAG ATCCGCTCCC CAAGGACGGT
GACGTTCTGC AAGTTGTCGG TAAGACGTTC CAGTCGTTGC TCATGGATAA CGACAAGCAC
GTCTTTGTTT GGTTCTACGC GCCGTGGTGC CGCACGTGCA AGGCGATGAA GCCGGTGTGG
GATAAGCTCG CCACGCTTTA CAAGGATGAG AAAGACATCA TCATCGCCAA GATGGATGCG
ACGAAGAACG AGGCGAAGGA TTTGCACGTT CGACACTATC CGACCGTGTA CTACTATCAT
TCCGGTGATA AGCCCAGACA CGAGGAATAC GACGGACACA TGGAAACGGA TGCGTTCACC
GATTTCCTC
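
The record lists a GC content of 53%. A minimal sketch of how that figure could be recomputed from the gene sequence follows. The helper name `gc_percent` is hypothetical, and the sample string is only the first line of the sequence above, so its value will not necessarily equal the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping used in the record) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, copied from the record.
FIRST_LINE = "GAAGCGCCGA CGGATGATCA CGTGTTGAAG CTCGATGCGA GCATCTTCGA CAACGAGTTG"

if __name__ == "__main__":
    print(f"{gc_percent(FIRST_LINE):.1f}% GC in the first 60 bases")
```

Passing the full 1329-base sequence to the same function gives the whole-gene value to compare with the reported 53%.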
 
Protein sequence
EAPTDDHVLK LDASIFDNEL KKSKYNFVMF YAPWDGHSKA FMPRWMSYAQ SHKMAGTEMT 
FSLVDATKER DLDKRFEIEE YPTLILFRDG VPKRYVGDRS PQHLDKFVRR NLLKPARWLE
GTDDVEVFLM GRDVTVIGFF DNKDDLDVYH HAAAEFDLDF GETKSKIATE DWKAPFPTIK
MWRDFDKEPV RYPGDVRDLD AIKSWIATEM VPPIVKFENK KQLERLFMGP IAANIFVFLP
EDATEAEKMS KSLESAARQL RGKVHIITVD AKETVMHDYF SLRESDGPTI RLLSHDLKYQ
YKGSLEAAEI SNDVVHFFKE FEAKKLVPLL KSQDPLPKDG DVLQVVGKTF QSLLMDNDKH
VFVWFYAPWC RTCKAMKPVW DKLATLYKDE KDIIIAKMDA TKNEAKDLHV RHYPTVYYYH
SGDKPRHEEY DGHMETDAFT DFL
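
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the record leaves the translation table field empty, so that choice is an assumption. The helper names are illustrative.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Using table 1 is an assumption: the record does not state a translation table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace grouping used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 bases and last 9 bases of the gene sequence, copied from the record.
GENE_START = "GAAGCGCCGA CGGATGATCA CGTGTTGAAG"
GENE_END = "GATTTCCTC"

if __name__ == "__main__":
    print(translate(GENE_START))  # EAPTDDHVLK
    print(translate(GENE_END))    # DFL
```

Both fragments reproduce the corresponding ends of the listed protein (`EAPTDDHVLK` and `DFL`). The sequence as given starts at `GAA` rather than `ATG` and ends without a stop codon.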