Gene OSTLU_42712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42712 
Symbol 
ID5003212 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp605067 
End bp606689 
Gene Length1623 bp 
Protein Length513 aa 
Translation table 
GC content61% 
IMG OID640418633 
Productpredicted protein 
Protein accessionXP_001419434 
Protein GI145350046 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3118] Thioredoxin domain-containing protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.732004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.929724 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGCC CGACGGCGGT GCGCGGCGGT GGCGCCTTCG GCGCCCCGCA CGGGGCCCCG 
TTCGGCGGCG CGCACGCCCC CACGAGGCTC GGACGCGCGT TTCGAAGCGT AGACTTTTAT
CGCAAACTCC CGCGCGACAT GACGGAGGGC ACGGTGAGCG GGAGCGTGAT ATCCATCTTC
GCCGCGGTGT TGATGACGTT TCTGTTGCTC AGCGAACTGC GGAGTTACTC GTCGAGCTCG
TTCGACACCA AGGTGGTGGT GGATCGGAGC GTGGATGGGG AATTGCTGCG AATCAACTTT
AATCTTTCGT TTCCCGCGCT GTCGTGCGAG TTCGCGAGCG TGGACGTCGG CGACGCCCTG
GGATTGAATC GGTTCAATCT CACGAAGACG GTGTTTAAGC GGGCGATCGA CGCGGACATG
CGAGCGATCG GGCCGCTGCA GTGGGACCGA GCGGTGGACG AGGTGCTCAA GGCGAGCGAC
GAGGAGACGA CGCGCGCCGA GGAGAGGGTG GCGCGGCACA AGGAGGCGCT CAAGGTGCTG
CAGGAGTCGA ATAGGCCCGC GGATGGGCAC GCGCACGTGG TGTACGAGAT CGCGGATCTG
GACGAACTGC AAGCGATGGT GAAGGATCCG ACGCACGCGG TGGTGTTGGT GAATTTCTAC
GCGCCGTGGT GTCCGTGGTG TCAGAGGCTC GAGCCCGTGT ACGAAGCCGC GGGGCTCGCG
GTGCACGAAA AGTACCCGCC CGGGACGAAG TCGCGCGTGC TGTTCACGAA GATTGATTGC
GTGGTGCACG AAAAGTTTTG CATGGCGCAA GTCGTGACGG GATACCCCAC GATTCGCATT
TTCACTCACG GCACCGACAT TTTGATGCAC GAGGGCAAGC GCGAGCACGC GTTTTACAAG
GGTCCGCGCA CGGTGGATGG GCTGACGCAG TTTGTGGACA CGCTCGTGCC ACCGCCGGAG
CCGGTGGGTG AGTCGAGCAT AGAGGCGGCG CAGGAGGAAA ACATGAAGCT TCGGCTTCCG
GCGAGCGTCG ATATGCAAAA GCGCATCATC GGCCCGGGGT GCGCCATCAC CGGTTTCGTG
CTCGTGAAGA AAGTTCCCGG GCACTTGTGG ATCAGCGCGT CCTCTCCGGA TCACTCGTTC
CACGGTGAAA CGATGAACAT GACGCACGTC GTCAACCACT TTTACTTTGG ACATCAACTC
AGCGACGAAC GTAGACGTTA CCTGGAAAAG TTTCACGCCG GAGAAAAAGC GGGCGACTGG
CACGACAGAC TCGCGAGCGA GCGCTTCGTC TCCAACGCCG CGCACGTCTC TCACGAGCAC
TATTTACAAA CCGTCCTCAC GACCATCACT CCGCGCGGGC GATACACCCT TCCGTTCAGC
GTGTACGAGT ACACCCAGCA CTCTCACGCC GTGCACGAAC CGCTTCCAAA GGCAAAGTTT
CATTACCAAC CGAGCCCGAT GCAAATCGTC GTCTCCGAGG AAAAGATGGC GTTTTACTCA
TTCATCACCA GTCTCATGGC CATCATCGGC GGCGTGTACT CCGTCATGGG CATCGCCGAC
GGCGTTTTGT TCAACTCACT CGCCCTCGTG CGCCGCAAGC TCGAGCTCGG CAAGCAAGGT
TAA
 
Protein sequence
MQRPTAVRGG GAFGAPHGAP FGGAHAPTRL GRAFRSVDFY RKLPRDMTEG TVSGSVISIF 
AAVLMTFLLL SELRSYSSSS FDTKVVVDRS VDGELLRINF NLSFPALSCE FASVDVGDAL
GLNRFNLTKT VFKRAIDADM RAIGPLQWDR AVDEVLKASD EETTRAEERI ADLDELQAMV
KDPTHAVVLV NFYAPWCPWC QRLEPVYEAA GLAVHEKYPP GTKSRVLFTK IDCVVHEKFC
MAQVVTGYPT IRIFTHGTDI LMHEGKREHA FYKGPRTVDG LTQFVDTLVP PPEPVGESSI
EAAQEENMKL RLPASVDMQK RIIGPGCAIT GFVLVKKVPG HLWISASSPD HSFHGETMNM
THVVNHFYFG HQLSDERRRY LEKFHAGEKA GDWHDRLASE RFVSNAAHVS HEHYLQTVLT
TITPRGRYTL PFSVYEYTQH SHAVHEPLPK AKFHYQPSPM QIVVSEEKMA FYSFITSLMA
IIGGVYSVMG IADGVLFNSL ALVRRKLELG KQG