Gene OSTLU_38921 details


Gene Information

Locus tag: OSTLU_38921
Symbol:
ID: 5001831
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 385855
End bp: 386976
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table:
GC content: 59%
IMG OID: 640417252
Product: predicted protein
Protein accession: XP_001418003
Protein GI: 145347074
COG category: [R] General function prediction only
COG ID: [COG1090] Predicted nucleoside-diphosphate sugar epimerase
TIGRFAM ID: [TIGR01777] conserved hypothetical protein TIGR01777
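
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that cross-check, not part of the original record. It assumes the coordinates are 1-based and inclusive (the record does not state this) and that the gene length includes the terminal stop codon, which fits the gene sequence ending in TGA. The variable names are illustrative only.

```python
# Values copied from the record fields above.
start_bp = 385855
end_bp = 386976
gene_length_bp = 1122
protein_length_aa = 373

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# Assumption: the unspliced CDS includes its stop codon, so the protein
# has one residue per codon except for that final stop codon.
expected_protein_length = span // 3 - 1

coords_consistent = span == gene_length_bp
in_frame = span % 3 == 0
protein_consistent = expected_protein_length == protein_length_aa

print(f"span = {span} bp, matches Gene Length: {coords_consistent}")
print(f"length is a multiple of 3: {in_frame}")
print(f"expected protein length = {expected_protein_length} aa, "
      f"matches Protein Length: {protein_consistent}")
```

Under these assumptions all three checks agree with the listed values: the 1122 bp span corresponds to 374 codons, one of which is the stop codon, leaving 373 residues.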


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.71449
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0770912
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTCAG CAGTGACTTG TAACGCGTTT TATCGTGCTC CGGCGAGCAA TGCGACGCCC 
TGCGCGCGCC TCCGCTCGCC GTCCCGTTCG GTGGATAAGA CGTGTCGTCG CCATCTTTTC
CGCTCGACGC GCGCACGTCG GGGCGCTCTA GCGCCTTCCG CGAGCGTGGA CGACGACCTG
TCCGTAGCGT CGTCCGACGA TCGGCTCATC GTCGCCATCA CGGGCGCAAC CGGGTTCGTG
GGTAGCAAGC TAGTGGAGAC GCTCCTCGAG CGGGGCGCCG AAGTGCGAGT GCTCACGCGA
GACGTCAACC GCGCTCGCGC GAAGCTTTCT CCACAAAATC TCCCAAAGGG CGACGTCGCG
TTTGTGTCTC CGGATAAGTG GCGACGCGGG TTGCTCGGCG CGACGCACGT GGTGAATTTA
GCTGGCGAAC CAATCAGCAC GCGCTGGGAC CCGAAGGTGA AGGGTGAAAT CATGGCGTCC
CGAGTGAAAA CCACCAAGGC GGTCGTCGAA CACGTGAATT CGATCACAAA CGACGCCAAG
AGACCTAAGG TATTGGTGAA CGCTTCGGCG ATCGGGTACT ACGGGACGAG TGAGACAGAT
ACGTACGACG AAGCGAGCGG GCCAGGCGCG GACTATTTAA GTCAAGTCTG CCAGGCGTGG
GAACAAACCG CGAGTGGGGT TGAAGATTGT AGAGTGGTGC TGCTGCGATT AGGGATTGTG
CTCGATCGAG ATGGTGGGGC GCTCGGGAAG ATGGTGCCGA CTTTCCAAGC GTTCATGGGC
GGGCCCTTGG GCGACGGTCA GCAGTGGTTT AGTTGGATTC ATAGAGACGA CGCGGTGGGG
ATCATAATGG AGAGCTTGAC AAACGTAAAA CTTGAAGGTC CGGTGAATTG CGTCGCGCCA
ACGCCCGTCC GCATGCGAGA GATGTGCGAA TCCCTCGGCG AGACCTTAGG GAAACCGAGT
TGGTTGCCGG TGCCAGATTT CGCGTTGCGC GCAGTTCTCG GCGAAGGATC GACTCTAGTT
CTTCAAGGGC AGAGAATCCA ACCCAAAACC GCGCTTGATG TGGGTTATAA GTTCAAGTAC
GAGAGGATCG ACCAAGCGCT GAAGCAGATT CTTCGCCGTT GA
 
Protein sequence
MTSAVTCNAF YRAPASNATP CARLRSPSRS VDKTCRRHLF RSTRARRGAL APSASVDDDL 
SVASSDDRLI VAITGATGFV GSKLVETLLE RGAEVRVLTR DVNRARAKLS PQNLPKGDVA
FVSPDKWRRG LLGATHVVNL AGEPISTRWD PKVKGEIMAS RVKTTKAVVE HVNSITNDAK
RPKVLVNASA IGYYGTSETD TYDEASGPGA DYLSQVCQAW EQTASGVEDC RVVLLRLGIV
LDRDGGALGK MVPTFQAFMG GPLGDGQQWF SWIHRDDAVG IIMESLTNVK LEGPVNCVAP
TPVRMREMCE SLGETLGKPS WLPVPDFALR AVLGEGSTLV LQGQRIQPKT ALDVGYKFKY
ERIDQALKQI LRR