Gene OSTLU_4881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_4881 
Symbol 
ID5004769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp294262 
End bp295215 
Gene Length954 bp 
Protein Length318 aa 
Translation table 
GC content63% 
IMG OID640420190 
Productpredicted protein 
Protein accessionXP_001420651 
Protein GI145352650 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1218] 3'-Phosphoadenosine 5'-phosphosulfate (PAPS) 3'-phosphatase 
TIGRFAM ID[TIGR01330] 3'(2'),5'-bisphosphate nucleotidase, HAL2 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.00138812 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.468917 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GCCGCGCGCG CGGTGCGGCT CGCGGGCGCG CTGTGCCGGA AGATGCAGTT CGAGCTGCGA 
ACGAACGAAA AAGTGTCGAA ATCGGACGAC TCGCCGGTGA CGGTGGCGGA TTTCGCGGCG
CAGGCGGTGG TGTCGCACGT CCTGGGCGTC GCGAGGCCGG ACGTCGGGCT GGTGGCGGAG
GAAGACGCGC GGAGTATGCG GGAACCAGCG GGCGCGAAAT TGCGAGCGAG AGTGACGGCG
GTGGTGAACG ATGCGCTCGA AGGCGTGGTG GAGCGCAGAC TGAGCGAGGA GGAGGTCATG
GACGCGATCG ATCGCGGGGC GACGGACGGC GGCGCGTCGG GGTCGTTTTG GATTCTCGAT
CCAATCGACG GCACGAAAGG ATTCATTAAT GGTCGGCAGT ACGCCATCGC TTTGGCGCTC
ATGGAGGACG GCGAAGTTAC GGGTGGTGTT CTCGGGTGTC CGAACATGCC GAGCGAGAAG
ATACCGCGAG GAGCGACGGA AATTCCGACG GCGGCGCCGG GAGTAATTTT CGTCGCGTAC
AAGGGGCGCG GGACGACTGT GGGGGCGTTC GACGCGGAGC ATCCTCTGCG AGATGGCGCG
AAAATAACGA CGAATAAAGT GGCCAGTTCG AGCGAAGCGA CGTACATGGA ATCGTGGGGG
GACTCCATCG TCGCCGATCA TGGGTTTACG AATTCTTTGA GCGCGGCGAT GGGCGTAACG
GCGCCGCCCG TGCGCATCGA TAGCATGGCA AAGTACGGTG CGCTCGCCCG TGGAGACACG
AATATGTATC TCAGGTTTCC GCCCGCGAGT TATAGAGAAA AAGTTTGGGA TCACGCCGCG
GGCGCGATCG TGGTTCAGGA GGCGGGAGGG GTCATCACCG ATGGCGCCGG GAATCCACTC
GATTTTTCAA AGGGACGATT TTTGGACATC GACATCGGCA TCGTGGCCAC GTCT
 
Protein sequence
AARAVRLAGA LCRKMQFELR TNEKVSKSDD SPVTVADFAA QAVVSHVLGV ARPDVGLVAE 
EDARSMREPA GAKLRARVTA VVNDALEGVV ERRLSEEEVM DAIDRGATDG GASGSFWILD
PIDGTKGFIN GRQYAIALAL MEDGEVTGGV LGCPNMPSEK IPRGATEIPT AAPGVIFVAY
KGRGTTVGAF DAEHPLRDGA KITTNKVASS SEATYMESWG DSIVADHGFT NSLSAAMGVT
APPVRIDSMA KYGALARGDT NMYLRFPPAS YREKVWDHAA GAIVVQEAGG VITDGAGNPL
DFSKGRFLDI DIGIVATS