Gene OSTLU_30844 details

Gene Information

Locus tag: OSTLU_30844
Symbol:
ID: 5001045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 856841
End bp: 858148
Gene Length: 1308 bp
Protein Length: 428 aa
Translation table:
GC content: 66%
IMG OID: 640416466
Product: predicted protein
Protein accession: XP_001416779
Protein GI: 145344520
COG category: [T] Signal transduction mechanisms
COG ID: [COG0631] Serine/threonine protein phosphatase
TIGRFAM ID:
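
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention); the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 856841
end_bp = 858148
reported_gene_length = 1308   # bp
protein_length = 428          # aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1
print(gene_length)            # 1308, matches the reported Gene Length

# A 428-residue protein needs 428 codons plus one stop codon.
cds_nt = (protein_length + 1) * 3
print(cds_nt)                 # 1287

# Nucleotides in the gene span that the protein length does not account for.
extra_nt = gene_length - cds_nt
print(extra_nt)               # 21
```

Under that assumption the span reproduces the reported gene length exactly, while the reported protein length accounts for 1287 of the 1308 bp. The record does not say what the remaining 21 bp represent.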


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0384942
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGACATCGAT CGATGGATGC GATGCGCGCC GTCGCGGCGC CGCGCGCGAG CGCCGCGAGA 
CCCGCGCGCG CGGCGCGCGA CGCGCCGACG CGCCGACGCG TCGTCGCCGG CGCGTCGCGG
TACGCGCTGT GCGCGCACGG GGAGAATCTG CCGCATCCGG ACAAGACGGC GAAGGGCGGC
GAAGACGCGT GGTTCGCGCG CGTCTCGGCG GCGAACGGCG GCGGCGCGCT CGGCGTCGCG
GACGGCGTCG GTGGGTTCAA CGATCAGGGC GTCGATCCGG GGCTGTACGC GCGCGTGCTG
AGCTACGAGG GACTGCGGGC GTGCGACGGC GGCGACGGCG GATTCTTCGG GTCGAGCGCG
AAGATCGACC CGAGGGCGAT CGCGATCGAG GCGCAGGCGA AAACGATGCT GCCGGGCGCG
GCGACGATGT GCGTGGTGGC GCTGGATGGG AAAAAGCTGA CGTGCGCGAA CGTGGGGGAC
TCTGGGTTCC GAGTGGTGCG ACGCGGGGGG GTGACGTACG GGTCGACGGC GGGGCAGCAT
TATTTTAATT GCCCGTATCA GTTGGCGTAC GAGGCGCTGG CGAAGGATTG CGACAGCGCG
AGAGACGCGG ATGTGTACAG TTTCGACGTC GAGGCTGGGG ACGTGGTCGT GGCGGGGAGC
GACGGGTTGT TTGATAACGT GTTCGACGAG GAAATCGCGA GCGTGGTAAA CGCGGCGTAC
GCGAGCGCCG GGGACGCGGC GTCGGCGGCG GAATCGGCGG CGAAGGCGCT AGTGAAGGTG
GCGAGAAAGC ACGCGGAGGA TAAAAAGTAC GACTCGCCGT ACGCGCGCGA AATGGCGAAG
AGCGAAACCG ACAAGGGCGG CGCGCCGAAA GCCGTGGGAT TGTTTGGGGG ATTTCAGCAA
ATGCTCGGCG GTGGCAACTT GGGCGGGAAG ATGGACGATA TCACCGTCGT TGTTGCCACC
GTCGTCGACA CGGCGTCGTC GCAGCGCGAG CTCGCGCGAT CCGAAGCTGT GTGCGACGCC
AACACCAAGG CGCTGACCAA GGCGCGCGGT TTGGCGTCAA TCGAAGAGAC CAAAGTCGCT
CGTACGGTGG CGCTCCGTAA GGAAATGGAC GACGCGTTCA ACGAAAAAGT CGCCGAAGTA
AACAAGAGAG AGAAAGCCGT GGCGAACGCC AAGTCGGAAT TCACTCGCGC GCAAATCGAT
TCGATGGACG CGCCCACGGT GCGCAAGCTC TTACAAGAGC GCGGACTTCC CACGAGCGGC
AAGATCGATC GATTACGCGA TCGCTTAGCC GAAGTTAAGG CGCTTTAA
 
Protein sequence
MRAVAAPRAS AARPARAARD APTRRRVVAG ASRYALCAHG ENLPHPDKTA KGGEDAWFAR 
VSAANGGGAL GVADGVGGFN DQGVDPGLYA RVLSYEGLRA CDGGDGGFFG SSAKIDPRAI
AIEAQAKTML PGAATMCVVA LDGKKLTCAN VGDSGFRVVR RGGVTYGSTA GQHYFNCPYQ
LAYEALAKDC DSARDADVYS FDVEAGDVVV AGSDGLFDNV FDEEIASVVN AAYASAGDAA
SAAESAAKAL VKVARKHAED KKYDSPYARE MAKSETDKGG APKAVGLFGG FQQMLGGGNL
GGKMDDITVV VATVVDTASS QRELARSEAV CDANTKALTK ARGLASIEET KVARTVALRK
EMDDAFNEKV AEVNKREKAV ANAKSEFTRA QIDSMDAPTV RKLLQERGLP TSGKIDRLRD
RLAEVKAL
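
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch, not the database's own procedure. It assumes the standard genetic code, because the Translation table field of this record is blank, and it starts translating at a 0-based offset of 21, a value inferred here by comparing the two sequences rather than stated in the record. The helper names `clean` and `translate` are hypothetical. Only the first 60-nt line of the gene sequence is used to keep the example short.

```python
from itertools import product

# Standard genetic code (assumption: the record leaves "Translation table" blank).
# Codons are enumerated in TCAG order, the usual layout for this amino-acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna, offset=0):
    """Translate from a 0-based offset, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(offset, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the start of the protein sequence,
# copied from this record.
gene_start = "CGACATCGAT CGATGGATGC GATGCGCGCC GTCGCGGCGC CGCGCGCGAG CGCCGCGAGA"
protein_start = clean("MRAVAAPRAS AARPARAARD")

# Assumption: the coding region begins 21 nt into the listed gene sequence.
translated = translate(gene_start, offset=21)
print(translated)                            # MRAVAAPRASAAR
print(protein_start.startswith(translated))  # True
```

To check the whole record, pass the full gene sequence to `translate` with the same offset and compare the result with the full protein sequence after `clean`.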