Gene OSTLU_38656 details

Gene Information

Locus tag: OSTLU_38656
Symbol:
ID: 5002148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 842502
End bp: 843530
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table:
GC content: 60%
IMG OID: 640417569
Product: predicted protein
Protein accession: XP_001418126
Protein GI: 145347330
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1097] RNA-binding protein Rrp4 and related proteins (contain S1 domain and KH domain)
TIGRFAM ID:
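
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions, not something the record states, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 842502, End bp 843530.
length = gene_length(842502, 843530)
print(length)                  # 1029, matching "Gene Length"
print(protein_length(length))  # 342, matching "Protein Length"
```

The check only applies because the record lists the gene as not spliced; for a spliced gene the genomic span would include introns.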


Plasmid Coverage information

Num covering plasmid clones: 84
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGTGA GTTTTCGAGA CGCCGACGCG TCGACGACGC GCGCGTTCAA GCGCGCGAAG 
CGCGCGCTGG AGCAATGCGC GACGGCGAGC TCGTCCGCGC GCGCGTTCGT GAGCCCGGGC
GAACGCATCC CAGGGATCGA TTCGTCTGAA GGTTTCTTGC GAGGACACGG CACGCGACCG
ATCGCAGAGG ACAATGGAAC CGACGGTGAC GGCGACGATG ATCGTGGGTT AGTGGCCACG
ACCGCGGGCG TGGTTGAACG CGTGAATAAA CTCGTCTCAG TGCGAGCGTT GAAAGCGCGC
TATGCGCCAG AAACGGGTGA CGTCGTGCTT GGGCGCGTGA AAGAGATATC TGGTAAGCGT
TGGATTTTAG ACGTGAACGC GCGACAGAAT GGGGTATTGC AGCTCAGTGC GGTGCATTTG
CCCGGGAACG TGCAGAGACG ACGGAACGAT GTGGATGAGT TGAACATGCG CATGCTGTAC
GCCGAAGACG ACGTCGTGAG CGCCGAAGTG CAGAGCGTGT ACGCCGATGG CGCCGCCGCG
CTGCACACGC GAAGCTTGAA GTACGGTTGC TTGAAGAATG GTCAGCTCGT GCGAGTGACT
GCGAATTTAG TGCGCCGATT GCCTCAGCAT TTTCACAGGC TTAAGATGGA CGAGTTTCAC
GACGGCGTCG CCGCTGAAGC GAACGACGTG GAAATCTTAC TCGGGTGCAA CGGTTTCATT
TGGGTTGGTG CGCCGAGCGG CGCCACCGCA CCGCGCGAGT CGGAGATTCG CCGCGAGCCG
AGTGATGTCG TGGACGAGCT GCGCGAGCTT CACGGCGATG AAGTGTCTCC GGTTCAACGC
GAAAATATAT CCCGCGTGGC AAATTCAGTG CGCGCGCTCG CCGAGCTGTT CCTTCCCATC
TCGCCGCCGG CGGTCATGGA TGTTTTTAAA GCGTCGAGCG AGTGTGGGGT GGCGGTGAGG
GATATGCTGA GCCAAGGATT CTTAACTCGT ATCCTCGAGC GAGAATTCGA AAAGCGCGTC
GCCGACTGA
 
Protein sequence
MVVSFRDADA STTRAFKRAK RALEQCATAS SSARAFVSPG ERIPGIDSSE GFLRGHGTRP 
IAEDNGTDGD GDDDRGLVAT TAGVVERVNK LVSVRALKAR YAPETGDVVL GRVKEISGKR
WILDVNARQN GVLQLSAVHL PGNVQRRRND VDELNMRMLY AEDDVVSAEV QSVYADGAAA
LHTRSLKYGC LKNGQLVRVT ANLVRRLPQH FHRLKMDEFH DGVAAEANDV EILLGCNGFI
WVGAPSGATA PRESEIRREP SDVVDELREL HGDEVSPVQR ENISRVANSV RALAELFLPI
SPPAVMDVFK ASSECGVAVR DMLSQGFLTR ILEREFEKRV AD
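
The gene and protein sequences above are related by translation, and the GC content reported in the record can be recomputed from the gene sequence. The following is a minimal sketch of both steps, not the method the database used. It assumes the standard genetic code (NCBI translation table 1), because the record leaves the translation table field blank. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the record does not name a translation table, so table 1 is a guess.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains anything other than A, C, G, or T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGGTGGTGA GTTTTCGAGA CGCCGACGCG"))  # MVVSFRDADA
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field in the record.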