Gene OSTLU_43573 details


Gene Information

Locus tag: OSTLU_43573
Symbol:
ID: 5006804
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009374
Strand:
Start bp: 431289
End bp: 432299
Gene length: 1011 bp
Protein length: 317 aa
Translation table:
GC content: 69%
IMG OID: 640422225
Product: predicted protein
Protein accession: XP_001422747
Protein GI: 145357073
COG category: [R] General function prediction only
COG ID: [COG1075] Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold
TIGRFAM ID:
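
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the coding portion spans one codon per residue plus a stop codon; the function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP, END_BP = 431289, 432299
PROTEIN_AA = 317

gene_bp = feature_length(START_BP, END_BP)

# Assumption: one codon per residue plus one stop codon.
coding_bp = 3 * (PROTEIN_AA + 1)

# Whatever remains would be non-coding sequence within the gene span,
# which fits the "Is gene spliced: Yes" field.
noncoding_bp = gene_bp - coding_bp

print(gene_bp, coding_bp, noncoding_bp)  # 1011 954 57
```

Under these assumptions the span reproduces the listed gene length of 1011 bp, and the 57 bp not accounted for by the protein is a multiple of three, which is consistent with the gene being marked as spliced.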


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.021277
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.157886
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTCGC GGCTCGCGCG GACGCGCGTT AAAAATACGC CGCGTCGATG GCGCGCGTCG 
TCGGCGGCGC GCGCGGCGAC GCGCGGGCGC GGACGACGCG CGACGTCGTT CGCGCCGCTC
GAAATATCTG CAGCTTCGCG CGGCGCGGCG CGAGCGGCGC GCGGCGCGGC GCGAGCGGCG
CGCGCGCGCG GGCGTCGGGT CGTCGTCGTG CCGGGATTTT TGACCGGGAG CGACGCGTAC
GAGGGCGTCG CGCGCGCGCT GGCGCGAGCG ATCGGGGACG ACGCGCGCGT GCGCGTCGCG
CCGGTGAAGC GAGAGATGTG GTTCGGGACG CTGCGCGGCG GTTCGTTCGA GGAGATTTTA
GACGTCGTCG ACGCGTGCGC GCGAGAGGCG GCGAGGGATG GCGGTGAGAG GGTGTGCTTG
GTCGGACACA GCGCGGGAGG GTGGTTGGGG CGGTTGTATT TGGGCGACGC GCGGGCGTAT
CGCGGCGAAG CGCCGTACGA CGGCGCGCGA TTCGTGGACG CGTTGATCAC GCTCGGCGCG
CCGCACGGGA GCTTGGAGAA GTATCCGTTC GGTCGCGTGA GAGAGAATAG ACCGGGGGAG
AGCGAGTCGA TGCCGGACGA CGCGCGAGGG TCGTCGCTCG CGTTTACGAA TTATTATTAT
CCGGGCGCGT ATCGCGCCGA CGTGCGATAC GTCGACGTCG TCGGTGATTA CGCCCGCGGC
TCGGCGAATT TCGAGCTCTT TGACGCGCTG TGCGATAGGA GTGACACCAA GCGACCGCTC
GTCGATCGCG TGCGCGCCGC TTGGGAAGCG TTCACGATCG GAGTTTCGTA CGCCGCCAAC
TGCGGAAGAG CCGACGTCCG CGGCGACGGC GTCACCCCGA TCGACACCGC CCACGCCCTG
ACGGGCTCTG AACACGTCAT CTTGCCCGGC GTGTACCACG GCCCGACGAA ACCGACTCGT
TGGTACGGCG CCGATTCCGT CGTTGAACTG TGGTATCCGT ACTGTTTGTA A
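
The sequence is displayed in space-separated blocks of ten bases, so it has to be joined before any length or composition check. Below is a minimal sketch of that cleanup together with a GC-fraction calculation, applied only to the first displayed line of the gene sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database itself computes its GC content field is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from the record above.
first_line = "ATGGCGTCGC GGCTCGCGCG GACGCGCGTT AAAAATACGC CGCGTCGATG GCGCGCGTCG"

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the whole gene sequence through the same two functions would allow a comparison against the Gene length and GC content fields listed in the record.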
 
Protein sequence
MASRLARTRV KNTPRRWRAS SAARAATRGR GRRATSFAPL EISAASRGAA RAARGAARAA 
RARGRRVVVV PGFLTGSDAY EGVARALARA IGDDARVRVA PVKREMWFGT LRGGSFEEIL
DVVDACAREA ARDGGERVCL VGHSAGGWLG RLYLGDARAY RGEAPYDGAR FVDALITLGA
PHGSLEKYPF GSSLAFTNYY YPGAYRADVR YVDVVGDYAR GSANFELFDA LCDRSDTKRP
LVDRVRAAWE AFTIGVSYAA NCGRADVRGD GVTPIDTAHA LTGSEHVILP GVYHGPTKPT
RWYGADSVVE LWYPYCL