Gene OSTLU_1568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_1568 
Symbol 
ID5004343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp210974 
End bp212233 
Gene Length1260 bp 
Protein Length420 aa 
Translation table 
GC content54% 
IMG OID640419764 
Productpredicted protein 
Protein accessionXP_001420275 
Protein GI145351851 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000880884 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.122051 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTCACCGCGG GCGTGCTTTT GGTCATGTCG GCGTTTTTTT CGTTAGCGGA AACGTCCATC 
ACGACGTTGT ACCCTTGGAA AGTGCGAGAG TTGGCGGATC AGGAAGGTTC GACGGGCGGG
GTGTTTCAGA TTATGCGCAA GGACGTGACG AGATTTCTCA CGACGATTTT GATAGGGACG
ACATTTTCTG GGATCATGGC CACCGCGCTC ATTACGGAGG CTGCATTGAT ACTATATGGC
GATGGTGCGA CGACCGCGGT GACGGTGGCG CTGACGATCG TGATGCTGGT GTTTACAGAA
ATCGCTCCGA AGAGCGTGGC GGTGCAGCAC GCCACGGTGA TTGCACGCGT CATCGCGAAA
CCCATTTACT TGCTCTCCTT CGTCGTCTAC CCGCTCGGTC GAACGTGTCA AATCGTTGTG
AACGCCATGT TTGCCCTTTT CGGTCTCAAA ACTTCCGCTG AGCCGTTTGT GAGTGAAGAA
GAGTTGAAAC TCGTCCTCGC CGGGGCGACG AAGAGCGGCG AAGTGGAGAG CGCGGAGAAG
GATATGATTC AAAATGTGTT GGATCTCGAA GAGACGGTCG TGAGAGATGT GATGACACCG
CTCGTGCAAG TGCACGGCGT GCGAAGCGAT GCCACATTGG CTGAATTTCG TACGGAATGG
ATCGAGCACA AGTACTCTCG CGTGCCGGCG TGGGAGGATC GCGTGGACAA CATCGTGGGC
ATTGTTCGAG CGAATCAAAT CATGCAGCTC GGAATAGAAA GAGATCTTCG CCCGGAGCAA
AGTAAGGAAC TCGAAGACGT CCTCGTCCAG GATGTCATGC TTCGTGACAC TTATTTTGTT
CCCGAAAGCA TGTCCGTGAG TAAACTCCTT CGCGAGCTCA TGCAGCGCAA GTCTCACATG
TGCGTCGTGG TGAATGAATT TGGCGGCACT GTAGGTATCG CCACACTTGA GGATTGCGTG
GAGGAGATCG TCGGTGAAAT CTATGACGAA GAGGATAGTC AAAAGGCAAA CGCGGATGAG
GATGAGCAAG ATGCGACGCC GTTCATTCGC GAGGTGGGAC AGGGGGCGTA TCTCGTAGAC
ACTCGCGCGG CGCTATGGAA ATTGGCGGAT GAGTTGTCGC TGGACATTCC CGAGTCGCCT
CTGTACGAAA CTGTGGGTGG TTTCGTGTGT GATTTATTCG GATCTATTCC CGACGTCGGG
GCGTCGATCA CGACAACGTT TGAGCACGTC GAGGACGAGG ACGCTAGTTC GGATGACGAG
 
Protein sequence
LTAGVLLVMS AFFSLAETSI TTLYPWKVRE LADQEGSTGG VFQIMRKDVT RFLTTILIGT 
TFSGIMATAL ITEAALILYG DGATTAVTVA LTIVMLVFTE IAPKSVAVQH ATVIARVIAK
PIYLLSFVVY PLGRTCQIVV NAMFALFGLK TSAEPFVSEE ELKLVLAGAT KSGEVESAEK
DMIQNVLDLE ETVVRDVMTP LVQVHGVRSD ATLAEFRTEW IEHKYSRVPA WEDRVDNIVG
IVRANQIMQL GIERDLRPEQ SKELEDVLVQ DVMLRDTYFV PESMSVSKLL RELMQRKSHM
CVVVNEFGGT VGIATLEDCV EEIVGEIYDE EDSQKANADE DEQDATPFIR EVGQGAYLVD
TRAALWKLAD ELSLDIPESP LYETVGGFVC DLFGSIPDVG ASITTTFEHV EDEDASSDDE