Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_1568 |
Symbol | |
ID | 5004343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | + |
Start bp | 210974 |
End bp | 212233 |
Gene Length | 1260 bp |
Protein Length | 420 aa |
Translation table | |
GC content | 54% |
IMG OID | 640419764 |
Product | predicted protein |
Protein accession | XP_001420275 |
Protein GI | 145351851 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00000880884 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.122051 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTCACCGCGG GCGTGCTTTT GGTCATGTCG GCGTTTTTTT CGTTAGCGGA AACGTCCATC ACGACGTTGT ACCCTTGGAA AGTGCGAGAG TTGGCGGATC AGGAAGGTTC GACGGGCGGG GTGTTTCAGA TTATGCGCAA GGACGTGACG AGATTTCTCA CGACGATTTT GATAGGGACG ACATTTTCTG GGATCATGGC CACCGCGCTC ATTACGGAGG CTGCATTGAT ACTATATGGC GATGGTGCGA CGACCGCGGT GACGGTGGCG CTGACGATCG TGATGCTGGT GTTTACAGAA ATCGCTCCGA AGAGCGTGGC GGTGCAGCAC GCCACGGTGA TTGCACGCGT CATCGCGAAA CCCATTTACT TGCTCTCCTT CGTCGTCTAC CCGCTCGGTC GAACGTGTCA AATCGTTGTG AACGCCATGT TTGCCCTTTT CGGTCTCAAA ACTTCCGCTG AGCCGTTTGT GAGTGAAGAA GAGTTGAAAC TCGTCCTCGC CGGGGCGACG AAGAGCGGCG AAGTGGAGAG CGCGGAGAAG GATATGATTC AAAATGTGTT GGATCTCGAA GAGACGGTCG TGAGAGATGT GATGACACCG CTCGTGCAAG TGCACGGCGT GCGAAGCGAT GCCACATTGG CTGAATTTCG TACGGAATGG ATCGAGCACA AGTACTCTCG CGTGCCGGCG TGGGAGGATC GCGTGGACAA CATCGTGGGC ATTGTTCGAG CGAATCAAAT CATGCAGCTC GGAATAGAAA GAGATCTTCG CCCGGAGCAA AGTAAGGAAC TCGAAGACGT CCTCGTCCAG GATGTCATGC TTCGTGACAC TTATTTTGTT CCCGAAAGCA TGTCCGTGAG TAAACTCCTT CGCGAGCTCA TGCAGCGCAA GTCTCACATG TGCGTCGTGG TGAATGAATT TGGCGGCACT GTAGGTATCG CCACACTTGA GGATTGCGTG GAGGAGATCG TCGGTGAAAT CTATGACGAA GAGGATAGTC AAAAGGCAAA CGCGGATGAG GATGAGCAAG ATGCGACGCC GTTCATTCGC GAGGTGGGAC AGGGGGCGTA TCTCGTAGAC ACTCGCGCGG CGCTATGGAA ATTGGCGGAT GAGTTGTCGC TGGACATTCC CGAGTCGCCT CTGTACGAAA CTGTGGGTGG TTTCGTGTGT GATTTATTCG GATCTATTCC CGACGTCGGG GCGTCGATCA CGACAACGTT TGAGCACGTC GAGGACGAGG ACGCTAGTTC GGATGACGAG
|
Protein sequence | LTAGVLLVMS AFFSLAETSI TTLYPWKVRE LADQEGSTGG VFQIMRKDVT RFLTTILIGT TFSGIMATAL ITEAALILYG DGATTAVTVA LTIVMLVFTE IAPKSVAVQH ATVIARVIAK PIYLLSFVVY PLGRTCQIVV NAMFALFGLK TSAEPFVSEE ELKLVLAGAT KSGEVESAEK DMIQNVLDLE ETVVRDVMTP LVQVHGVRSD ATLAEFRTEW IEHKYSRVPA WEDRVDNIVG IVRANQIMQL GIERDLRPEQ SKELEDVLVQ DVMLRDTYFV PESMSVSKLL RELMQRKSHM CVVVNEFGGT VGIATLEDCV EEIVGEIYDE EDSQKANADE DEQDATPFIR EVGQGAYLVD TRAALWKLAD ELSLDIPESP LYETVGGFVC DLFGSIPDVG ASITTTFEHV EDEDASSDDE
|
| |