Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_88181 |
Symbol | |
ID | 5003670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 188869 |
End bp | 190725 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | |
GC content | 56% |
IMG OID | 640419091 |
Product | predicted protein |
Protein accession | XP_001419518 |
Protein GI | 145350233 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.642963 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCAT CACCACGCGA GAAGATCAAG CGCCTCGCCG TGGCATGCGT GCGACGCGGC TCGTTCATCT CGCTCTGTCT CTACGTCCTC GGCCTCGTCG TCGCCTCGCT CGCGCCGTCG CTCGCGAGAG ACGTCTACGT GGATGAGAAT GCATTCCTCG TAGGCTCCAC GCACGCGACG TTTGACGACC TCGACGGCGC GCGCGCGGAT GATTACGTCG AGACATTGAC TAAAATCACG TTCGACGCCC GATCGCGGGC GCGAACGACG CGTGAGCGCT TGGAATGGGT TTTAAACGCC CTCGATGAGC GCGGCTTTGA GAGTTACAAA TCATGGCTCG ACGGCGCTGG TGACTTGTTC AACGTCCACG GCGTCGCTCG CGCGGCGCGA GGGAACGGAC GAGAATCGAT GGCGTTGGTC ACGGTCCTGG GTGACGGTGA CGCTGACGCT GAGGCGGCGA CGGTCGGATT GGCGTTGCGC GCGTTTGAGA AGATTGGTCG CGCTCCGTGG TTGGCGAAGG ATTTAATGTG GGTCTGCGTC GATGGCTCGC GAGGAGAGAT CGATGGGACG ATGGCTTGGT TAAAGACGTA TTATTCATCT AGCGTTGGGG ACTTGGGTGG GGGATTTGAG CGCGCGGGGG CGATACAGCA GGCGTTTGTG TTTCGCGCGG CGAACCGTGG CGCCGCGGCG TCGGCGGTGC GCGTCAAGTT GGAGGGTTGG AATGGGGCAT ATCCGAATCA AGACATATTT ACGATGTTTC GTAGCATCGT GGAGACGTAC CCTGTGAGCA TGAGAGTAAG CTTGGAGTCG GATGTCGAGG TGCGAGAAGA TGATCTTTCG CGCTGGAGTT TGATGAAATC GACGGCGCGC TTCATGTGGC GCGCCGCCAC GGGAATTCCT TCGGGTGCGC ACGCCGCGTT CAAGGCGCAC TCAATCGATG CGATTAGTTT CGAAGCGATC GAACGTCAAC AAGACGCTTA CGTGCGAAGC GGTTTGCGAG CGTACGTCAC TTTGGGACAG ATGCTTGAAC TTACGTTTCG AGCGTGCAAC AACTTGCTCG AGCTCTTGCA TCACAGTTGC TTTTATTATA TCTTGCTCGG TCCAAACAAG TTTCTTGGCA TAGCAGAGTA CATCGCGCCG CAAGCAATTC TTCTCGTCAG TCTTTTACTC ACCGCGTTGA AAATGACGAC GTTCGGGATG GAAGATACGT CGTCAACAAG CGATGGCGAA ACGCGAATGT CACATGATTG GTTTGCGGCA ATATCGAAAC TTTCCTTGGC ACTCGTGTTC GGTGCCATCG TAGGTACGAG TTGCGTTTCA TTGCATTTGA GGGAGCTAAA TCACGTGACG GTGACTTTGG GTACAGTCGC CGTCGCATTC ATCGCTTTCA TTACCTTTTT ACGCTTGACA CTCGACGGCG AGGCGTCTCC GGTTCGCTCG ACGACAAAGG TTTGCGACGT TACCATCGTC CGACAAGAGC AGTGGGTCGG TGTGAAAGTG ATCAACATCG CGTGGTTGCT ATTTACGATG AGTGCGTGTA CGTTTTTCAA CTTCGCTCTC GCGTTTTTAA CCACCGTGGC GTTAGCGCCC GCGTGCTTGC TCTGCGCGCC GAGCGGCGAC GCCGCGAAAC GAAACCGCGC AGTTGCGGTT CTCGGCGCGC TTCCGCCGAC ATGGATGTTC GTTCTCAGCC GTTTTGCTGG ATCACCGGTG TACGAGTCTT TCGGTCTGCT CGCCGAACAC CACGTACGTT GGAAAACGTT CGCGTTGCCA GTCGTCTTTG GCATCGCGTT CCCGGTGCTG TTAATTTGTT TTGATGTTGC GCGTTCTCCA GTGAAATCGA AAACAAAAAA GGCTTAG
|
Protein sequence | MSASPREKIK RLAVACVRRG SFISLCLYVL GLVVASLAPS LARDVYVDEN AFLVGSTHAT FDDLDGARAD DYVETLTKIT FDARSRARTT RERLEWVLNA LDERGFESYK SWLDGAGDLF NVHGVARAAR GNGRESMALV TVLGDGDADA EAATVGLALR AFEKIGRAPW LAKDLMWVCV DGSRGEIDGT MAWLKTYYSS SVGDLGGGFE RAGAIQQAFV FRAANRGAAA SAVRVKLEGW NGAYPNQDIF TMFRSIVETY PVSMRVSLES DVEVREDDLS RWSLMKSTAR FMWRAATGIP SGAHAAFKAH SIDAISFEAI ERQQDAYVRS GLRAYVTLGQ MLELTFRACN NLLELLHHSC FYYILLGPNK FLGIAEYIAP QAILLVSLLL TALKMTTFGM EDTSSTSDGE TRMSHDWFAA ISKLSLALVF GAIVGTSCVS LHLRELNHVT VTLGTVAVAF IAFITFLRLT LDGEASPVRS TTKVCDVTIV RQEQWVGVKV INIAWLLFTM SACTFFNFAL AFLTTVALAP ACLLCAPSGD AAKRNRAVAV LGALPPTWMF VLSRFAGSPV YESFGLLAEH HVRWKTFALP VVFGIAFPVL LICFDVARSP VKSKTKKA
|
| |