Gene Information |
Locus tag | OSTLU_34972 |
Symbol | |
ID | 5003884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 499632 |
End bp | 501305 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | |
GC content | 60% |
IMG OID | 640419305 |
Product | predicted protein |
Protein accession | XP_001419825 |
Protein GI | 145350884 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
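The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (this reproduces the reported gene length), that the gene is unspliced (as the record states), and that the CDS includes its stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # The stop codon is part of the CDS but does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(499632, 501305)
print(length)                     # 1674
print(protein_length_aa(length))  # 557
```

Both values match the Gene Length (1674 bp) and Protein Length (557 aa) fields of the record.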
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.217943 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0580524 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGGGA ACGAGAGGGC GCTCGCGCGC GAGCTTCGGT GGGCGGGACA CGTCGGCGCG CACGCGGCGA TCGTCGATGC GACGTCGAAC GAGGGCGAAG GGGGGGACGT GCGCGTGGCG CGGGTGATTT TGATGAATAT TGATGCGCTG ACGCACACGA AGGTGTGGGT GAGGGTGCTC GGGGCGAGCG GGGACGCGGC GACGGACGAG GACGCGCACG CGCGATGGCG AACGACGGAC GAGGCGTGCG GAAGAAGGTC GAATGTGTAC GCGTTGCTTC ACGTCGTGGC GGCGCCGATC GGACGCGAAT GGTGGGAAAG ATGGATCGGT GAGCGCGTGG GCGCGTGTGC GCTGAGCGTT CGAGCGTTCG TGAAGAACGC GCGGGGGTTT CCGGTGCTGC CGAAGGAGGC GCAAGCGATG GTTCGCGACA TGTTTCGCAG GAATATTCAA ATCATGTTGA CGGACTGGAA CGGCGCGCCC GATGGGTGCG CGCGCGAGAC CATCGCGCCC GGAGATGTGG CCGGGGACGA AGGGCGCGTG GGCGTCGACT CTGCGCATCC AATGCGGCTG TATTGGGAGT ATCTCGTCTA CCTCTTTCGC GGCGTCGAGC CCGCGAGCGA ACAGGCGTTG GCGGAAGCGC CTTACAGGGA TTATTTACAA GCGCCCTTGC AGCCGTTGAT GGATAATCTG GAGAGCGTGA CGTACGAAAC GTTCGAAAAG GATGCGAGCA AGTATATTCA ATACGAAGAA GCCGTGCGGT GCGCGCTTCT TGATCTCGTG CCAGAGGGCG ACGAGGGCTC TGTGATGGTC GTCGGCGCAG GTCGCGGACC GCTGGTTCGC GCGTCATTGC GTGCGTCCGA ACGCGCCAAT AGGAACATCA AAGTGTGCGC CGTGGAGAAG AATCCAAACG CGGTCGTCAC GCTGCAGCAC CTCGTCGCGA AAGAGGGTTG GGGTGATAGA GTACAAATTT TCCCGGGAGA TATGCGCACG TGCGCCGCGG ATGTTCGAGT CGACGTTTTG GTGAGCGAAT TGCTCGGGAG TTTCGGAGAC AACGAGTTGA GCCCAGAGTG TTTGGACGGC GCGCAGCGAT TTCTCAAACC AACGGGCGTG AGCGTGCCGC AGTCGTACGA ATCTTTCGTC GCGCCCATCG CTGCGGCAAA ATTGCACGAC GCCGTCGTCT CGTACAAGGA TTTGAAATCC ATCGAAACAC CGTACGTGGT CAAGTTTCAC AGAGTGCATC ACATCGCGGA ACCGAAGAGC GTGTGGGAGT TCGAGCACCC GAACAACGCC GCGCGAATCG ACAACGAGCG TTACGCGCGC GTCGAGTGGT CGAGTGAAGA ACTTGGATCA GCCTCGAGCA CGCTTCACGG TTTCGCCGCG TACTTTGACG CGACGCTGTA CGACGGACCT GCCGGATGCG TGCGCTGTAG CATCCATCCT CATAACCACA CTTTAGGTCC GACGGGCGAG CTCATGTTTT CGTGGTTTCC AATGTTCTTT CCCATTCAAA CGCCAGTGTA CATCGATAGA CGCGGCGCTT CGCCGACGAA GATTGAATTT TATATTTGGC GCCGCGTCGA CGCGCACAAA ATGTGGTACG AATGGACGAT TGCGAAACCG GTTCAAGGGC ACATACACAA CCCGAATGGG CGATCGTACT GGATCGGCCT ATAG
|
Protein sequence | MNGNERALAR ELRWAGHVGA HAAIVDATSN EGEGGDVRVA RVILMNIDAL THTKVWVRVL GASGDAATDE DAHARWRTTD EACGRRSNVY ALLHVVAAPI GREWWERWIG ERVGACALSV RAFVKNARGF PVLPKEAQAM VRDMFRRNIQ IMLTDWNGAP DGCARETIAP GDVAGDEGRV GVDSAHPMRL YWEYLVYLFR GVEPASEQAL AEAPYRDYLQ APLQPLMDNL ESVTYETFEK DASKYIQYEE AVRCALLDLV PEGDEGSVMV VGAGRGPLVR ASLRASERAN RNIKVCAVEK NPNAVVTLQH LVAKEGWGDR VQIFPGDMRT CAADVRVDVL VSELLGSFGD NELSPECLDG AQRFLKPTGV SVPQSYESFV APIAAAKLHD AVVSYKDLKS IETPYVVKFH RVHHIAEPKS VWEFEHPNNA ARIDNERYAR VEWSSEELGS ASSTLHGFAA YFDATLYDGP AGCVRCSIHP HNHTLGPTGE LMFSWFPMFF PIQTPVYIDR RGASPTKIEF YIWRRVDAHK MWYEWTIAKP VQGHIHNPNG RSYWIGL
|
| |
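The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction helper and a translation check. Because the Translation table field is empty in this record, the sketch assumes the standard genetic code (NCBI table 1); the helper names are hypothetical. Only the first 30 bases of the gene sequence are used as a worked example.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# Using table 1 is an assumption: the record leaves "Translation table" blank.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
fragment = clean_sequence("ATGAATGGGA ACGAGAGGGC GCTCGCGCGC")
print(translate(fragment))  # MNGNERALAR
```

The translated fragment agrees with the first ten residues of the protein sequence listed in the record. The listed gene sequence already reads from start codon (ATG) to stop codon (TAG), so no reverse-complementing is applied here even though the gene lies on the minus strand.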