Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25379 |
Symbol | |
ID | 5005073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | + |
Start bp | 76938 |
End bp | 78736 |
Gene Length | 1799 bp |
Protein Length | 577 aa |
Translation table | |
GC content | 55% |
IMG OID | 640420494 |
Product | predicted protein |
Protein accession | XP_001420898 |
Protein GI | 145353175 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.078128 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00312899 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGCACG TGTTCGAACA CCTGCAGCTG GAGCTGATGC TGCTGGGGAC GCTGTCGCTG CTGCTGACGG CGCTGCAGGA CGCGCTGATG AAGATTTGCG TGAAGGAGGA GGGGTCGTAC GGGACGGATG AGTGCCCAAA AGGCGAGGGA CCGCTGTGGA GCGCGACGAC GCTGCATCAG ACGCATATTT TCATATTCAT TTTGGCGTGC ACGCACGTGA GTTACGTGGC GGTGAGCGCG TACGTGTGCT CGTGGAAGCT GCGGCAGTGG CGGCGTTGGG AGACGGAGGG GGAGGTGAAG GTGCACGCGT TGAATCCGAA GATTAACCCG AGGAACGCGA CGGGGATTGT GCATTTGGTG TGGCGCGCGT TTTGGTCGCA GTTTCGATTC GCGGTGGACA AGGGGATGTA CTTGTCGTTG CGAAGATTGT TTTTGGAGCG CACGGGGGCG ACGCACGATT TCAACTTTTA CGATTACTTG CGCGAGTCCA TGGAGGAAGA CATGAGCTCG CTCATCGGGA TGACGGTGTT GATGTGGTCG ATGGCTACGA TTTTCGTGAC GGTGCCGCAG GCGTTGTTCT TATCCGCGGG GATCGTGTGT CTGGGCGTCA TGCTCTTCGT GGGGACGATG CTCGAGAGCG TGGCGTTGCG TCTCGCGCAA GCTGCGTACG AACGTTTCGC CGACGAGAAC GAACTCGAAG AGAGGTTGGC GGAAGAAGAG CTAGACATTT CTATCGAAAA ATCACCCACT GTGCGGCGAC GCGAGCTGCG AAAAGAGATT GATTCGCAAA ATTTCTTTTG GTTGGGTCGA CCGCGTTTGC TGCTCAAAGT TTACCAATTC GTGTTGTTTG AAAACGCCAT TTCACTATCC ATGCTCATCT TCAGCATGTG GCAAGATAAA AAGTGGTTGA CCTACAACGC GAGTATGTCG GTCGGCACCG CCTGGGCGCT GTTTGCCGTC GACGTCTGCG TGTTGATGCA CAGCGCTTTG TTCATTCTTC CCGTGTACGC CATCACGTCG ACCGTTGGTT CACATTGCGC GACTTCTTTG CAAGAATACG CCGATAAACT CGGCATCACG CGCGAGGCCG CTTTACAAGC GTATTTAGAA CGCGCCAATG AGTCGATGTC CACAGCTGAC GCCGCAGAGG TGGCGGCGTA CGATTTGAGT TTACTCGGGC GTGACGTCAA GGAGGTCGTC TCGAGCGGAG ATTTCGATCA AGAAGCCGTG CCAGTTTCGG CGCTACCTAA AGACTTACAA GCTGTCGCGG GTGCGTCTAG TCGTCAATTA TCCGCTGGTC TCAAAGATAT CCAGGCCAGA GGCCAGGCGC GAAAATCGTG GGCGAAGGCG GCGGGCAAGC TCGCGGCTGG GCAAAAAGAT TACTCGCGAG AGAATGAAAA ATCAATCACC AGTTTGCTCG GCGCTATCTT GTCAAATCAA ATGAAAGCCG AGCTCGCAAA GCAAAAGCGA GAAAAAGAAG CGAAAGAAGC GGCAAGAGCG GCGAGTCCTG GTGTTGTCGG AGCGTTGCAG CGAACATTTT CGAAGAAGGA TATAGCCACG ATGTCCGTCG GTCAACCATC CGCCGACGAC GTCACCGAAG ACTCGCCAAT TCCAAGTTCG AAGGTCTTGG TTAGTAAACC GTCGATGAAG GACGTCTTTA GCATGGCCTC GCCTCCTCCT AAGCCAACAA TGCGTAGCTC GCTCGACGAA GAAAAAATCG TGGAGGAGCC GTAAGATGAG ACGTCTGAAT GAATCGCGAA ACGCGTAAAC GTGTTGATAT TATCCTAATA CGATCAAAC
|
Protein sequence | MLHVFEHLQL ELMLLGTLSL LLTALQDALM KICVKEEGSY GTDECPKGEG PLWSATTLHQ THIFIFILAC THVSYVAVSA YVCSWKLRQW RRWETEGEVK VHALNPKINP RNATGIVHLV WRAFWSQFRF AVDKGMYLSL RRLFLERTGA THDFNFYDYL RESMEEDMSS LIGMTVLMWS MATIFVTVPQ ALFLSAGIVC LGVMLFVGTM LESVALRLAQ AAYERFADEN ELEERLAEEE LDISIEKSPT VRRRELRKEI DSQNFFWLGR PRLLLKVYQF VLFENAISLS MLIFSMWQDK KWLTYNASMS VGTAWALFAV DVCVLMHSAL FILPVYAITS TVGSHCATSL QEYADKLGIT REAALQAYLE RANESMSTAD AAEVAAYDLS LLGRDVKEVV SSGDFDQEAV PVSALPKDLQ AVAGASSRQL SAGLKDIQAR GQARKSWAKA AGKLAAGQKD YSRENEKSIT SLLGAILSNQ MKAELAKQKR EKEAKEAARA ASPGVVGALQ RTFSKKDIAT MSVGQPSADD VTEDSPIPSS KVLVSKPSMK DVFSMASPPP KPTMRSSLDE EKIVEEP
|
| |