Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33779 |
Symbol | |
ID | 5001024 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 706040 |
End bp | 707879 |
Gene Length | 1840 bp |
Protein Length | 578 aa |
Translation table | |
GC content | 59% |
IMG OID | 640416445 |
Product | predicted protein |
Protein accession | XP_001417029 |
Protein GI | 145345035 |
COG category | [S] Function unknown |
COG ID | [COG3349] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGGCG CGCAGGCCTC GCGCGCGCGC GCGCGCGCGC ACGTCGCGAC CAAACACCGC GCTCGTCCTC GTGCGCGCGC GTCGACGCGC GTCGCGGCGT CAGCGACGTC GTCAGCGCGC GCGAATGCGA TATCCAAAGT CGTCGTCATC GGTGCCGGCT GGGGCGGCAT CGGCGCCGCG AAATCGCTGT GCGAAGCCGG GGCGGACGTG ACGCTCGTCG ACGTGCAAGA CGACCCCACG GGCGCGACGC CGACGCTGAC GAAGAGCGGG AAACCGTTCG AGGCGGGAAC ACGCGGTGCG TGTAGAATGA CGACGAAAAG AACGACGACG CGCGTCGCTC GCGTGCGCGA GAGAGCGCGA CGACTGACGA TGAAACGAAA CTTTGTGCGA TGATTCAGGG TTTTGGAAAG ATTATCCTAA TATCTCCGAT CTGTGTCGAG AGATGAACAT CGACGAAAAG GATGCGTTCA CAGAATTCAC GCCGAGTTCG TTTTGGTCGC CGGACGGGTT GGAGGCGACG GCGCCGGTGT TTGGGGATTC GATGGCGCTC CCGAGCCCTC TCGGACAGGT GTTTGCGACT TTCGATAACT TTAAACGACT GCCTCTGAGC GATAGGGTGA CGATGGTCGG CTTGCTGTAC GCGATGTTGG ATTTGAATCG CGATGAGAAG ACGTTTGAGG CGTACGATAG GCTCACTGCG CACGAGCTGT TCATTCGCAT GGGATTGAGT AAGCGACTGG TGGACGATTT CATTCGCCCG ACGTTGCTCG TCGGGTTGTT CAAGCCACCA GAGGAGCTTT CGGCGGCTGT CGTGATGGAA TTGTTGTATT ACTACGCGTT GGCGCACCAA GACTCGTTCG ACGTGCGCTG GATCAAGACG AAGAGTATAG CCGAAGTCAT CGTCGGGCCC ACGATGGCTC GCCTGCAGAG CGAATACGGA TTGAAAGTCA TGGGCTCGAC GTTTGTCAGC AAAGTTGAAG TGGACGAGGC GACAAAAAAG GCGACGGCGG TGCACTACCT GAAAAAGGAC GGCGGGAAAG CAGGCGTGAT CAAAGACGTT GATGCGGTGG TCTTCGCGCT CGGCGCAAAG GGCATGAAGA GCGTGGTGTC CAACTCTCCC GTCTTGGCTC GCATGGCGCC AGAGTTTAGC GCCGCAGCGT CGCTCGGCGG CATTGACGTC GTGGCGACGC GAATCTGGTT GGATCAGTAC GTGGACGTGC AGCATCCAGC GAATGTATTC AGTAGATTTG AAGCCCTTCG TGGCGCCGGG GGTACTTTCT TCATGTTGGA CCAGTTGCAA AAAGACTCGG AAGTTGAATT GTGGGGTGGC GAAGAGCCAA AAGGAAGCGT CATCGCCGCC GATTTCTACA ACGGCGGCGC CATCGCGTGC TTGAGCGATG ATGATATCGT AAAACTATTA ACGGACGAGC TCCTTCCGGC CGCCGTACCG GGATTCAAGG GCGTCAAAGC TGTCGACTTT GAAGTGCGAA GGTACCCGGG AGCGGTTTCC TGGTTCTCTC CAGGCTCGTA TTCGAAAAGA CCGCCGCTCG AAACGTCCAT TTCAAACATC GTATGCGCGG GAGATTGGGT GCGCATGGGC GACAGAGAGC ACGGCGCTAA GGGCTTGTGC CAAGAGCGCG CCTACGTCAG CGGTTTAGAG GCTGGAAATT CTCTTTTACG CAGATGCGTC GTCTCCGGCG CGGGCGTTTC CGGCGGCGCC AGCCATCCCG TCATTCCGAT TCGCCCCGAC GAAGCCCAAG TCGTGCTCGG CCGCGCGCTG AACAAGCAAA TTATGGACAC CCTGAGCCCG TTCGGCCTGG CGTCGCCGTG GATTCGTTAA
|
Protein sequence | MLGAQASRAR ARAHVATKHR ARPRARASTR VAASATSSAR ANAISKVVVI GAGWGGIGAA KSLCEAGADV TLVDVQDDPT GATPTLTKSG KPFEAGTRGF WKDYPNISDL CREMNIDEKD AFTEFTPSSF WSPDGLEATA PVFGDSMALP SPLGQVFATF DNFKRLPLSD RVTMVGLLYA MLDLNRDEKT FEAYDRLTAH ELFIRMGLSK RLVDDFIRPT LLVGLFKPPE ELSAAVVMEL LYYYALAHQD SFDVRWIKTK SIAEVIVGPT MARLQSEYGL KVMGSTFVSK VEVDEATKKA TAVHYLKKDG GKAGVIKDVD AVVFALGAKG MKSVVSNSPV LARMAPEFSA AASLGGIDVV ATRIWLDQYV DVQHPANVFS RFEALRGAGG TFFMLDQLQK DSEVELWGGE EPKGSVIAAD FYNGGAIACL SDDDIVKLLT DELLPAAVPG FKGVKAVDFE VRRYPGAVSW FSPGSYSKRP PLETSISNIV CAGDWVRMGD REHGAKGLCQ ERAYVSGLEA GNSLLRRCVV SGAGVSGGAS HPVIPIRPDE AQVVLGRALN KQIMDTLSPF GLASPWIR
|
| |