Gene Information |
Locus tag | OSTLU_95034 |
Symbol | |
ID | 5004730 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 498440 |
End bp | 500101 |
Gene Length | 1662 bp |
Protein Length | 505 aa |
Translation table | |
GC content | 60% |
IMG OID | 640420151 |
Product | predicted protein |
Protein accession | XP_001420711 |
Protein GI | 145352772 |
COG category | [R] General function prediction only |
COG ID | [COG0679] Predicted permeases |
TIGRFAM ID | |
|
|
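The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes Start bp and End bp are 1-based inclusive coordinates (which matches the stated gene length) and that the protein is followed by a single stop codon. The function names are hypothetical and not part of any database API. The record marks the gene as spliced but lists no exon boundaries, so the remainder computed here is presumably intronic; that is an inference, not something the record states.

```python
# Values copied from the Gene Information fields above.
START_BP = 498440
END_BP = 500101
GENE_LENGTH_BP = 1662
PROTEIN_LENGTH_AA = 505


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_bp(protein_length: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein (assumption: one stop codon)."""
    codons = protein_length + (1 if include_stop else 0)
    return 3 * codons


if __name__ == "__main__":
    length = span_length(START_BP, END_BP)
    # Does the coordinate span agree with the reported gene length?
    print("span matches Gene Length:", length == GENE_LENGTH_BP)
    # Bases in the gene span not accounted for by the coding sequence.
    print("bases outside coding sequence:", length - coding_bp(PROTEIN_LENGTH_AA))
```

A non-zero remainder is consistent with "Is gene spliced | Yes", since a spliced gene span is longer than its coding sequence.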
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.234943 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGATA GCGCAGTCAT TCTCCATCGC GTCGCTCTGA ACGCGGCGTT CATCGCCGTC GGCTACGCGC TGCGCGCGCT GAAAATACTT ACCCTCGAAG ATGGCAAAAC CGTCTTTCGC TTTGCCACGA ACGTGACGCT TCCAGCGCTG TTACTGTACG TCATGACGCG CGCGAGCGCG GTGGGCGCGT CCAGTGGATT GAGTGCGACC GTATCCACGG TGAGCGCGAT CATTCCCGCG TGCTCGTTGC TCGTGGGCGT GGGATGTTCG TTCGGGGCGT ACCTCGCGTA TCGCAAGTCG CCGGCGCGCG CGAGAGGGTT AGCCGTCGGG AGCGCGACGG GGGTGAATTT AGGAATGTTT GCGTACCCGT TCGTGGAAGC GATATGGGGG GTGCCTGGTC TGGCGCTATG CGCGATGTGG GACGCTCCGA ACGCGGTGGT GGTGTTCGGC GCGGCGAAGG CTATTTTCGC CGCCGAGCAA AAGCACGGCG ACGCGTCTCG AGCCGTGCAC GACGACGGCG GGATTTACGA CGGGGAGTGG TTAGATAAGA AAAAGCACGG GTACGGGTGT TACAAGTACC CGAGCGGGGC GACGTACGAA GGGCAGTGGA AGAATAACGT CAAGGATGGC TTGGGGGTGT ACACGTACGG CAAGGGCGGT TCGTACGCCG GCGAGTTCAA GCGCGGTCGG TTCGACGGGA CGGGGATTCG CGTGCTGCGC ACGGGCGCTG TCAAGGCGGG ATTATGGGAA GACAACGAGT TTGTCGAGGC TACGACGGTA AAGGATTGCG AAGGGACGAT TGCGGCGACG AACGCGGCGG TATCGACGGC TCGCAAAGCC GCCGAGGCGA GCAACCTCAC GATGAAGGAT TTATTTTGGA AGGTGGCGAA ATTTCCACCG GTGATCGCGG TGACATTGGC CAGTATGATG AACTTCACTG GTATTGCACT CCCTCAGACT GCTTCGCAGC TCGTCGTGCC GCTGGCGAAC GCGAATAACC CGATCGTGTT GCTCACGCTC GGCGTCCTTT TCAAGCCAGC GATGGACCGA ATGCAAGTGC AAGCGGTGGC TAAATTTATC GGCGTGAAAT ACGGTCTTGG GTTGCTATCG GCGGCGGTAT GTACATTATT CATTCCACAA AGTTTCGCGC TCGCGCGAGG CGTCATAGCC GCGCTGTGCG TGATGCCTGT GCCGTCCATC GTCATGCAAT ACTCGGCCGA GCACGAAAAC GACGGCCAAC TTGCGGCGGC GATTGTCTTG AGCTCGCAAG CCATGACGCT CGTTTTGATC TGTTGTTTCG CCGTCGTCGC GCCGTACATC GTGAGTATTG ATAAATTCGT GTTTTCGGGC GCCCTTCTTG CTGGCGCCGT TGCCGTGGGC GTAGCGAGCG CCGTCGGCGT CCTGGCGTTG AAGCCGTCTC GGGTAGACAA GGCGAAGAGT CCGGGCGTGG CGGTGGCACC GACGGCGAGC ATGCGATCGA CATCCCCACG AAACATCGCT CATCGGCGAC AACGGCGCGA TGTTACGGTA AACATCGCCG TTAATGCCCC TCTTCGTGCG CTGACTGCTC GAGGTTCTTC CTTTACGTCG GCGAAGCGGT CGGCGCCGCA AGCGCCTTTG CGAGCGGCTC TGAGCGGCGG TGTAAAATTA GTAGGATTGT GA
|
Protein sequence | MTDSAVILHR VALNAAFIAV GYALRALKIL TLEDGKTVFR FATNVTLPAL LLYVMTRASA VGASSGLSAT VSTVSAIIPA CSLLVGVGCS FGAYLAYRKS PARARGLAVG SATGVNLGMF AYPFVEAIWG VPGLALCAMW DAPNAVVVFG AAKAIFAAEQ KHGDASRAVH DDGGIYDGEW LDKKKHGYGC YKYPSGATYE GQWKNNVKDG LGVYTYGKGG SYAGEFKRGR FDGTGIRVLR TGAVKAGLWE DNEFVEATTV KDCEGTIAAT NAAVSTARKA AEASNLTMKD LFWKVAKFPP VIAVTLASMM NFTGIALPQT ASQLVVPLAN ANNPIVLLTL GVLFKPAMDR MQVQAVAKFI GVKYGLGLLS AAVCTLFIPQ SFALARGVIA ALCVMPVPSI VMQYSAEHEN DGQLAAAIVL SSQAMTLVLI CCFAVVAPYI VSIDKFVFSG ALLAGAVAVG VASAVGVLAL KPSRVDKAKS PGVARSAPQA PLRAALSGGV KLVGL
|
| |
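The sequences above are printed in space-separated blocks of 10 residues, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up, with a simple GC fraction calculation that could be compared against the GC content field. The helper names are hypothetical, and the short input is just the first two blocks of the gene sequence, used for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and line breaks from a displayed sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence shown above.
    first_blocks = "ATGACGGATA GCGCAGTCAT"
    seq = clean_sequence(first_blocks)
    print(len(seq), gc_fraction(seq))
```

Pasting the full gene sequence into `clean_sequence` and comparing `len(...)` with the Gene Length field is a quick way to spot a truncated copy.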