Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_29678 |
Symbol | |
ID | 5006866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009375 |
Strand | + |
Start bp | 104439 |
End bp | 107904 |
Gene Length | 3466 bp |
Protein Length | 1150 aa |
Translation table | |
GC content | 56% |
IMG OID | 640422287 |
Product | predicted protein |
Protein accession | XP_001422808 |
Protein GI | 145357198 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.00854714 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00187759 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACACGA ATTGGACGCC AGCGACGCCC GCGCGCGCGC GTGCGGACGC ACGAATCGCG TCGGATAAGG TGCTGTCGAC AAAGGCGTTC GAGGACGCGC GGGTGAAGGA TCACCGCGAC GTGGAGCGGG AACGCGGGGT GTACCTGCCG CCGAGCGCGG AAGAGGCGGC GACGCACGCG GACAGCGCGC AGCCGAGACT GCTGAGCGAG GATGGGAAAT TGGAAAAACT TAAATATTTT GACTCCGAAG TAGACTTGGG ATTGGAGTTC GGGGCGGGGG TATCGGGATA CTTTTTCGTG CTGCGGGCGT TCGCGTGCTT GTTCTTCGCG GCGTTCGCGT TGAATTTGCC CGCGATGTTC ATTAATTACA CGAGCACGTA TTACGCGTCG CATTACGAAG CGGAGCCTGA AACCGCGACA GCGCCGACGG CGGACGCGCT CAGTGGACGC CCGACGACGT CGTTTCACTC GTGGTTCGTC GATTGGCACT GGGCGAATCC GATGAGCGCG AGCTTGGGGA CCGTGGCGCC TGACGATGTG CCGCAACGAG CGTGGGTGCT CGCGGGGACA ACGAGTGGGG TGCTGCGGTG GAGCTCCAGT AAGATAGATT TCATTCGATT TTCGGTGGTG ATGGATGTCG TGGTGTCCTT GATAATTATA TTGAGCGTTC CAGTGATTAT GCTTGTGTTG ATGCGTACGG AGCGACGGGT CGAGAGAGGG ACGGCGACGT TGAAGGATTA CACGGTATTA GTTACAGGAT TACCAAGCGA TGCGACCGAT GAAGAAGTGC GATGCTTTTT CGCGCTCAGG TTTGGGACGG TAGCCGACGT CGTGCTCGTA AAGACGGAGT GTATGCAAAT CAACGCGCAG CGACGGCGTC GGCGCTTGTT GGAGGATTAC GACGAAGCCG AAGCTGCGCT CATCGCCGCA GGTAATCGCG GAGGTGACGG CACGAAGACG GCTATAGAAA ACGAGATATT GGCTGTGGAT AGGAAATTGA AGCGCAGAAG AGCGCGAACG CGGGCGAAAC AGTGTAGTGC TTTCATCACG TTTGAAACCG AAGGCTCAAA GATTGATTGT ATATTGCGCA ACACGCGCAG TATAATGTCG TACATCTTTG CGTTTCCTAG GAAAGAGCGT TTCCGAGGAA AGCGGAAGTA CAGAGTGCGA GATGCTCCCG AACCTGAGGA CGTACGATTC GAGAATCTGA ATCTCTCGAA TCGCTCATGG CGTCGACTCG TCGTGCTATT CTCGTGCTCG GCCGTCGTGC TCTTGTGTTA CGGCTTTTTG AAGATGCTCG TGGATGACAA GGAGAAGCTG TGGGAGAACG CAGACATGAT GGTCACTACG CTTGCGGACG ATGTTGGCAT TGTCGTCGCC CACGGTAATC CGGTTGAGCA ATTCGAGACG CATAAGAATC AGTTCAGGAC AGCATGCAGA GCTCGCTTGG ACCAATGCGG CGTGGCTTTT TCCAAGGACA AAACGTACGT TGGCATGCCT TGGGGGGCGC CAATTTACGC TTTCTACGAC TACCCCAACG CCACACTCTT AGATCGTCGT TACGCTCAGC AGGATGCGGT TCGTGATTTG ACGAATTGCG CCCAAGACAC CAACCGTTGT CCGGGAGGTC CGACGATGCA AAACTGTCAC GCGTGCTACT GCGCCAGTCT GAAGTACGGC TTAACTTCGG AGGTTGTTCG GGCATACAAC AAAGCCATTC GCCACTCGTG CGCGAAATAC GTGAACCTCG GTCCAGGAGA GTATTACAAC TGGTTGTGGG TGTCGTTTTG CATTACGCTC ATGAACGTCC TCCTCGAATG GATTGTGCCC CTCCTCGTCG CCGCAGAACG TTTGCGCACG CGCAGTGCCA CGAAGGTGCT CAAGACGAAA ATAATCTTCT GGGTGCGATA TTTGAACGTC GCGGTTATTT ATGGCTTGCT CAACGCCAAT TTCTATCACA TTGGCAGGTA TTTCCCGCTC ATCAAGCAAA TGTTTGGGCT CAAAGGCGAG TACGCAGATT TCACGAGCGA ATGGTTCAAC GACGTCGGCT TGGTGTTGTT TTTCGCAATC ATGATGAGCG TGACGATACG TATTTTGGCT CGAGTACTTG TAGACATCAT CACGCATGTC GGTCGAAAAT TCTCCGTGGC GTACTGTCAC ACGCAAGCAA AGCTGAACAA GGCGTTTGAA GGGCCATCGT TTGACACCGG CGCCAAGTGC GGCGACGTTT GCTTTACGAT TATGGCGGCG ATGACCTTTT CGAGCGGTAT GCCGCTGATA TACTTGGTTT TGTCGATGTA CTTCGTGTTG GTGTACCTGT ACGATTACCG CCTCCTGTTG AAAGTGTGCA AGTTGCCCGA AAGATCGAAG AGCACGCTTC CCATGACGGC GGCGAAGGTG CTTTTCATCT CAGTCTCGAT TCACGCTCTC ATCGGTCTTT GGATGTTCTC GTACCACTGG ACACCTGATT TGGCGAAACC GACGAAAGAC TTTGAACATT CGAGTAAGAA CAATGCTCCT CCGCTTGCGT ATGAGATTGG GGGTGAAATA TTAAACCCGC CGCACGACAA CGGCGCGCTC ACGGCCATCG TGCAAAGCAC CGGCGTCGCG ACGACGTACT TTCATCAGTA TCGCGACGCG ACAGTCGGGG CGTTGTCGGC GGGAGATTTC GTCGCTCCAC CGCCACGCGT ACAGCTGCGT TTCGCAGAAC GACCGTTCAG CGAAGCGGGG ATGCCGTTCA TGGGTATGTT CTTCGCCCTT TTGGGCGTCA TGGGTTTGTG GCAAATCGCA GTCGCGTGGC ATAACTGGGG CAAGTCGCGT CGAGACATCG CGCGCTCGTG GAAAAACTTA CCTCAGTATC ACGAAGCCAT CATGACCGGG TTAATCGTCG GTTCGGAGAC GTACCGTCCC GAATATCAGC CCGATTACGC GTTCTTGTTC GACAAGAGCA CTGTCGCGGC GGCCAAAATG AAACTAGGCT CGTACGCGGG AGGCCCCGTG CGCGGCGACT CGGTGAACTC GCGAGACGCG TGGTCGCTCG GACAAGGCGA CGACGATGAT ATCGCGGAGC TGATTCATCA TCGTTATGAT AACGAACGCG AGTACGACGG TGGCGCGCCT TGGGTGCGTA AGACCGGTTC AAAGATATAC GGCGGCGACG CCGCGACCGC GAAGCGAGGC TCTCGCGGCA CGCGCCAGCG CGACGGCGGA CATTACGGCG TACCAGTGGT CGATGTTCGC GCTTTGGGGG TTGGAAGCGA CTACAATCAC GTCGAAAACA ACGCGTTCGG GCACTCCGAC GCGTTCGTCG CGGATGATCA TGAGTGGGAC TCCGCCTCCG ACGCCGACAC GCTCGACGAC GACGCGTCGA CGCAGCGAAG TCAATCCTTT AGCGATTACG AGAACGATGA TCAACGATTC GACGGCGCCA GGCGACCGGC ATGGCTCGAC TAGGACGAAA CGTACA
|
Protein sequence | MDTNWTPATP ARARADARIA SDKVLSTKAF EDARVKDHRD VERERGVYLP PSAEEAATHA DSAQPRLLSE DGKLEKLKYF DSEVDLGLEF GAGVSGYFFV LRAFACLFFA AFALNLPAMF INYTSTYYAS HYEAEPETAT APTADALSGR PTTSFHSWFV DWHWANPMSA SLGTVAPDDV PQRAWVLAGT TSGVLRWSSS KIDFIRFSVV MDVVVSLIII LSVPVIMLVL MRTERRVERG TATLKDYTVL VTGLPSDATD EEVRCFFALR FGTVADVVLV KTECMQINAQ RRRRRLLEDY DEAEAALIAA GNRGGDGTKT AIENEILAVD RKLKRRRART RAKQCSAFIT FETEGSKIDC ILRNTRSIMS YIFAFPRKER FRGKRKYRVR DAPEPEDVRF ENLNLSNRSW RRLVVLFSCS AVVLLCYGFL KMLVDDKEKL WENADMMVTT LADDVGIVVA HGNPVEQFET HKNQFRTACR ARLDQCGVAF SKDKTYVGMP WGAPIYAFYD YPNATLLDRR YAQQDAVRDL TNCAQDTNRC PGGPTMQNCH ACYCASLKYG LTSEVVRAYN KAIRHSCAKY VNLGPGEYYN WLWVSFCITL MNVLLEWIVP LLVAAERLRT RSATKVLKTK IIFWVRYLNV AVIYGLLNAN FYHIGRYFPL IKQMFGLKGE YADFTSEWFN DVGLVLFFAI MMSVTIRILA RVLVDIITHV GRKFSVAYCH TQAKLNKAFE GPSFDTGAKC GDVCFTIMAA MTFSSGMPLI YLVLSMYFVL VYLYDYRLLL KVCKLPERSK STLPMTAAKV LFISVSIHAL IGLWMFSYHW TPDLAKPTKD FEHSSKNNAP PLAYEIGGEI LNPPHDNGAL TAIVQSTGVA TTYFHQYRDA TVGALSAGDF VAPPPRVQLR FAERPFSEAG MPFMGMFFAL LGVMGLWQIA VAWHNWGKSR RDIARSWKNL PQYHEAIMTG LIVGSETYRP EYQPDYAFLF DKSTVAAAKM KLGSYAGGPV RGDSVNSRDA WSLGQGDDDD IAELIHHRYD NEREYDGGAP WVRKTGSKIY GGDAATAKRG SRGTRQRDGG HYGVPVVDVR ALGVGSDYNH VENNAFGHSD AFVADDHEWD SASDADTLDD DASTQRSQSF SDYENDDQRF DGARRPAWLD
|
| |