Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_86314 |
Symbol | |
ID | 4999386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 1067041 |
End bp | 1070166 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | |
GC content | 60% |
IMG OID | 640414807 |
Product | predicted protein |
Protein accession | XP_001416019 |
Protein GI | 145341857 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAGCG ACGGCGATCT CGCGAACGAT GAGAACGCCG ACGCGGTCGC GCCGAAGAAA CGCGCCGAAG CGATCGCGAA CGATGACGGT GACGCGTTGA ATCCGTTGGT GCCGCTGCGC GCGCTCGACG ATGAGACGCT CGCGAGCGCG CGCGCGACGC GGGAGAGCGA CGCGGACGCG CGGACGACGG TGGAGGATGA CGTGCGCGCG GAGGAGTTCG CGTTCGCGGA GGCGCTGATG CGGGTGAAAC GGGAAGGCGC GGGCGCGGGG GGGGCGGCGG CGATGTTCGA AGACGCGTGT CGGTTGCGAG CGCGGGAACT GCGGACGCGA GCGGAGGCGC GAAGACGGCG AGCGCCGAGC GTGGCGAAGC GTGAGGTGGG CGAGGCCGAG GCGCTGGAGC GGGAGGCGCA CTCGTGGTCG TTGATTTATC ACTTGCTCGG CGACGGGGCG ACGGTGGAGC GGGAGAGCGC GGAAAAGGAA ACTGAAGTGC TGCGCGCGAC GCCGAGGGAG GAGGGCGGGA CGAGAGGAGA CTTTCTTCCG CCGCCGCTTC GCAGTCGACT GCGATGCGCG TCTCGAGACG AAGAGCGAGA TCCGGTGACG TTTAGATTGA ATAGAATAAT CGCGTGGTTG GAGGCCAACT CGGCTTCAGC GCTTCGCAGA GCGGAGCTGG ATGGCACGGC GTACGATGGA AGGTTTTTGC GAGATGAGTG CGATTGGCGC GCAACCGCGG ATGCCATCGA CGCGTCGGCT AAGTGTGATC CGGATGGTAA TCCGCTGTCG ACGTCGCTGG ACCCGGACGG GCCCATGCGC ACGAAATCGG CGCTGCACCC GTCCAACGCG GACGCGGAGG TTCGATTGTG TAAACGGTTG TGGAAGCTCA TTCGCGCTGG GAGCGTGCAA GAGGCGCGCG ATTTGTGCTC CAAAGTCGGT CAGCACTGGC GCGCCGCCTC GCTTGGCGGT GCCTCGGGCT GGGGACCGGC TCCAGTCGGC AGCACTGCTG ACGAAGAGCT CGAGCGAGAC ATTCGTAAGC TCTTGGCGCT TCGCGACGAG GACGCGCTCG CGGCACAAAA TGAAGTTGAC CTCAACGACG ATGCTACTGC GGCCGAGTGC GACGGTATCG GCACTGCGCG TCGCGCGCTG TGGAAATGGA CCTGCATGGT AGCCGCACGT CACATCGACA AAGCCGGCAA GCTTTCGCAA ACGCCCGCGG CCAAGTATGA GGCGAGTGTG TACGGGGCGC TATGCGGTGA TTTACAAACG ATGCTCGCCG TTTGCGAGGG TGATTGGGAG TCCACGGCGT GGGCTTACAC CCGAGCGCTG TTCGATCTTC GGGTCGATGC CGTCGTGAAC ACGGGCAAGG TGCTCGACGA CGTGTCAAAC TTTGAACCTG GCGAAGTCGT GCGAGATCCG ACTGAGCTAG AGACGACGGA TGACGCTGTG GATCGTTTGG GCGAACCACG ATGGCCAACG CGAGACGTCA TCAACGCGAC GCCAAAGACA GTGGAAGAAA TTTTGCTAGT CAAGATGCCC GAACGTTTTC CCGACGCCGA CGCGCATCGA ACGGTGCAGA CGCACTTGAT TCTCGGTAAG ATGAAGGAGT TATTGTTGGA CCATATGATG CGATGGATCT TCCCTGAGGA CGAGCTTGAT TCAAATGTCG AACGTGTAAG TTCCGAGCCG CTCGACATTG GTCTCACCCG CTTTACCGCG CATGCGCTAC TGTTTTTAGA GTCGTTGTTG CCGGAAGGTG GTGGATTATC TCCCGGAGGC GAGCTTTACT TTCACTTGAA TAAGGTGCTC AACTTGTACG TTGTGCACTT GATCGCAAAC AAGCGTTACG CGTTGGTGCC AGCGTACGTG GTGCACTTGA GACACCCTCT GTTGATTGAA ACGTACGCCA ATTTCTTGGA TCTCCTGGCG CCCGCCGTGC TTTCTCGCAA GACGCTGTGT TACGCCGAGG CGGCGCTTTG GATGGAAATA GAGGGCCCGG GCGGGTGGCG AGAAATCGTC ACGAGAGCCT TAAGCGACTC CACGAGTCTC GTGAACGTTC ACCGAGGACC CGAGTATCGA CGTTTGATGC TCCAGTGGGC GTGCGTTACG AGTGAGACGT ACCCAGAAGC GGTGAAACAC GCGTGTTTGC TGCTCCGCCA ACTCATGTGC CAGCGAACGT CGATCGAGGT CTTTGCGAAT GACGCCCCAG TGGATGGTGA ACTCCGTGCT CGCGTCATCC TACTCGAGGA GCTTCCAGAA GTCGCGCAAG AGGAAGCGAG AGCGAACGGA GCCGCGGCCG CTGCGGCTGA ACTTGCCGAC TGGGCGAGAT ACCTCGCCGC CACGGAGGCT ATTTCGCAAT GGAAGCAAGT CTGGAGCGTG AACGAATCGA GACGACTAGA CGTAGCGGCA CAAAACACTC GCCCGTACGC TCCAAATTCA GCGGGGGAAG TGACTGAAGA CGAGCTGGCG AAGGCACGCG ACGCCATCGA CGCCGTCGTG GCGCTTCTTC GGTCGGAGAA TTGGCTCGAC GACGAAGCGT TGTACGACGA CATGGAACAA ACGGGCGACG CGACGCTACG GGTTGTCGCG ATTCCCGTCG CCAAAGCTTT CGACGACCCA TTGACTTCGG CGAATATGAG TATTGAGCAA ATCGCCCGAG ATCTCGAGAC GTTGCTTGGC TCAAAGTTCG CTCAAGGCGT CGTTGAGGTG AGCGCGACTG TAGGCGTCAC GCCCGGCGAG TGTCCTGCGC GCGTGGAAGG TGAGTACGGT CAAGTCGTCG TGCAGATCTC CACCGAATGC AACGATGAAG ATCGAGCGTC GCTATACCAA GACGTCTCGC TGGCGATGGC CGACTGCGTC AAGGGTGATC TCCCAGGACA AGAAGTCACG CTCGACGTAC AATCAGTCGG TGGCAGTAGC GAGACGTTGG TACATGCGTT ATGTCGGGCA ATCTGCGTGC CTTCGCTCGT GATCCAAGCG GCGCAAGTCG AAGCCGCCAC GCGCACTGGA ACGACACAAA TAATTGAAAT GACCGCCGAC CCGAAGTTTG GTGTGCACAA GTATTTCGCC CCGACGGAAC TCCGATGGTT GTTGGAACTC GGACGCGAGA TCGGTTTGAC AATTTTGGAC AAATAG
|
Protein sequence | MPSDGDLAND ENADAVAPKK RAEAIANDDG DALNPLVPLR ALDDETLASA RATRESDADA RTTVEDDVRA EEFAFAEALM RVKREGAGAG GAAAMFEDAC RLRARELRTR AEARRRRAPS VAKREVGEAE ALEREAHSWS LIYHLLGDGA TVERESAEKE TEVLRATPRE EGGTRGDFLP PPLRSRLRCA SRDEERDPVT FRLNRIIAWL EANSASALRR AELDGTAYDG RFLRDECDWR ATADAIDASA KCDPDGNPLS TSLDPDGPMR TKSALHPSNA DAEVRLCKRL WKLIRAGSVQ EARDLCSKVG QHWRAASLGG ASGWGPAPVG STADEELERD IRKLLALRDE DALAAQNEVD LNDDATAAEC DGIGTARRAL WKWTCMVAAR HIDKAGKLSQ TPAAKYEASV YGALCGDLQT MLAVCEGDWE STAWAYTRAL FDLRVDAVVN TGKVLDDVSN FEPGEVVRDP TELETTDDAV DRLGEPRWPT RDVINATPKT VEEILLVKMP ERFPDADAHR TVQTHLILGK MKELLLDHMM RWIFPEDELD SNVERVSSEP LDIGLTRFTA HALLFLESLL PEGGGLSPGG ELYFHLNKVL NLYVVHLIAN KRYALVPAYV VHLRHPLLIE TYANFLDLLA PAVLSRKTLC YAEAALWMEI EGPGGWREIV TRALSDSTSL VNVHRGPEYR RLMLQWACVT SETYPEAVKH ACLLLRQLMC QRTSIEVFAN DAPVDGELRA RVILLEELPE VAQEEARANG AAAAAAELAD WARYLAATEA ISQWKQVWSV NESRRLDVAA QNTRPYAPNS AGEVTEDELA KARDAIDAVV ALLRSENWLD DEALYDDMEQ TGDATLRVVA IPVAKAFDDP LTSANMSIEQ IARDLETLLG SKFAQGVVEV SATVGVTPGE CPARVEGEYG QVVVQISTEC NDEDRASLYQ DVSLAMADCV KGDLPGQEVT LDVQSVGGSS ETLVHALCRA ICVPSLVIQA AQVEAATRTG TTQIIEMTAD PKFGVHKYFA PTELRWLLEL GREIGLTILD K
|
| |