Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_48436 |
Symbol | |
ID | 4999753 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 785164 |
End bp | 789811 |
Gene Length | 4648 bp |
Protein Length | 1503 aa |
Translation table | |
GC content | 55% |
IMG OID | 640415174 |
Product | predicted protein |
Protein accession | XP_001415599 |
Protein GI | 145340990 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | [TIGR02353] non-ribosomal peptide synthetase terminal domain of unknown function |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGAGT GGGGCGTGCG CGCGCACGGT GGGAGAATCG CGGTGGTGGT GCCAAACGGG CCCGAACTCA TGACTTCTTT GATGTGCGTG TTGCAGCGAC ACTGCGCGGT GCCCATCAAT CCGGTGACGA CGCAAGAGGA GATTGAAGAA GAGCTGTTGA ATACGAACGC GAAAGTGTTG TTGTATCAAC GTGACGGGGG TAAGGGCGAT GTTAAGATGC GTCGGTTGTG CAAGAAGCTC AAGTTGACGC CTTTGATCAT CACGCCGAGC CCGACAGTGG CGGGCGATTT CACTCTCATC GGCGATCCGT ACGGCTTAGC CACAGAAGAG GACTCGGCTG GGAACGACAT CGAGCTCATG CAAGCCGACC GCTTGGCTTT GATTCTACAC ACGTCTGGGA GCACGGGGAA AAAGAAGGTT GTTCCCATTG CGATGTCGCA AATCCTTATC GGCGCCGCCG CCATCGCGGC GTCTTGCGGC TTGAATGAAG ACGATATTTG CTGCAACTTC ATGCCGTTGT TCCACGTCGG AGGTATCTTA AGAAATGTAC TGGCGCCAAT CATGTCCGGT GGATCCACGG TTGCTATGCC TTTCTTCGAT GTCGACAACT TTTGGGAAGT CCTCGAGAGC AAAAAGTGCA CGTGGTACTA CGGCGCGCCG ACTATGCACA TGCTCATCGT AAAGTCCGCG GAAACCATGG CGAAGAACGA CAAGGGATCA GTAAAGACAT GTGTGAGGTT TGTCGCCAAC GCTGCGGGCC CGTTACAACC GGCGACCGCG ACGGAACTTC GACGCTTGTT CAACAACGCC TCGGTTTTGC CTTCGTACGG TATGACGGAG TGCATGCCTA TCTCGTGCCC GCCCATGGGA TACGCGTTAG AGCGCCCGGG GACTTCTGGT CGATCAATCG GGCCGGAAAT CGGCATCATA GACGATAGCG GTAATCTTTG CCCTTCTGGT GCGGTTGGGA ACATCATGGT GCGTGGCCCT CTCGTGTTGA CCGGGTACGA GGGCGAAGCG CCCGGTTCGA GCGGATTTGA GCCGGGAGGC TGGTTTAACA CTGGCGACAT GGGTAGGATG GATGATGATG GTTATTTGTA CGTCACTGGT CGGACGAAAG AAGTCATTAA TAGAGGAGGG GAAATCATCT CGCCGGTTGA GATTGAGGAA GCGCTCGCGT CGCTACCGGC AGTGTCTGAG TGCGTCGCCA TCTCTGTCCC ACACGGTACG CTTCAAGAAG TTGTTGGGGT CTTGGTTGTT CCAGTGAAAG GTGGACAAGT TCCCGGTATG CGGCAAATCG TACAGCACGT CGCCAAACGT TTGCCACCGT CCAAGTGGCC GCAGTGCCTT ATTTTAGCGA GTGGCATTCC GAAGTCCATC ACGGGAAAAG TGAGTCGCAG CGTGATCACT AGGCAGCTGC GACTACCTTC GCTCGAAGAT GGTATGCGGG AGTTAGAAAC AACGTTTGAA GCAGACTTTG CGGACAACGC CCTCTCTGGG GCACAGGCAT TGAATGTAGA CAACGCCTCG TCAGAAGTCG TGCAAGCGCT TCTTCTAGTG CCTGGCGTCA ACGACGCCGG AGCTTGGATG GATAGCACCG GCTCAATCAT CGCTGTCGTT ACCCCGAAAA CGCTCGATGC GTCATCGGTG AGCTCGGCAT CCACGCGTTA CCTTCCTGGT TATCTTGAAC CAAAGGAAAT CTTGCCCGTG GCGTCAGTCC CACGAGATGC GCAAGGCGCC ATCGAAGTTT CCAAAGTTTA CGAAATGATG CGCCAAGATT TGACAGACGC TCCAGCGAAC CCCAATCAAG AGCTTCTCGC GCATTTGTGG GGCGAAGTGC TCGGTACGGA TTCGAGCATG ATATCCATCA AGGATGATTT CTTCCTGAGC GGCGGCTCGT CCATTGCGGC GGGTCAACTC GCGGGTTTGA TTCGCAAGAA GTTTGAAGTC AACATCACCG GAGCGGATAT GTTCGAGCTT CGCACGATTC AAAGAATTGC GCAGATGATC GACAAGCGTA CGAAGGAGGC GGAGAGCTCG GATAGTAAGC CGACGCACGC TCCACGACCC GTCGGCCCGG TTGACTGGGT CACGCCGAAG TCAAGCTTGG GCATCATCCC GAGCATCGTG CAGCTAATGC CCTTGTACCT GATCAAGCCA TTCTACAACA TGGCACGTTG GGTTTTGTTC TTGATGTGCT GGTCGCACGT GTTCCACTAC CTCGGTCCGA GCTTGCATTG GATGAGGCGC GAGCACTTCA TGGCTAAGGC TCAAAAGTTT GGCATGAATG AGGCGCATTC GCACTTTTAC AGTACCGTGC AGCTTCTGTC GTTCTTCGTG GCGATATTCT TTGTAGCTTT GATCGTGTCG ATACTGTTCC CACTTAGCGC GATCGTGTTC AAGTGGGTAG TGTTGGGGCG ATTACAAGCG GGATCGTATC CACTTTGGGG GCAGTACTAC TTGCGATGGT TCTTAGTCGA GCAAGTGACC AAGATTGCGG GTCTCGGCAT TTTTGATCTT TCTCCATGGT TGTTCACTTG GTACATGCGC ATGATGGGGG CTTCGATTGG AAAAAGTTCA TATGTTAGCC CGAAAGCCAT CATCGGCGAC TTTGATCTTA TCACGATTGA AAAGAACACC ACTGTAGATG AAGGGTGCAA CATTCGCGCC TTTGAAGCGC GTCGCGGCGG TATGCGTCTG TCGCCAATTT TCATCGGTGA AGGTTGCACG CTGTGCGTCC ACGCGGTGTG CGGGCCGGGC GCATACGTTC CAGCTGGTTC GACACTTGCC GCTTTCACAT CCTGGAGAGA GATGGATCCG CAACAAGTCC GGCCCGTGAG CACGAAAACG GCACAAATGC ACTCGGAAAC GCCGCGGTTG GCGTCGCGTT TGTTCGTCGC GTTTCCCATC GTTTTCACGT GCTTCTTCGT TCGTTGGCTG CCGTGGGTTT TAGTGCTGCG CGTCGTCCAG CAACACTCCA TCGGAGATAA CTTACTCACG CACGCGGGTA TGGCAAACTG GAGACAAGAG CTCGACACAC TTGACAACGA CTACGACGCG CTCGAAGTGT TCGTCACGCT GCTCGGGCGT TTGAGCTGGA CGGATTGCAT CGAATGGTTT ATGGACCCGT TTCGGCTGGG TTGCATAGCG CTTGCGCGTC TGATGCACGC GACTGCCGGC CCGTTTGTAC AGCTTTTCGC GACGATTCTA GTCAAGCGCG TCATCATTGG TAAGTTTAAA TCTGGCGGCT TACCGCAAGC GCCGGCGGCG CGCGAGTGGG AATTGACACG TCGCTACATC ATGCGTAAAC TTTGCCCTAA TGGCAAGTTT TACGGTGCGA CGGATCTACT AGGTAAGCAT TACCAGTACA CAACGTATAT TTACCGTGCG CTTGGCGCGA CGGTTGGTGA GCGCATATTC TGGCCAGGCT CTGGCGTCAT CGTCGGGGAC GGAATGTACG ATCTTTTGCA CATAGAAGAC GATGTCGTGT GGGGATCACG TTCTGCGGTG TATCCAGCAG ATACCATCGG AGCTCTTCCC ATACGTATTC GGCGTGGCGC GAACATCAGC GACCGTTGCG TCATCTTTGG AGGCGTTACT GTGATGCCGA ACGCGTGCCT CGGCTCTGGG AGCGTCGCCG CTCGCAAGAT GACGATTGCG AGCGGGTCGA TTTGGGTCGG CAGTCGCAAT GGCGTAGCCG TACAACTCGA CCCTGGCGGT GCGTCGAGTA TGATGGATGA GCGCCAGACC GCGAAGCCAT TCGGACGTGC AACGTACTTG GGACAAGCGA CGTACCCCGT CGTCCCGTGG TACTTCATGC CCGTGATTTG TATCAGCGTG CAAATTTTCA AAGCCGCCGC GAACATGTTG CCGATTTACG TCGCGTGGTA CGCCACCGCG TACATCTCCA GAGTATACCT TGACGAAGAT TGGCAAACTC TCGACGCGAC ACGGTACTTG AGCATCTTGT TCGTCGTATT CTTGATTTTG CGCGGTGTTC AAACGACGTT CAATCTCGTG AATTCCATCT TTCTGAAGTG GATCATCGTC GGTCGACGCA CGCCCGGAAA CTATCCGTGG GACACTAGTT CGTATTGTTT CCGCTGGAAA CTGTGCGATA TCATGAGCGA CACGACAGAT TTAATGCTTC TCTCCGGAAG CGAACATTTG TGCCGATACT TCCGCGCGAA AGGAGCCAAC ATTGGGCAGA ACGTTTGTTT GTACCCCACG GGAGCGGATC CGCCCATGCC CGAACCAGAC TTGGTCACTA TTGGCGACGG CGCGTGCATC AATTTCGCCC ACGTCATCGC GCACACGAAT ACGTTGGGCG CTTTTGCGCT GAACAACATT CTCATTCGCG AACGCTCGAC CCTCTGCATG GAGAGTCGTG TCATGGGTGG CACGAAAGTC GGAGCAGATT CCATTCTCCT CGAACACACG CTCGCAATGG TGGGTGACGA TGTTGAACGC GGGGGGATCT GGCAAGGATG GCCCGTACAA ATGGTTTTGA ATTCGTGCGA ACGCCCGCGC GGAAAGTCGT CGCCGTCAGG CCCCGAGAAG GCAAGTCCGA TGGCCAAATC ATATGGGGCC ATGTAAATAC TGTATCTACT ATAGACACAT CACATATTCT ATTCGTTTCA ATTTGGTT
|
Protein sequence | MPEWGVRAHG GRIAVVVPNG PELMTSLMCV LQRHCAVPIN PVTTQEEIEE ELLNTNAKVL LYQRDGGKGD VKMRRLCKKL KLTPLIITPS PTVAGDFTLI GDPYGLATEE DSAGNDIELM QADRLALILH TSGSTGKKKV VPIAMSQILI GAAAIAASCG LNEDDICCNF MPLFHVGGIL RNVLAPIMSG GSTVAMPFFD VDNFWEVLES KKCTWYYGAP TMHMLIVKSA ETMAKNDKGS VKTCVRFVAN AAGPLQPATA TELRRLFNNA SVLPSYGMTE CMPISCPPMG YALERPGTSG RSIGPEIGII DDSGNLCPSG AVGNIMVRGP LVLTGYEGEA PGSSGFEPGG WFNTGDMGRM DDDGYLYVTG RTKEVINRGG EIISPVEIEE ALASLPAVSE CVAISVPHGT LQEVVGVLVV PVKGGQVPGM RQIVQHVAKR LPPSKWPQCL ILASGIPKSI TGKVSRSVIT RQLRLPSLED GMRELETTFE ADFADNALSG AQALNVDNAS SEVVQALLLV PGVNDAGAWM DSTGSIIAVV TPKTLDASSV SSASTRYLPG YLEPKEILPV ASVPRDAQGA IEVSKVYEMM RQDLTDAPAN PNQELLAHLW GEVLGTDSSM ISIKDDFFLS GGSSIAAGQL AGLIRKKFEV NITGADMFEL RTIQRIAQMI DKRTKEAESS DSKPTHAPRP VGPVDWVTPK SSLGIIPSIV QLMPLYLIKP FYNMARWVLF LMCWSHVFHY LGPSLHWMRR EHFMAKAQKF GMNEAHSHFY STVQLLSFFV AIFFVALIVS ILFPLSAIVF KWVVLGRLQA GSYPLWGQYY LRWFLVEQVT KIAGLGIFDL SPWLFTWYMR MMGASIGKSS YVSPKAIIGD FDLITIEKNT TVDEGCNIRA FEARRGGMRL SPIFIGEGCT LCVHAVCGPG AYVPAGSTLA AFTSWREMDP QQVRPVSTKT AQMHSETPRL ASRLFVAFPI VFTCFFVRWL PWLDTLDNDY DALEVFVTLL GRLSWTDCIE WFMDPFRLGC IALARLMHAT AGPFVQLFAT ILVKRVIIGK FKSGGLPQAP AAREWELTRR YIMRKLCPNG KFYGATDLLG KHYQYTTYIY RALGATVGER IFWPGSGVIV GDGMYDLLHI EDDVVWGSRS AVYPADTIGA LPIRIRRGAN ISDRCVIFGG VTVMPNACLG SGSVAARKMT IASGSIWVGS RNGVAVQLDP GGASSMMDER QTAKPFGRAT YLGQATYPVV PWYFMPVICI SVQIFKAAAN MLPIYVAWYA TAYISRVYLD EDWQTLDATR YLSILFVVFL ILRGVQTTFN LVNSIFLKWI IVGRRTPGNY PWDTSSYCFR WKLCDIMSDT TDLMLLSGSE HLCRYFRAKG ANIGQNVCLY PTGADPPMPE PDLVTIGDGA CINFAHVIAH TNTLGAFALN NILIRERSTL CMESRVMGGT KVGADSILLE HTLAMVGDDV ERGGIWQGWP VQMVLNSCER PRGKSSPSGP EKASPMAKSY GAM
|
| |