Gene Information |
Locus tag | OSTLU_24349 |
Symbol | |
ID | 5001431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 71230 |
End bp | 74169 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | |
GC content | 54% |
IMG OID | 640416852 |
Product | predicted protein |
Protein accession | XP_001417134 |
Protein GI | 145345260 |
COG category | [R] General function prediction only |
COG ID | [COG5038] Ca2+-dependent lipid-binding protein, contains C2 domain |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
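The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS length includes the terminal stop codon; the record does not state either convention, but both are consistent with the numbers it gives. The function names are illustrative, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 71230
END_BP = 74169
GENE_LENGTH_BP = 2940
PROTEIN_LENGTH_AA = 979


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)        # coordinates agree with Gene Length
print(computed_protein_length == PROTEIN_LENGTH_AA)  # length agrees with Protein Length
```

Under these assumptions, 74169 - 71230 + 1 gives the listed 2940 bp, and 2940 / 3 - 1 gives the listed 979 aa, which fits a gene reported as unspliced.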
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.000638431 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTGC GGGACGTCAG CGAGGTGGAC GGGACGACGC GGTTGGCGTT GAAGATTACG GGGAGGAACG ATAAGGTGCT GTTGCTGAGT TTTACGAATA GTGCGCCGGC GTTTGATTGC TTAGCGACGA GTTGGAAGCT GGTCCGCGCG GATGCCGACG TGCGAATCGC CACCGGTCCG GGATTGCATC GGAAGAAGAA GTCCGGCGGC GCCGAGGCGA AGAATTTGAT AGAGTACGGC AAGGACTTGT TTGAGGAGTA TAAAGAGACG ATGGAGATCA AGGCGAGGAG TTTCGTAGAG GAGGCGAGTA AACTTGGGGC TAAACCGCGG CCGACGCCGA AAATGCCGAT GGTCGATGTC GAGGATGAGG TGTCGAGGGC GCTGTTTATT CGCTTGGTCC GGGCGACGAA TGTCGTGGCG ATGGATTCGG GTGGGACGTC GGACCCGTTC GCGTCGGTGC GCTACCGTGG GCTCGAGTCG ACGTCAAAAA CGATCTGGAA AACGCTCGAC CCGGAATGGG ATGAGGTGTT CACGTTCAGA GTACCCCCCA ACAAGACGAC GTTGGACGAA ACAGATTTCG TAGAGATGCA CATCTTTGAT CGAGACGTCG CGCTCCATGA TTTCATAGGT TATGTTAAAC TCGATCTCAC CGGTACGCGC GTGTACAGCT CAAAGCGCAC GAAGATGACT CTCGAGTTGA AAAATCTTCC CGCCGACCAG CAGCCAGACT TTTTCGACGT CAATCACTTG AAGGAGAAGC TCATGTTTTG GGAGGGTGAG CGCCAAATCA CGGGTACGGT GGAGATTGAA TACTGGCTCG GGAATCGTCA CGACGCGGAC TACAGGATTG CGGGTGTGCC GTTATTGAGA AAACCTGATC CACGAGCCGG GGAAGCGATG AATCACTTCT GCGATCCGGT ATCGGCACTT TTGCGCGTCG AGGTGAAGTG TGGCAGAAAC ATAATCAACC TAGACGACGA CGACGGAAGC GATCCATACG TTGAAGTGGC TGTAGTTCAG CCAGATGGGA CAGAGGAGAA ACATCAAACA CACTACATCG ACGACGCGAC CGATCCCGAA TGGAACAGCA CCTTCAACTT TATCGCCGCA AAGCCGTACA AGGCAGATTT AGTATTTCGC ATGTACGATT ACGATGGCGT GACAAGTTAT GACGATTTGA TCGGCATGGT ACGCATACCG ATCAGTGAAC TACAAACGCA CAAAGGAATC ACAAAGTTTC CAGACTCGCA GTGGTACACC CTGCTCGACG CTGAGGGCAA AGACTGCGAC AAGGAAGGCA CAAAGTACGG CGATATTGAA ATCAGGGCCT ACCTCGACGA AGAATATTTC GAGCACTTGC ACGGTGGTAA CACAAGTAAG GCCGTAGGTA AGCTTACATT GGACGTTTTG GAGGCAAAAG ACTTAGAAGG CGCGCCGGAC ACGTACGTCA TGGTCAAAAC CGGGCCATAT TGGTCGAGAT TATCCGATCA AAAGGCGCAA AGCAATCCGC AGTGGAACGT GCGTTTGAGA TACCCAATCA TAGAACCGAG CGAACCGGTG ACGGTCGGGG TGTTCAATTT ATCTGATGGC TCTATGATTG GAAAGATAAG ATGTGTTCTC TCTGGTTTAG ACGATGGTTT GCGCTACGAG GATGATTTTC CGCTCAAGAC GGTGAACAGG AGCGGTGTCG TCGTGACGAA TGGCACGCTG CGCTGTTCGT TTACGTTCAA GCACAAATCG ACTGCATCTT TCGCGAGTCG TTACATGCAG CCAGTACTTC CCGATAAGTG GTATATACAG 
CCGCTTTCGG ATACCGAACG ACGGCGCATG CTCAGGGCGC ATTCCATGAT GATGATGAAG CGACTTTACA ACTCGAATCC ATCCATCCCG GAAGTTGTCT CCAAGGAACT CCTCGACTTT TCCAAGCAAG ATGTCAGCAT CAAGAGTATC AAGAGCTCAA TCGCACGGAT GGAGCGCGTC GTAACGAATT TGACATCGAT CGGCGATAAT CTTTCGTACG CGCTGAGCTG GGAGTCTATC CCGCTCACTA TTTTCGTGCA ACTGGTCATG GTGTACGTCA TTCATCATCC GCACATGTTC TTTCCCATGT TCTTCTTGAG CATCGCGTTC CAGTCGCTGA TGCGGTTTCC ATCGCGTTAC CAGCGCACGC TCGACCGCTG CGTACCTGAC GATTGGCTCA CCGTCGGCTT GGCGTTTCCA CCGGATTCCG AAGAAGAGCT CGAGAAGAAG AAAGCGAGCG AAGCCGAGGC AAAGAAAAAA CTTGAAGAGG CGAAGAAACT CGCGTTGGAG GAAGAAAAGC GTAAGGAAGC GGAAAAGAAA GAAGAAGAGA AGGAATCTGA AATTCAGAAG CCTCGCGAAG TGTTTTCGTT CGAAAGCCTA AATCCGCTCG CGGCGTTGCA GCGCCAAATG GATGAAATCA CACAGATGAT TACTGACGCC CAAGTCGTTT TGGATGACGC CGCCGGTATT CTCGAGCGCG TCGTGGGCAT ACTAGACTGG GACGAGCCTC GCGTGACCGC GTGCGTCGTC GTCGGTCTTT TCCTCATCGC TTGGGCTTTC ATCTTCATCG ACGCCGTCAT TCGATTCATA ACAACCGTCG TCGTCGGCGT CTTCGTCAAA ACATTCTTCA CCATCTTCTC CCCGGTCGCC ATCAAATGGG GCGTTTCATT CGCCACCCTC TTCGCTCTAC GCCATCCCGC AATCTTACCC GACGCCGCCA CCGCCGCGAT CGAGGAAGAA AAGCGTCTTC GCCGCGCCGC CGCCCAAGCC GCCGCCCAAG CCTCCAGCGC CGGCGCCAAG GTCGAGAGCA AGGACAAGGC CGAAATCTTC GAGCCCAAAT CCGCGGTTTT CGACCCGAGG CCGCTCGCTC CAGTGAACGT CTTCTACCGA ATTCCGACCC AGGCCACGCG CGTGTTGTAA
|
Protein sequence | MKLRDVSEVD GTTRLALKIT GRNDKVLLLS FTNSAPAFDC LATSWKLVRA DADVRIATGP GLHRKKKSGG AEAKNLIEYG KDLFEEYKET MEIKARSFVE EASKLGAKPR PTPKMPMVDV EDEVSRALFI RLVRATNVVA MDSGGTSDPF ASVRYRGLES TSKTIWKTLD PEWDEVFTFR VPPNKTTLDE TDFVEMHIFD RDVALHDFIG YVKLDLTGTR VYSSKRTKMT LELKNLPADQ QPDFFDVNHL KEKLMFWEGE RQITGTVEIE YWLGNRHDAD YRIAGVPLLR KPDPRAGEAM NHFCDPVSAL LRVEVKCGRN IINLDDDDGS DPYVEVAVVQ PDGTEEKHQT HYIDDATDPE WNSTFNFIAA KPYKADLVFR MYDYDGVTSY DDLIGMVRIP ISELQTHKGI TKFPDSQWYT LLDAEGKDCD KEGTKYGDIE IRAYLDEEYF EHLHGGNTSK AVGKLTLDVL EAKDLEGAPD TYVMVKTGPY WSRLSDQKAQ SNPQWNVRLR YPIIEPSEPV TVGVFNLSDG SMIGKIRCVL SGLDDGLRYE DDFPLKTVNR SGVVVTNGTL RCSFTFKHKS TASFASRYMQ PVLPDKWYIQ PLSDTERRRM LRAHSMMMMK RLYNSNPSIP EVVSKELLDF SKQDVSIKSI KSSIARMERV VTNLTSIGDN LSYALSWESI PLTIFVQLVM VYVIHHPHMF FPMFFLSIAF QSLMRFPSRY QRTLDRCVPD DWLTVGLAFP PDSEEELEKK KASEAEAKKK LEEAKKLALE EEKRKEAEKK EEEKESEIQK PREVFSFESL NPLAALQRQM DEITQMITDA QVVLDDAAGI LERVVGILDW DEPRVTACVV VGLFLIAWAF IFIDAVIRFI TTVVVGVFVK TFFTIFSPVA IKWGVSFATL FALRHPAILP DAATAAIEEE KRLRRAAAQA AAQASSAGAK VESKDKAEIF EPKSAVFDPR PLAPVNVFYR IPTQATRVL
|
| |
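The sequence fields can be checked against each other in the same spirit. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), since the Translation table field in the record is empty. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical; the two short sequence fragments are copied from the start and end of the Gene sequence field.

```python
from itertools import product

# Standard genetic code (NCBI table 1), an assumption: the record leaves
# the "Translation table" field blank. Codons are enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between the 10-base groups and normalize case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 15 bases of the Gene sequence field.
gene_start = "ATGAAGCTGC GGGACGTCAG CGAGGTGGAC"
gene_end = "ACGCGCGTGTTGTAA"

print(translate(gene_start))  # compare with the start of the Protein sequence field
print(translate(gene_end))    # compare with the end of the Protein sequence field
```

Passing the full Gene sequence field to `gc_fraction` would give a value to compare with the listed 54% GC content, and `translate` on the full sequence should reproduce the Protein sequence field followed by a stop marker if the table 1 assumption holds.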