Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_52090 |
Symbol | |
ID | 5007018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009375 |
Strand | + |
Start bp | 312631 |
End bp | 315897 |
Gene Length | 3267 bp |
Protein Length | 1088 aa |
Translation table | |
GC content | 58% |
IMG OID | 640422439 |
Product | predicted protein |
Protein accession | XP_001422868 |
Protein GI | 145357321 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.0580796 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGACG AGCCTTTGAC GCCGCACGAA GAGATTCGTC GAGGAACGCT CGCGAATGGG TTGAAGTACG TCATCTTACC GAACAAAGTT CCCGAGGGGA GATTTGAGGC GCACTTGGAG ATGCACGTCG GGTCCGTGGA CGAACGCGAA GACGAGCAAG GGCTGGCGCA CCTCGTCGAG CACGTCACGT TTTTGGGTTC TCGAAAACGA GATCAGTGGC TCGGGAGCGG TACGCGAGGG AACGCGTACA CGGATTTCCA CCACACCGTG TTCCACATCC ACGCACCGAC GACGAACAAG GATGGACATT ACATGCCGCC CAACGTGCTG GACATTCTCT ACGACGTCGC GTTTGCGCCG CAACTCTTGG ACACTCGCGT GGCGAAGGAG AAGAAAGCCG TGCTCGCCGA GGCTCAAATG ATGAACACGA TCGAGTACAG AGTGGACTGT CAGCTTCTAG AGCATTTGCA CTGGGACAAC CTCTTGGGCA CGCGATTTCC GATCGGTAAG CTCGATCAGG TGGAAGCGTG GCCGGCGCAG GCGGTGCGAG ACTTTCACGC GCGCTGGTAT TTCCCAGCCA ACGCCACGCT CTACGTCGTG GGCGACTTCG ATGCCTCCGT GGACGAAGTC GAGGGTATGA TTTCCGCCGC GTTCGACGAA GCGGCGCCCG CGGAAGGCGC GGAAGAAGCC GAGTCTCCTC TCAAACGTCA CGCGGTGCGT CCTCCGGTGA AGCACGCGTA CGGGGCGCCG TCGTACGAGC TCGACGAAAT TCAACGCTCG AACGACGAGG CGAAGGCGTC GGGCAAAGAC GACCCGTTCT TGCCTTTCGT CGCCCCCGAG GGTAAAGTGT CCATGTTCCA GCACGAACAC TTGTCCAACG CGAGTTTCAA CATATTTTCA AAATTGGCCG TCAAGCCGCT CGAAAAGATG GGCGACTTGC ACCGGACGAT TTTGCAGCGC ATTGTGTTGT TGGTGTTGCA GAGCCGCATT CAGGCGAGAT ACGCGGAAAC TAATGCGGAT TACAAACGCA TTGAACTCGA TCATAGCGAC AGCGCGAGAG AGGGGTGTTG CGTCAGCACC GTCACCGTGA CGTGCGAGCC GAGAGAGTAC GAATTCGCCT TGCAAGTCGC CGTCGAGGAA TCTCGACGAT TGCAAAAGTT TGGATTGACG CCGAGCGAGC TCGATCGTTT TAAGGCGGCG ATGTTGAGAG ATTCTGAACA GCTCGCGCAG CAGGCGGGAT GCGTGCCGAG CCTCGAAAAT CTCGACTTTG TCATGGAGCA GGACGCCTGC GGACACGTCG TCATGGACCA GGTCGCGGGT CACGAGGCCT TGGTGCGCAT GTCTGATTAC ATCACGCTCG AGGCGTGCAA CGGCGCGTGC GATGAGCTTT TAGGTTTTAT CGGTGAGTAC GGCGTGGAGA ACAATCGCAA ACCGAACAGC GGTAAGTGCA CCGCCATCGT CGCGTGCGTC CCAGCGACGA TGACGAACGT CGACGGCGAG ACGGTGCCGT TCGATATCAC GCCCGAGCGC ATAGAACAAG TTTTGGCGGC GGATTACGGT GAAATCACAG AGCCCGAGGA TATCTTTGTG CCGGAGGTGT TGATTGCGGA TGACGAAATC AACGCCCTCA TCGAGCAAAC GGCGCCGACG TTTACCGAGG CGACGTATCA CGAGCCGACC GGCGTGTACC AGCGCACGCT GAGTAACGGT ATTCGCGTCA ACTACAAGGT GCTCGACGCC GAGCCTGGGA GCGCGTTCTT ACGACTCGTC GTCCCGGGTG GTCGTTCGGT TGAGAGCCCG AACATCGGGC CCGGCGGAAT CGGTTCGTCG GCGGTCGGTT TGCGCACCGT GCAAGAGGCT GGCGACGTCG GCGAATGGTC GCGCAAGCAA ATCGAGCTCT TGACGATGCA ACACTTGCTC GTGTACGACG TCGAGCCCGA GGTCGAGTAC ATGTTTTTGG ATTCCGCGTT CGCCACGGAT GGAGGCTTGA GGACTATTCT CGAGATCATG CACCTCACGC TGACGAAGCC GACGTGGGAC GAGCAAGCGC TCGAGCGCGT CAAGGACATT TACCGCATGT TTCAGATTAA CACGAACAAG AACATCGAGT TGCTCACGCA TGACACCGTC AACGCCGTGA TTTACGAAGA CCGAAGAATC ATGGACCCGA ACAAGGAGGC ACTCCAGGCG CTGACGCTGG ATGGCGTTCG AGACATGATC GAAGCTCAAT TCGCGAGCGG GGCTTTGGAA CTCAACATCG TCGGCGACAT CATCCCCGAA GAAGTCGACG AAATGGTTCT CGCGTTTATG GGATCGATTC AGACCAAGCC CGCGCCGCCG TTGCCGCAAG TGCCCCCGCT GAAATTGAAA GAGGTGCCAA AGGATGATCC GGTGCGCGCG CAGCGGTTGT GGTTGAAGGA CAGCGACGAA AGAGCGTGCG CGGTGGCCGC CGGACCGGGG CCATCCATGT GGGCGCCGAT GACGAGTCTG TACTCTGAAA TTCCGAGCTT TGTCAATCAA GACGCGTACA CGCCGGAACA GATTGCAGCC TTGCGCGATC CCGTGAACGA GGTGGCGTCG GCGAAGGGGA ACCCATTCGC ACAGCAGTCT GCGCGACGCG CGAATTCTTT GGCGACGTAT TGCGCGGGGA TGATGCTCGC AGAAGTTATC GGTGGTAGAT TGTTCACAAC GGTGCGCGAC GCGTTGGGTC TGACGTACGA CTGCAACTTT ACAATGTCGT TTGGATTACA AAACAACGAC GCGACGACGT ACAGGTTGTT GGTGACGTCG ACGCCGGAAA AAATTGATGA AGCTCTCAAC GCCGGCGTGC GCGTGTTGCG TGGATTTCAG ATGCAGCGCG TGAGCCAACG CGAGGTCGAT CGCGCGCGTT TAACGCTGTT GAGTCGTCAT GAGATGGATT TGAAGACGAA CAACTACTGG GCCGATTTGA TGCAGTGCAC CAACACTCCA GATTTAGCGC CGCTGAAGAA GATTCAGTGC GTCGCTGATT TACCCCTCAT GTACGACGCG ATGACCGTGG ACGATCTTCA AGAAGTTTAC GATCGTCTCG GCTTGAGCGA AGGCGAGATC TTCACCAGCG TGACGATCGC CGGGCAGACA GAGCCGCCTC CATTCCTGTC GAAGAAGGAC GTCGACGCCG CCGCGCTCGC CGCCCAGACG CTGACCGCCG CGCTTGGTGG CATCAACATC GCCGAAGCGA TCAAGAGATT GAAGAAAGAG CAGTCTCCGT CCGCGAGCGC GGAGTAG
|
Protein sequence | MLDEPLTPHE EIRRGTLANG LKYVILPNKV PEGRFEAHLE MHVGSVDERE DEQGLAHLVE HVTFLGSRKR DQWLGSGTRG NAYTDFHHTV FHIHAPTTNK DGHYMPPNVL DILYDVAFAP QLLDTRVAKE KKAVLAEAQM MNTIEYRVDC QLLEHLHWDN LLGTRFPIGK LDQVEAWPAQ AVRDFHARWY FPANATLYVV GDFDASVDEV EGMISAAFDE AAPAEGAEEA ESPLKRHAVR PPVKHAYGAP SYELDEIQRS NDEAKASGKD DPFLPFVAPE GKVSMFQHEH LSNASFNIFS KLAVKPLEKM GDLHRTILQR IVLLVLQSRI QARYAETNAD YKRIELDHSD SAREGCCVST VTVTCEPREY EFALQVAVEE SRRLQKFGLT PSELDRFKAA MLRDSEQLAQ QAGCVPSLEN LDFVMEQDAC GHVVMDQVAG HEALVRMSDY ITLEACNGAC DELLGFIGEY GVENNRKPNS GKCTAIVACV PATMTNVDGE TVPFDITPER IEQVLAADYG EITEPEDIFV PEVLIADDEI NALIEQTAPT FTEATYHEPT GVYQRTLSNG IRVNYKVLDA EPGSAFLRLV VPGGRSVESP NIGPGGIGSS AVGLRTVQEA GDVGEWSRKQ IELLTMQHLL VYDVEPEVEY MFLDSAFATD GGLRTILEIM HLTLTKPTWD EQALERVKDI YRMFQINTNK NIELLTHDTV NAVIYEDRRI MDPNKEALQA LTLDGVRDMI EAQFASGALE LNIVGDIIPE EVDEMVLAFM GSIQTKPAPP LPQVPPLKLK EVPKDDPVRA QRLWLKDSDE RACAVAAGPG PSMWAPMTSL YSEIPSFVNQ DAYTPEQIAA LRDPVNEVAS AKGNPFAQQS ARRANSLATY CAGMMLAEVI GGRLFTTVRD ALGLTYDCNF TMSFGLQNND ATTYRLLVTS TPEKIDEALN AGVRVLRGFQ MQRVSQREVD RARLTLLSRH EMDLKTNNYW ADLMQCTNTP DLAPLKKIQC VADLPLMYDA MTVDDLQEVY DRLGLSEGEI FTSVTIAGQT EPPPFLSKKD VDAAALAAQT LTAALGGINI AEAIKRLKKE QSPSASAE
|
| |