Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_34104 |
Symbol | |
ID | 5000899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 850459 |
End bp | 852699 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | |
GC content | 58% |
IMG OID | 640416320 |
Product | predicted protein |
Protein accession | XP_001416778 |
Protein GI | 145344518 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0215] Cysteinyl-tRNA synthetase |
TIGRFAM ID | [TIGR00435] cysteinyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.121587 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGTCG ACGCGCGCGT GCGCGAGGGT TACTTTTTCG GGCGCGCGTG CGCGTGCGCG CCGCCCACGA GCTGGTGGCG CGATGACGCC GAACGCGCCG CGCGCGACGC CGCGGACGCG CGTCGACGCG CGCCGGCGCT GCTGAACACG TTCACGAAGA CCAAAGTACC GTTTAAACCG CTGAGCGGGA ACTCGGTGGG GTGGTACATC TGCGGACCGA CGGTGTACGA CTCGGCGCAC GTGGGACACG CGCGCAACTA CGTCAACTTT GACGTCTTGC GCAGGGTGAT GATGGAGTAC TTTGGCTACG ACGTGCGGTT CGTGATGAAC GTGACGGACA TAGATGATAA AATTATCATG CGCGCGCACA CGAGACGGGC GGAGGCGGTG GTGAAGGCGG CGAGGGAAAC GGGCGAGACG AGACTCGGAG CGGAGACGCT GGCGGTGGAA AAGTTGTTGG CGGAGGGTGG GAAACCGTTA GGGGCGCTCG ATAGCGCCAC GCGGACGCTG GCGAACGCGG TGAAGGCGGC GATAGGAAGC GGGATCGATG CCGAGCACTG TTGCGCGAAA GATTGGACGA TTCAGGATGG ATACCTGACG CTCGCGCACC AATTCGAAGC CGAGTTCATG GAGGATATGA AATCCTTGGG CGTGGCGCGA CCGGACATGC TGACGCGGGT TTCGGAGTAC GTGGATAAGG TGATTTTGTA CATTCAAGTC ATCATTAACA AAGGATTCGC GTACGAGTCC AACGGCAGCG TGTACTTTGA CGTCAAGGCT TTCGAGGCGG CGGAGAACCA CAAGTACGGC AAGCTGAATC AAAACGCCAT GGAAAACATC ACGGAAGCCA TGGATGGAGA GGGCGCACTC GAGGCGGAAA AGAGCGAAAA GAAGTGCGAT TTTGACTTTG TGCTTTGGAA GATGAGCAAA GACGGTGAGC CGTGCTGGAG CTCGCCCTGG GGTATGGGTC GCCCGGGGTG GCACATCGAG TGCTCCGCGA TGTGCAGCGA CATTCTCGGT CAATCCGTCG ATATCAACGG TGGCGGGATC GATTTGAACT TTCCTCATCA CGAAAATCAG CTCGCGCAGT CGGAGGCGCA TTACGATACC GAACAATGGG TGAACTTTTT CATCCACACC GGTCACTTGC ACATCGACGG GTTGAAGATG AGCAAGAGTT TGAAGAATTT CATCACCATT CGCGCGGCGC TCAAGATGTA CTCGGCGAGA CAAATTCGGT TCTTATTCCT GCTTAACCAG TGGTGCGATC CGATGGAACT CACCCCAGTC GCTGCGCCTG ATGGATCTGG CGTCATCGGC TTTAAGCAGA TGGATCTCGC GCTGAGCATC GAACGCTTGT TTGTGGAATT TTTCCACTCC ATCAAGGGCG TGTTCCGCAC CTCTGGAAGT TACCACGTCG ACAAGCAGCA GACTTGGAAC GAGCGCGAGC GCGAACTTAG TGATGCGCTC GACACGTCTC AAGCCGCCGT GCACGAAGCA CTCATCGACA ATATCAACAC CCCCAACACC CTTCTTGCAC TGCAAGATCT CGTCAAGGCG ACGAATAAGT ACCTCGCGGA GACAGGCAGC GTTGATGTTC GACCACTTCT CCTCGAACGC GTGGGCAAGT TTGTCACGAA GATCCTTAAC TGCTTGGGCG TGTGCTTAGA CACCGGCGCG GTCGGGTTCC CGGAGTCTTC GGAGGGTTCG TCCGAAGGTC GTGAAGAAAC GCTTTCACCG TTTCTTGATT TGATGACGAA ATTCAGAGAC GACATTCGCA AGCTCGCCCA AGGCGGGGCG TCCGCGAAGG AACTTCTCAC CGCGTGCGAC AACCTGCGAG ACGTCGGGTT ACCAGAGCTC GGCGTGAAGC TCGACGACAA GGAGGGCGGT GCGTTGTGGA AGCTTTACGA CGCGGACGAG TTGAAGAAAG AAATTGCGCG CGAGCACGAA GCCAAGGAGG AGAAGGAAAG AGCGAAGCGC GCAGCCAAGG AGGAAGCGGC GCGCAAGGCG GCGGAAAAAG AAGCCAGGGC TAAGGTTCCA CCGAGTGAAA TGTTCAAGAC GTTCAGTGAA TACGAAGGAT TGTACTCCAA GTACGACGAC GACGGAGTGC CGACGCACGA CGCCGCGGGC GAGGCTTTGG CGAAGAGCGC GGCAAAGAAA CTGCTCAAGT CGCGCCAACA GCAAGAGAAG GCTCACGAAA CGTACCTCGC CAAGGCGGGC ATGGAAAAGC TCGCAGTCTA A
|
Protein sequence | MPVDARVREG YFFGRACACA PPTSWWRDDA ERAARDAADA RRRAPALLNT FTKTKVPFKP LSGNSVGWYI CGPTVYDSAH VGHARNYVNF DVLRRVMMEY FGYDVRFVMN VTDIDDKIIM RAHTRRAEAV VKAARETGET RLGAETLAVE KLLAEGGKPL GALDSATRTL ANAVKAAIGS GIDAEHCCAK DWTIQDGYLT LAHQFEAEFM EDMKSLGVAR PDMLTRVSEY VDKVILYIQV IINKGFAYES NGSVYFDVKA FEAAENHKYG KLNQNAMENI TEAMDGEGAL EAEKSEKKCD FDFVLWKMSK DGEPCWSSPW GMGRPGWHIE CSAMCSDILG QSVDINGGGI DLNFPHHENQ LAQSEAHYDT EQWVNFFIHT GHLHIDGLKM SKSLKNFITI RAALKMYSAR QIRFLFLLNQ WCDPMELTPV AAPDGSGVIG FKQMDLALSI ERLFVEFFHS IKGVFRTSGS YHVDKQQTWN ERERELSDAL DTSQAAVHEA LIDNINTPNT LLALQDLVKA TNKYLAETGS VDVRPLLLER VGKFVTKILN CLGVCLDTGA VGFPESSEGS SEGREETLSP FLDLMTKFRD DIRKLAQGGA SAKELLTACD NLRDVGLPEL GVKLDDKEGG ALWKLYDADE LKKEIAREHE AKEEKERAKR AAKEEAARKA AEKEARAKVP PSEMFKTFSE YEGLYSKYDD DGVPTHDAAG EALAKSAAKK LLKSRQQQEK AHETYLAKAG MEKLAV
|
| |