Gene Information |
Locus tag | OSTLU_36485 |
Symbol | |
ID | 5000379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 249815 |
End bp | 250816 |
Gene Length | 1002 bp |
Protein Length | 334 aa |
Translation table | |
GC content | 55% |
IMG OID | 640415800 |
Product | predicted protein |
Protein accession | XP_001416109 |
Protein GI | 145342050 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | |
|
|
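The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the listed coding sequence carries no trailing stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names `feature_length` and `codon_count` are hypothetical helpers introduced for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def codon_count(gene_length_bp: int) -> int:
    """Number of complete codons in an unspliced coding sequence."""
    if gene_length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return gene_length_bp // 3


# Values copied from the record above.
length = feature_length(249815, 250816)
print(length, codon_count(length))  # prints "1002 334"
```

Under these assumptions the result agrees with the Gene Length (1002 bp) and Protein Length (334 aa) fields.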
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.568714 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGACC CCAAGCACTG GCGGTCGACG CACGCGAGAA TCGCGAACGG ACCGGTGGTG CCGCAACAAC TCATCGGTGG GACGCCGATG ATCGATTTGA GTGAATTCAG TGCGAATCCA AAGGTGAAGA TTTATGGCAA GTGCGAGTAC TTGAATCCGA GCGGGAGTAT TAAGGATCGA ATCGCGCAGG AAATTTTGGC GAGAGCGCTG GAAACGGGAG AGCTGAAGGC GGGGATGACG GTCGTCGCGG CGACGAGTGG GAACACGGGT GCGGCGATTG CGATGGCGTG TGCGATTCGC GGGTTTCCAT ACATCGTGAT TACGAATCAG AAGACGAGCA AGGAGAAGAT TGACGCCATG CGGGCGTATG GGGGGGAGGT CATCGTCGCA CCGAGCGGGG TGCCGGCGGA TCACCCGGAT CATTATCAAA ACATCGAAGC GACGATGTGC GCAAAAAATC CGAAATTTTA TGGTGTGAAT CAATACGACA ATCCGTACAA CGCGGATGCG TACGAAAAGA CACTTGGTCC TGAGATTTGG TCGCAAACCG AAGGGGCGGT GACGCACTTC GTCGCCGGTG GCTCGACTGG CGGTACGATC ACAGGCACGG GTCGTTATCT TAAGAGTGTC GACCCGACGA TTAAAATCGT CTTGGCCGAT CCCAAGGGAA GCGTTCTGTG GGATTATTTT GTGAACGACA TTCCTGAAGA AGAACTCGTG GCGAAGAGTT GGGAAGTCGA GGGTGTGGGT AAAGATTCCA TTCCGGGTGT ACTCGACACC GAATACATTG ACGGCGCCGT CATGGGTGAC GACAGTAGTT CCTTCCGCAT GGTTCGCACG GTTGCTGAAT CTTCTGGTGT CTTGCTCGGT GGTAGCTCCG GTCTAAACCT GCACGCTGCT CGCGTGCTCT CGAGTCACAT CAAGGAGGGG ACCATCGTCA CGGTACTGTG CGACAGCGGT GTCAAGTACT TGTCCAAGAT CTATAACGAT GAGTGGCTTC AA
|
Protein sequence | MTDPKHWRST HARIANGPVV PQQLIGGTPM IDLSEFSANP KVKIYGKCEY LNPSGSIKDR IAQEILARAL ETGELKAGMT VVAATSGNTG AAIAMACAIR GFPYIVITNQ KTSKEKIDAM RAYGGEVIVA PSGVPADHPD HYQNIEATMC AKNPKFYGVN QYDNPYNADA YEKTLGPEIW SQTEGAVTHF VAGGSTGGTI TGTGRYLKSV DPTIKIVLAD PKGSVLWDYF VNDIPEEELV AKSWEVEGVG KDSIPGVLDT EYIDGAVMGD DSSSFRMVRT VAESSGVLLG GSSGLNLHAA RVLSSHIKEG TIVTVLCDSG VKYLSKIYND EWLQ
|
| |
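The sequences above are displayed in space-separated groups of 10 characters, so they need cleaning before any downstream use. The following is a minimal sketch, not the database's own method: `clean_sequence` and `gc_percent` are hypothetical helper names, and the GC calculation simply counts G and C over all bases, which may differ from how the GC content field was derived. To stay short, the example uses only the first two groups of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Strip the display whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base groups of the gene sequence, as displayed in the record.
fragment = "ATGACGGACC CCAAGCACTG"
print(clean_sequence(fragment))  # prints "ATGACGGACCCCAAGCACTG"
print(gc_percent(fragment))      # prints 60.0
```

The same two functions can be applied to the full Gene sequence field; `len(clean_sequence(...))` then gives the length in bp for comparison with the Gene Length field.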