Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_36001 |
Symbol | |
ID | 5000334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 285342 |
End bp | 286616 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | |
GC content | 55% |
IMG OID | 640415755 |
Product | predicted protein |
Protein accession | XP_001416357 |
Protein GI | 145343495 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0387347 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCGAG TGAAGCAAGA ATCGTATGAG CAGGTTGGGA TTCCCATCCT CAAAGAAGTC GTATCCGCCA AGCCGAAGCT CAAGCGCGTG CTCGATTTCC TAGTGTCGAC GTCGCCCAAG GAGCAACCCG TTCGACCTCC GGTGGATACT ACTTTGCCGA GCGTCAGTAT CATTGTTCCC GTGTATAACG CATCTCGTTG GTTAGATGAG ACTTTGAGCT CTATATGCGC TCAAACGTAC CGAGGTCCAA TCGAGGTGAG CATCTTCGAC GACAGCAGTA CGGATGGGTC GCCAGACATC ATCAAAATCT GGCGCGAGGC GTTTACCCGA GTGGGCATCG CCGTTGTGGT GAACGGAAGC AGGTGGCCAG AGAGCTCGAA GTTTGTCGCG GAAGACGCGC CCAACTTTGG TATCGGGTTC TGTCGAAATC GTTGCATCGA GCAGAGTCAC GGAGACATTT TAGTCTTCCT GGACTCTGAC GACGTCATGA TGCCACAGCG ACTCGAGCTA CAAGTGCCAC TCGCCGCGGA AAATCGGACG GCCATCGTTG GAGGGTGCTG GAAACGGTAT CCAAGCGGTT CAACGGAGCA TTACGAAGCC TGGGCGAACA TGATGACGCA AAACGAGCTT CACCTCGAAC AATTCCGAGA GTGCACGGTA CTCTTGCCGA CGTGGTGCAT GGCGCGAACG GTCGCAGAAA CCGTAGGCGG TTTTGTTGAG GCACCGCCTG GTTCGGGCGA GGCGGAAGAT TTAATTTTCT TCCAACGGCA CTTGGCGTTA AATTACGAGC AGAACTTGAA AAAGGGCTTG CCATCGCTTT TACGCGCCGG GGACTACCCG CATAACCCGG TGCTGTTGTA TCGCTGGAGT CCACAGAGCG CATCAGCACG GTGTAGTCGC CGTCGGCTGT TGCAGGTTCG CGCGCAAGCG TTCGAGGAAC GCATTCTTCC CATGAAGTCG TGGGAGAAAT TCATCGTGTG GGGCGCAGGG CGTGACGCGA AAAATTTCAT GAATGAAATT TCGCCCGCCG CGAGAGCACG CGTAGAAGCG ATGATTGACA TTGATCCCAG AAAAACGGGA CGTCAGTACA CGAATTCGCA AGCACCCGAC CAAGCGCCAG TGCCTATCGT GCACTTCTCA AAGGCGCCGA GAGGGCTTCC CGTCGTCGTG TGCGTGGCAA AGCGCCGCAA AGGTTCTGGC GATAGCGGCG ACTTGGAATC AAACGTAGAC ACGCTCGGGC TCATCGAGGG AACGACGCTG TGGTATTTCA ACTAG
|
Protein sequence | MARVKQESYE QVGIPILKEV VSAKPKLKRV LDFLVSTSPK EQPVRPPVDT TLPSVSIIVP VYNASRWLDE TLSSICAQTY RGPIEVSIFD DSSTDGSPDI IKIWREAFTR VGIAVVVNGS RWPESSKFVA EDAPNFGIGF CRNRCIEQSH GDILVFLDSD DVMMPQRLEL QVPLAAENRT AIVGGCWKRY PSGSTEHYEA WANMMTQNEL HLEQFRECTV LLPTWCMART VAETVGGFVE APPGSGEAED LIFFQRHLAL NYEQNLKKGL PSLLRAGDYP HNPVLLYRWS PQSASARCSR RRLLQVRAQA FEERILPMKS WEKFIVWGAG RDAKNFMNEI SPAARARVEA MIDIDPRKTG RQYTNSQAPD QAPVPIVHFS KAPRGLPVVV CVAKRRKGSG DSGDLESNVD TLGLIEGTTL WYFN
|
| |