Gene Information |
Locus tag | OSTLU_49484 |
Symbol | |
ID | 5001442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 36277 |
End bp | 37331 |
Gene Length | 1055 bp |
Protein Length | 302 aa |
Translation table | |
GC content | 65% |
IMG OID | 640416863 |
Product | predicted protein |
Protein accession | XP_001417123 |
Protein GI | 145345237 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0159] Tryptophan synthase alpha chain |
TIGRFAM ID | [TIGR00262] tryptophan synthase, alpha subunit |
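
The Gene Length value above is consistent with reading Start bp and End bp as 1-based, inclusive coordinates. A minimal sketch of that arithmetic follows; the function name `gene_length` is hypothetical, and the inclusive-coordinate convention is an assumption inferred from the listed values rather than something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


length = gene_length(36277, 37331)
print(length)  # 1055, matching the Gene Length field

# Assuming a single stop codon, a 302 aa protein needs 3 * (302 + 1) nt of coding sequence.
coding_nt = 3 * (302 + 1)
print(length - coding_nt)  # nucleotides in the gene span outside the coding region
```

Since the gene is listed as unspliced, any difference between the gene span and the coding length would be flanking untranslated sequence.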
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0000433927 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTCGACGCGC TCGACGACGC GCGCTCGATC GTTGAACGCG GTCGACGTCA CCGCGATTCG ATGGCCGCGA GCGCGTCCCG CGCGCTTCGC GCGGCGCCGC GCGCGCCGCG CGCGTCCCGC GCCCGCCGTA ACCCTGAACG CGGCGCCGCG TCTCGCGCCG TGGCGACGCG CGCGAGCGTG TCCGAGGCGT TCAAGGCGGT GCTGGACGAT GGCAAGCGAG CGTTCATTCC GTTCATCTGC GCGGGCGATC CCGATCTGGA GAGCACGAAG AAGGCGCTGA AGATTTTGGA CGACGCGGGC GCGGATGTCA TCGAGCTCGG CGTGCCGTAC AGCGACCCGT TGGCGGACGG ACCGGTGATT CAGGCGGCGG CGACGCGGGC GTTGGAGAAC GGGGCGACGT TGAATAAGGT GATCGATTTA GTGCGAGAGA TGACGCCGCA GATTAAGGCG CCGATCGTGA TGTTTACGTA TTACAATCCG ATTTATCAAC GCGGAGTGGA TAAATTTTGC GCCGACATCG CCGCGGCTGG GGCGAAGGGA TTGCTCGTGC CGGATATTCC GTTGGAGGAG ACGTACGATG TGAGCGAGAT CGCGAGTAAG CACGGCATAG AGCTCGTTCT GCTTTCCACG CCCACGACGC CGGTGGAACG GGCGAAGAAG ATTGCGCAGG CGACGAAGGG GTTCGTCTAC CTCGTCTCCG TCACGGGCGT CACCGGCGTG CAATCGAACG TGGCGACGCG CGTGGAGCAA TTGGTGGAGG AGTTGAGAAG CGTGACGGAT AAGCCCATCG CGGTCGGGTT CGGGGTGAGC GAGGCAAAGC ACGCGAAGCA AATCGTGGAT TGGGGCGCCG ACGGCGTCAT CGTCGGTTCC GCGCTCGTGC GCGCGCTCGG CGAAGCCAAG ACGCCCGAGG AAGGTCTCGC CGCGCTCAAG GCCAAGGCTG AGGAAATCCG CGGTGGCGCC ACGCTCTGAG AAAGCGTCTT CACGCGGCGC GAGAGCGACG AACGACCTAG GTAGCGGTGG ACATTTTAAT CTATTAACAA CGCTCGAGCG CGCCC
|
Protein sequence | MAASASRALR AAPRAPRASR ARRNPERGAA SRAVATRASV SEAFKAVLDD GKRAFIPFIC AGDPDLESTK KALKILDDAG ADVIELGVPY SDPLADGPVI QAAATRALEN GATLNKVIDL VREMTPQIKA PIVMFTYYNP IYQRGVDKFC ADIAAAGAKG LLVPDIPLEE TYDVSEIASK HGIELVLLST PTTPVERAKK IAQATKGFVY LVSVTGVTGV QSNVATRVEQ LVEELRSVTD KPIAVGFGVS EAKHAKQIVD WGADGVIVGS ALVRALGEAK TPEEGLAALK AKAEEIRGGA TL
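
The two sequence fields can be cross-checked against each other and against the GC content field. The sketch below is an illustration only: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and it assumes the standard genetic code, because the Translation table field in this record is empty. The example uses only the first 90 nt of the Gene sequence field, where the first ATG sits at 0-based offset 60.

```python
from itertools import product

# Standard genetic code (assumed; the record's Translation table field is blank).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Drop the spaces that separate the 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str, start: int = 0) -> str:
    """Translate from `start` in frame, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine 10-nt blocks of the Gene sequence field.
prefix = ("GTCGACGCGC TCGACGACGC GCGCTCGATC GTTGAACGCG GTCGACGTCA "
          "CCGCGATTCG ATGGCCGCGA GCGCGTCCCG CGCGCTTCGC")

print(translate(prefix, start=60))  # MAASASRALR, the first block of the Protein sequence field
print(round(gc_content(prefix) * 100))  # GC percentage of this prefix only
```

Passing the full Gene sequence field to `gc_content` and `translate` in the same way would let the GC content and Protein sequence fields be compared with the computed values.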