Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32358 |
Symbol | |
ID | 5002370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 730039 |
End bp | 731633 |
Gene Length | 1595 bp |
Protein Length | 355 aa |
Translation table | |
GC content | 61% |
IMG OID | 640417791 |
Product | predicted protein |
Protein accession | XP_001418562 |
Protein GI | 145348239 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0134] Indole-3-glycerol phosphate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.125229 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCCG CGACGGGACG CGCGACGCTC GCGACGGGCG CGCGACGGGA CGCGCGACGC GCGCGACGCG CGACGCGCCC GCGCGCGAGC GCGCCCTCGG GCGTGGACGA CGGCGCGGTG AGCATCCGAC GACGACCGCC GAACGGACCG TCCAAGCACG ACCTGGGCGT CGGACAGTTT GAGTTCGTGA TCGAGGAAGT CGCGCCGACG GGCGAGCGGG TGACGAACGA GAACAACAAG CCGAAGAACA TCTTGGAGGA GATCGTGTGG TATAAAGACG TGGAACTCGC GGAACGCAAG GAGAAATTCC CGTTGCAGTT GGTGCGAACG GCGCTGGTGA ACGCGCCGCC GACGAGGGAT TTTGTGAAGG CGATCACGGA TCAGTTGGCG GCGACGAAAC AGCCGGGACT GATCGCGGAG GTGAAGAAGG CGTCGCCGTC GAAGGGGGTG ATTCAGCCGA ATTTTGATCC GGTGAAAATC GCGAAGGCGT ACGAGGAGGG TGGGGCGGCG TGTTTGAGCG TGTTGACGGA TGAAAAGTTC TTTCAGGGTG GATTTGAAAA CTTGGCGCTC ATTCGCGAGG CTGGGGTGAC GTGCCCGTTG CTCTGCAAGG AGTTCATCGT GGACGCGTAT CAAATTTATC TCGCGCGCAA GTACGGCGCC GACGCGATTT TGTTGATCGC CGCGGTTTTG CCCAACCAAG ATTTGAAGTA CTTCATTAAG ATTGCGCACT CGCTGGGCAT GAAGTGCTTG ATCGAGGTGC ACACGTACGG GGAGATGAAG CGCGTGTTGG ACGGAGACTT TCCCATCGAT TTGCTCGGCA TCAACAACCG GGATTTGGGA ACGTTTGAGG TCAGCTTGAG CGTCACCACC GATCTCATGA ACGGGCCGCT CGGCGAGGAG GTTAAGAGAC GCGGCATCAC CATGGTTGGC GAGTCTGGAA TCTTCACCAT CGACGACGTC AACTTGTTGC AAGATTCCGG CGTCGGCGCG GTCTTGGTCG GCGAGTCCCT CGTCAAGCAG GACACCCCGG ACGTCGGCAT CAAGAAATTG TTCGGCCGCC CGGTGTGATT CGATCACGCG CGATTTCCAT CGACTCGACA CGATTTTTAA CCCGCCCCGC GCGAGTACCG CTGATTTAAT GACAACCACT ATCCATCCGG AATGCGCCGC TACCATGCGA AATTTCTAGA CTTGAGATTA CACGTCGACA TTCTTATCGA TCATCTTCAT GCTCATACTC CCCATTCGTC GTCGTCGTTG CCGTACTGCG CGTGCCCGCC TGATTGCGGG GCGTTATTCC AACCTGGCGG CGGCGGCATC ATGTGGTTGC CGCCGCCAGC GTATCCTCCG TGAACGACGC CGATCGGCGT GGGCGTGTAG CCCGGCGGCG CCTCTGGCGC ATGTTGCGCC GCATAGGCAG GAAGCTGCAC GTACGTCACG TTCGGCGTGG CTTGCTGAGG CTGTGCCGGC CGCGGTTGGG CCCCTATGTC GTAAATATCG CCAAGCGTGT ACTGGCCTCT GCCTGTCAGA CGCCTCACGA GCCCACGCGG TTTCTGCTGC TGCTGCACCA CGTGCGGCTG CTGCTGCTGC TGCTG
|
Protein sequence | MSAATGRATL ATGARRDARR ARRATRPRAS APSGVDDGAV SIRRRPPNGP SKHDLGVGQF EFVIEEVAPT GERVTNENNK PKNILEEIVW YKDVELAERK EKFPLQLVRT ALVNAPPTRD FVKAITDQLA ATKQPGLIAE VKKASPSKGV IQPNFDPVKI AKAYEEGGAA CLSVLTDEKF FQGGFENLAL IREAGVTCPL LCKEFIVDAY QIYLARKYGA DAILLIAAVL PNQDLKYFIK IAHSLGMKCL IEVHTYGEMK RVLDGDFPID LLGINNRDLG TFEVSLSVTT DLMNGPLGEE VKRRGITMVG ESGIFTIDDV NLLQDSGVGA VLVGESLVKQ DTPDVGIKKL FGRPV
|
| |