Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_28757 |
Symbol | |
ID | 4999391 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 256470 |
End bp | 258511 |
Gene Length | 2042 bp |
Protein Length | 406 aa |
Translation table | |
GC content | 63% |
IMG OID | 640414812 |
Product | predicted protein |
Protein accession | XP_001415434 |
Protein GI | 145340650 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase |
TIGRFAM ID | [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0466564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCCG CCGTTAAGGT GCGCGATCGA CGCGCGAGAC GACGCGATCG GACGACGACG GCGCGATCGC GACGTCGCGA CGTCGCGACG CGCGGTCGCG ACGTCGGCGA CGGCGCGATG ACGGACGCGG GACGAGCGAT CGCGGATAAT CATCTCATCG ACGCCATCGG CGACGACGAT CGGGGACCGC GGGCGTCGAG CGCGGACGCG CGCGCGCGCG AGGACGCGAG GGACTGACGG AAAGAATTTT GTGCGTCGCG ATCGCAGACC GTCGAACTCA CCGCCCGCAC CGCGCTCGCG GGTAACTTCC GCAAGGCGAC CGCGCCGCGC GCGGCCGCGA AGGTGCGTGA CCGCGCGGAA CTCGGCGCGC GCGCGACGCG CGAGCGCGCG CGACGTCGAC GGGGGTGCGC GGAGACTGAC GAATGGGGTG TTTGCGCGTC TAGGCCAACG TGAACGTGGT GACCGAGGCG AAGGTTAAGG TTGCCATCAA CGGTTTCGGC CGCATCGGTG CGTGCGATCG CGGATAAGGG TCGCGATGCG ATTGTCGCGA TGTCGATGCG TTTCGCGCGT CGTGTGCGCG GTGTTCGCGC GATGTGAGGC GAACGAGGGC GCGCGGTGAT TGGGAGAATA TGCGCGCAGC GCTCGTGCGT CGAGGGAGGG CGCGCGCGAG GCGCGCGAGG CGCGCGAGGG TGCGATCGAC GCCGGGGTGC GCGCACGTGG CACCCTGGGG ATTTTTATGA CTTTTAGCTC GCGCGCGTGG GCACGACTCG AACGACGCGA CTGACGATGA AATGAACGCT TGTGGGCGAT TTAAAACACA GGCCGCAACT TCGTGCGATG CTGGAAGAGC CGTGGCCCGG ACTGCCCGCT CCAAGTCGTG TGCGTGAACC AATCCGGCGG CGCCAAGCCG GCGGCGCACT TGCTCAAGTA CGACTCCATC TTGGGTACGT TCAAGTCTGA CGTCAAGGTT GTCGACGACG CCACGATCTC CGTCGATGGT GACATCATCA AGGTTGTGTC CGATCGCGAC CCGCTCAACT TGCCGTGGAA GGAAATGGGC ATCGACATCG TCATCGAAGG TACGGGCGTC TTCCTCGACG GCCCGGGCGC GGGCAAGCAC ATCCAAGCCG GCGCCAAGAA GGTTGTTATC ACCGCCCCGG CCAAGGGCTC CGACATCCCG ACCTACGTCG TCGGTGTGAA CGCCGACCAA TACGACAACT CTGCCAACAT TGTCTCCAAC GCGTCGTGCA CCACCAACTG CTTGGCGCCG TTCGCCAAGG TGATCGACGA CAAGTTCGGC ATCGTCAAGG GTACGATGAC CACCACGCAC TCTTACACTG GTGATCAACG CATCTTGGAT GCGTCCCACC GTGACTTGCG CCGTGCTCGT GCCGCCGCCT TGAACATCGT GCCGACCTCC ACCGGTGCCG CCAAGGCTGT TGCGCTCGTC TTGCCGCAAC TCAAGGGTAA GCTCAACGGT ATCGCGCTTC GCGTGCCGAC GCCGAACGTG TCCGTCGTCG ACCTCGTCGT CAACGTTGAG AAGAAGGGCG TCACCGCTGA AGAAGTCAAC GCCGCGTTCA AGGAAGCGCA AGATGGCCCG ATGAAGGGTG TTCTTGCGAT CACCGATGTC CCGCTCGTGT CTGTCGATTT CCGGTGCACC GACGTGTCCA CCACCATTGA CGCGGCTCTC ACCATGGTGA TGGGCGATGA CATGGTCAAG GTTGTCGCGT GGTATGACAA CGAATGGGGT TACACCCAAC GCGTCGTCGA CCTCTCCCTC ATCGTCGCCA ACGGCCTCAT CAACGAAGGT GTTGCCTCCG CCGACCCGAT GGACCAACTC TGCGCGGATG ATCCCTCGGC GGACGAATGC AAGGTGTTCG ATTAAACCAT TTTTTCGATC AACTTCATCG AGCGATGTTG GCGAGTGGAC GAGGCGTGAG ACGTAATTAT TACGTCGTTG CGCCCTTGAT TTCTCGCGCG ATTTACCAGC GTATTAACAC ACACTTCTGT AATCAGAATC TAGAGACAGA TTTGAAAGAA CC
|
Protein sequence | MSAAVKTVEL TARTALAGNF RKATAPRAAA KANVNVVTEA KVKVAINGFG RIGRNFVRCW KSRGPDCPLQ VVCVNQSGGA KPAAHLLKYD SILGTFKSDV KVVDDATISV DGDIIKVVSD RDPLNLPWKE MGIDIVIEGT GVFLDGPGAG KHIQAGAKKV VITAPAKGSD IPTYVVGVNA DQYDNSANIV SNASCTTNCL APFAKVIDDK FGIVKGTMTT THSYTGDQRI LDASHRDLRR ARAAALNIVP TSTGAAKAVA LVLPQLKGKL NGIALRVPTP NVSVVDLVVN VEKKGVTAEE VNAAFKEAQD GPMKGVLAIT DVPLVSVDFR CTDVSTTIDA ALTMVMGDDM VKVVAWYDNE WGYTQRVVDL SLIVANGLIN EGVASADPMD QLCADDPSAD ECKVFD
|
| |