Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_26284 |
Symbol | |
ID | 5004183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 539295 |
End bp | 542542 |
Gene Length | 3248 bp |
Protein Length | 976 aa |
Translation table | |
GC content | 59% |
IMG OID | 640419604 |
Product | predicted protein |
Protein accession | XP_001420206 |
Protein GI | 145351701 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0122884 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.247759 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTCGC TCATCGCATC CGGCGCGCGG GCGGTCTCGA CCGAAGCGCT GAAGCCGCTG GACACGTTCG AGCGACGACA TAACTCCGGA ACGACGCAAG AAGTGGCGGA GATGTGCGCG GTGATCGGGT TCAAGGACAT CGACGCGCTG ATCGACGCGA CGGTGCCGGA AAATATTCGC CTGAAGAAGA CGATGGACAT GGGCGAGTAC ACGCAGCCGC TCACGGAGAG CGAATTCTTG ACGATGATGA AGAACATGGC GAGCAAGAAT AAGGTTTTTA AGAACTACAT CGGGACCGGG TATCACGGCA CGCACGTGCC GACGGTGATT TTGCGTAACA TTTTGGAAAA TCCGGGGTGG TACACGCAGT ACACGCCGTA CCAGGCGGAG GCGTCGCAAG GACGCCTGGA ATCGTTGTTG AACTTTCAAA CGATGATTAC CGACTTGACC GGCATGCCGC TGTCTAACTC GTCGTTGTTG GACGAAGGCA CGGCTGCGGC GGAGGCGATG ACGATGTGCT CTGCGTTGAA CCGCGGTAAG AAGCCGAAGT TCTACGTGTC GAACAAGTGC CACCCGCAAA CCATCGCGGT GGTGCAGACG CGCGCCGAAG GTTTGGGTCT CGAAGCCGTC GTGGGCGATG AGAACTCTTT CGATTACACC GCGAAGGATG TCTGCGGCGT TCTCGTGCAA TACCCGGCCA CGGATGGTTC GATCATCGAC TACAAGCCCA TCGTGTCCCA GGCGCAAGCC AACGGCATTC GCGTCGTCGC CGCCGCCGAC TTGTTGTCGC TCACCATGTT ACAACCTCCG GGTGAATGGG GTGCTGACAT CGTCATCGGT TCATCTCAGC GATTCGGCGT GCCCATGGGC TACGGTGGTC CGCACGCTGC CTTCTTGGCG ACGACGCACG ACTGCAAGCG TTTGATGCCG GGCCGCATCA TCGGCGAGTC CATCGACGCC GAAGGCAAGC CGGCGCTTCG CATGGCGATG CAAACGCGCG AGCAACACAT CCGTCGTGAC AAGGCGACTT CGAACATTTG CACCGCGCAA GCGTTGTTGG CCAACATAGC TGCCATGTAC GGTGTTTACC ACGGTCCGGA GGGCTTGAAG CAAATCGCCA AGCGCTCGCA CGACTTTGCC GCCGTCTTCG CCGCCGGTGC CGAAAAGCTT GGCTTCAAGA ACACCACCCC GGAGTTCTTC GACACCGTCA CGCTGAAGTG CCCGAGTGGC GCGGATGCCA TCGTCAAGGC GTGCGCGTCC GCTGGCATCA ACATTCGCAA GATGGACGCC GACCACGTCT CTTTGGCGTT TGACGAAACC ACAGAAATCG CCGACGTCGA CGCTCTCTTC AAGGTGTTCG CTGGTGGCGC TGCCGCGCCC ACCGTCGCGC AAGTTGCGCC GTCTGTGAAC ACGACCATGC CGATGGCGCG TAAGTCTGAA TTCATGACCC ACCCGGTGTT CAACCAGTAC CACAGTGAAC ACGAGATGGT GCGCTACCTC AAGCGCTTGG AAGAGAAGGA TCTCTCCTTG GTTCACTCCA TGATCGCTCT CGGCTCTTGC ACGATGAAGC TCAACGCAAC GACTGAAATG ATTCCGATCA CGTGGCCGGA GCTTGCGAAC ATTCACCCGT TCGCGCCGAA AGATCAAACG CTTGGTTACC AAGAGATGTT CCGCGGTCTC GAAAAGCAAC TCTGCGAGAT CACCGGCTTC GACGCCATGT CCCTCCAGCC GAACTCTGGT GCGTCTGGTG AGTACGCTGG TTTGATGGGT ATCCGTGCCT ACCACCAATC TCGCGGTGAC CACCACCGTG ACGTGTGCAT CATCCCGGTT TCCGCGCACG GTACCAACCC GGCGTCCGCC GCGATGTGCG GCATGAAGAT CGTCGTCATT GGCACCGACG CCAAGGGTAA CATCAACGTC GCCGAGCTCA AGGCTGCGGC CGAAAAGCAC TCCGCGAACT TGGCGGCTCT CATGGTTACG TACCCGTCGA CGCACGGTGT CTACGAGGAA GACATCAAGG AAATTTGCGA AGTCATTCAC CAACACGGCG GTCAAGTGTA CATGGACGGC GCCAACATGA ACGCCCAAGT CGGTTTGACT TCTCCGGGTT TCATTGGTGC GGATGTGTGC CACTTGAACT TGCACAAGAC TTTCTGCATT CCGCACGGTG GTGGTGGCCC GGGTATGGGC CCGATCGGTG TCAAGGCGCA CTTGGCGCCC TTCATGCCGG ATCACCCGTC CATGAAGGAT GGTGCCGTCG CCGTCGGCGG GGACAAGCCC TTCGGTGTCG TCGCGGCTGC CCCGTACGGA TCCGCGCTCA TTTTACCGAT TTCCTTCTCT TACATCGCTA TGATGGGTTC CGAAGGTTTG GCGAACGCGT CTAAGCGCGC CATCTTGAAC GCCAACTACA TGTCCAAGCG CTTGGAGGAT TACTACCCGG TGCTCTTCAG CGGCAAGAAC GACACGTGCG CGCACGAGTT CATCCTCGAC ATGCGCCCGA TCAAGGATGC CACCGGCGTC GAAGTCGCGG ACATCGCGAA GCGCTTGATG GATTACGGTT TCCACTCGCC GACGATGTCT TGGCCGGTCG CCGGTACGTT GATGATTGAG CCGACCGAGT CCGAGTCCAA GGCGGAGCTC GATCGATTCT GCGACGCTCT CATCGCGATT CGTGGTGAAA TCCGCGACAT TGAGGACGGT AAGGTGGACC GCGAGAACAA CGTTCTCAAG AACGCCCCGC ACACCGCGGA GGTCGTCACC GCGAAGGAGT GGAACCGCCC GTACCCGCGC GATCTCGGTG CGTTCCCGGT TGAATGGACT CGCTCTCACA AGTTCTGGCC GCAAACCTCT CGCATCGACG ACGTCTACGG CGACAGAAAC CTCGTCGCGA GCCGCGCGGC TGTGGAAGTC GCCGTCGCTC AAACCGCTTA AAAATCACAT TTTGCACTCG TTAGGAGAAA TGTAATTCAT CACGCCTTAA TTCACTATTC TGACAAGCTA AAAATTCTAA ATCAACCGCA AGTGTTCTTG TGCGTCGTCC CGGCGCCCAT CCGTCGGTGC GTAAAACGTC CCCGCGGCTA ACGCCGAGGC CCACACCACC ACACACACGC ACGCCGCTAG TTGTAAGTAC ATCGTGGACG AGCTATTCTC ATCCACTGAT GTGTCCTCTA CGTCAGAGTA TTCCTCCAGA CCGGCGCGAC CGCCCTTGGT CCACGACTCG CCCTCGCCGT CTAAACCTTG TGAGTCGT
|
Protein sequence | MTSLIASGAR AVSTEALKPL DTFERRHNSG TTQEVAEMCA VIGFKDIDAL IDATVPENIR LKKTMDMGEY TQPLTESEFL TMMKNMASKN KVFKNYIGTG YHGTHVPTVI LRNILENPGW YTQYTPYQAE ASQGRLESLL NFQTMITDLT GMPLSNSSLL DEGTAAAEAM TMCSALNRGK KPKFYVSNKC HPQTIAVVQT RAEGLGLEAV VGDENSFDYT AKDVCGVLVQ YPATDGSIID YKPIVSQAQA NGIRVVAAAD LLSLTMLQPP GEWGADIVIG SSQRFGVPMG YGGPHAAFLA TTHDCKRLMP GRIIGESIDA EGKPALRMAM QTREQHIRRD KATSNICTAQ ALLANIAAMY GVYHGPEGLK QIAKRSHDFA AVFAAGAEKL GFKNTTPEFF DTVTLKCPSG ADAIVKACAS AGINIRKMDA DHVSLAFDET TEIADVDALF KVFAGGAAAP TVAQVAPSVN TTMPMARKSE FMTHPVFNQY HSEHEMVRYL KRLEEKDLSL VHSMIALGSC TMKLNATTEM IPITWPELAN IHPFAPKDQT LGYQEMFRGL EKQLCEITGF DAMSLQPNSG ASGEYAGLMG IRAYHQSRGD HHRDVCIIPV SAHGTNPASA AMCGMKIVVI GTDAKGNINV AELKAAAEKH SANLAALMVT YPSTHGVYEE DIKEICEVIH QHGGQVYMDG ANMNAQVGLT SPGFIGADVC HLNLHKTFCI PHGGGGPGMG PIGVKAHLAP FMPDHPSMKD GAVAVGGDKP FGVVAAAPYG SALILPISFS YIAMMGSEGL ANASKRAILN ANYMSKRLED YYPVLFSGKN DTCAHEFILD MRPIKDATGV EVADIAKRLM DYGFHSPTMS WPVAGTLMIE PTESESKAEL DRFCDALIAI RGEIRDIEDG KVDRENNVLK NAPHTAEVVT AKEWNRPYPR DLGAFPVEWT RSHKFWPQTS RIDDVYGDRN LVASRAAVEV AVAQTA
|
| |