Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_36056 |
Symbol | |
ID | 5000337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 105320 |
End bp | 107536 |
Gene Length | 2217 bp |
Protein Length | 738 aa |
Translation table | |
GC content | 58% |
IMG OID | 640415758 |
Product | predicted protein |
Protein accession | XP_001416311 |
Protein GI | 145343351 |
COG category | [C] Energy production and conversion |
COG ID | [COG1882] Pyruvate-formate lyase |
TIGRFAM ID | [TIGR01255] formate acetyltransferase 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTCG CGGTGGACGC GGTGGACGCG TTCGTGCGCG CGCGCGGAAC GCCGCGCGCG GACGACTGCG CGACGACGCT CGCGAAACCG ACGGGAAAGA CGCTGGAACT GCTGGCGGAG GCGCGGCGGT TGCTCGAGCT GGAGCGACGC GCGGGAGGGG CGCTGGCGGT GGATGCGGAG ACGCCGTCGC GAGCGACGTC GCACGGCGCG GGGTACGTGC TGTCGAAGGA GGTGGACGAC GTCGTGGTCG GGCTGCAGAC GACGCGCGCG CTCACGCGGA CGATGAAGCC GTTGGGAGGG TTGCGCGTGG TGGAGAAGGC GCTGGGCGAG CGCGGCGCGA CGATGAAGGA GGACGCGAAG AAGTTTTTTG ACGATTGGGT CACGACGCAC AACGACGCGG TTTTCGACTT GTACACGGAT GAAATGCGAT CGGCGAGGAA GGCGCATCTC TTGCTCGGGC TCCCGGACGC GTACGGACGA GGGCGCTTGA TCGGGGATTA TCGCCGCGTG GCGCTGTTCG GCGTCGATCG GTTGATCGCG GAGAAGCAAA TCGACAAAGA TGCCATCGCG ACGAGCGACG AGGAGGACTT GCGCGCACGA TTTGAGGTGT CGGAGCAAAT TCGCGCGTTG GAGCGGCTCA AGGCGATGGC AAAAATGTAC GGTTTCGACA TCGGCGCCGC CGCGCGCGAT TGCAAAGAAG CGGTTCAGTG GACGTACTTT GCCTACCTCG CGTGCGTGAA GGAACACGAC GGCGCGGCGA TTTCTCTCGG TCGAGTCGAC GCTTTTTTTG ATGTGTACAT CGAGCGCGAC ATGAAGGCTG GTTTCATCGA CGAATCGGGC GCGCAGGAAC TCATAGACAA TTTTGTGCTC AAGCTGAGAC TCGTTCGGCA TCTTCGACCG GAGGCGTACG ACGACATTTT CGCCGGCGAT CCCATCTGGG CCACGTGTTG CGTGGGTGGC ATGCAGATGA ACGGCACCGA CACGTTGGTG ACGAAGACTT CTTGGAGGTT TCTACAAACG CTACGCAACA TGGGCGCGGC GCCGGAACCA AACATGACGG TTTTGTGGAG CGAATCGTTG CCGAAGCCGT TTAAAGAATT CGCGGCGGCG ATCAGCATCG AGTCGAGCTC AATTCAGTTC ATCTCCGACG ACTTACTGCG GGAAAAGTTT CAATCGAGCG ACGTCGGTAT CAGTTGTTGC GTGTCGGGAA TGCAGTTGGG GCATACCATG CAATATTTCG GCGCGCGATG CAATCTACCC AAGCTTCTTC TCTACGCGCT GAACGGCGGT CGCGACGAAA TCACCGGCGC CCGCGTCGCG CCCGCGCTTT TCGACGACGA AGTTCCTGAA GGTGTTTTAC AATACGATGA CGTGATGAAG CGTTTTACGG CGTACATGGA CTGGCTCGCC AAGCTTTACG TGGAGACGAT GAACTGCATC CACTACTCGC ATGATCGATT CAACTACGAG GCCTTGTTTT TCGCGCTCAT GGACACCGAT TGCAAGCGGA TGATGGCGTT CGGCATCGCT GGGTTGAGCG TTGTGGCCGA CTCTCTCAGC GCCATCAAGC ACGCAAAAGT CAAGATGATT CGCGACGAAA AGTCGGGATT ATCCACTCGG TTCGAGGTCG ATGGCGATTG GCCCGCGTTT GGCAACGACG ACGATGAGGT TGACACCATC GCCGCCAGTG TTGTGGAAAC GTTCATCTCG AGTTTACGAA AGACGCCAGC GTATAGAGGA TCCGAGCACA CGCTTTCGAT TCTGACGATT ACGTCCAACG TCATGTACGG GAAACATACT GGCGCGACGC CGGATTTGCG AAAACTCGGT GAACCGTTCG CGCCGGGCGC GAATCCGATG CACAATCGCG ACAAGACGGG TGCTTTGAAT TCACTCAATT CTGTGGCAAA GATCCCGTAC GCGAGTTGCA TGGATGGCAT CAGCAACACG TTCTCCATCA CCGCGCCATC GCTCGGCAAG ACGGACGGGA CTCGAGTCTC GAACTTGACT GCGTTGCTGG ATGGCTACTT CTCACACGGC GCTCAACATT TGAACGTTAA CTGTATCGAT AGATCGGTAC TGTTAGACGC CATGGCGCAT CCAGACAACT ATCCCACCCT CACCATTCGC GTGAGCGGGT ATGCGGTGAA TTTCATCAAA CTCTCGCGAG CACATCAAGA GGAAGTCATC GCTCGTACTT TTCACGCTTC GCTTTAG
|
Protein sequence | MTLAVDAVDA FVRARGTPRA DDCATTLAKP TGKTLELLAE ARRLLELERR AGGALAVDAE TPSRATSHGA GYVLSKEVDD VVVGLQTTRA LTRTMKPLGG LRVVEKALGE RGATMKEDAK KFFDDWVTTH NDAVFDLYTD EMRSARKAHL LLGLPDAYGR GRLIGDYRRV ALFGVDRLIA EKQIDKDAIA TSDEEDLRAR FEVSEQIRAL ERLKAMAKMY GFDIGAAARD CKEAVQWTYF AYLACVKEHD GAAISLGRVD AFFDVYIERD MKAGFIDESG AQELIDNFVL KLRLVRHLRP EAYDDIFAGD PIWATCCVGG MQMNGTDTLV TKTSWRFLQT LRNMGAAPEP NMTVLWSESL PKPFKEFAAA ISIESSSIQF ISDDLLREKF QSSDVGISCC VSGMQLGHTM QYFGARCNLP KLLLYALNGG RDEITGARVA PALFDDEVPE GVLQYDDVMK RFTAYMDWLA KLYVETMNCI HYSHDRFNYE ALFFALMDTD CKRMMAFGIA GLSVVADSLS AIKHAKVKMI RDEKSGLSTR FEVDGDWPAF GNDDDEVDTI AASVVETFIS SLRKTPAYRG SEHTLSILTI TSNVMYGKHT GATPDLRKLG EPFAPGANPM HNRDKTGALN SLNSVAKIPY ASCMDGISNT FSITAPSLGK TDGTRVSNLT ALLDGYFSHG AQHLNVNCID RSVLLDAMAH PDNYPTLTIR VSGYAVNFIK LSRAHQEEVI ARTFHASL
|
| |