Gene Information |
Locus tag | OSTLU_37633 |
Symbol | |
ID | 5006090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | + |
Start bp | 370851 |
End bp | 372485 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | |
GC content | 62% |
IMG OID | 640421511 |
Product | predicted protein |
Protein accession | XP_001421921 |
Protein GI | 145355340 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | [TIGR01798] citrate synthase I (hexameric type) |
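
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship. It assumes 1-based inclusive coordinates and a CDS that ends in a stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming it ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(370851, 372485)
print(length_bp)                  # 1635
print(protein_length(length_bp))  # 544
```

Under these assumptions the computed values agree with the Gene Length (1635 bp) and Protein Length (544 aa) fields of this record.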
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.506551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0169313 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGATT CGCGCGACGT GGCGAATCGG CGGCTGGGAC AAATCAACGA TCACGTCCGC GGCGACGCGA AACGCGCGAA CGGAGGCGCG AGAGCGGGCG GTGGCGATGG GTCGGACGCG CGAGCGACGC CGTCGGGGGG CGCGGGGGCG CTGCACGTGC GAGACTCGCG GACGGGGAAG GAGTATGAAA TCGCGGTGTC GAGCGACGGC GCGGTGGACG CGTCGGCGTT TAAACAAATC ACCGCGGGCG GCGATGGACG AGGTTTGGTG ATGTACGATC CCGGATACAT GAATACGGCG CCGTGTAAGT CGAAGATATC GTACATCGAC GGCGAACGGG GGATTTTGAG GTATCGCGGG TACGCCATCG AGAGCTTGGC GAAATCGTCG ACGTATCTGG AGACGGCGTT CGCGTTGGTG TACGGAGATA TGCCGAATGC GAGTCAGTTG ATGGAGTGGG AACGGACGAT TGCGAGGCAT AGTGCGTTGC CGGTGCAGGT GGTGCACGCG ATCGAGGCGT TGCCGCACGA CGCGCATCCG ATGGCGGTGA TGCTCGCGGG GTTGAACTCG TTGAGCGCGA TGCACCCGGA GCAGAATCCA GCCATCGCGG GTGGGGCGAT TTACAACTCT CACACCGTGC AGGATAAACA AATCGTGCGC ATCATCGGCA AAATGACCAC GCTCGCCGCG CACGCATATC ATCGAAACAC CGGGCGCTCG CCGGCGGCGC CGAATACGCG CATGTCGTAC GCGGAGAATT TCTTGTACAT GCTCGACGCC GGTTTGGACG CGCGGCATCG ACCGCATCCG AAGCTCGCGA AGGCGCTGGA CGTGATGTTT TTACTACACG CGGAGCACGA GATGAACTGT AGCACGGCGG CGTGCAGACA CTTGGCATCG AGCGGGGTGG ACGTGTTTTG CGCCGTGGCG GGCGCGGTGG GCGCGCTTTA CGGGCCTCTG CACGGCGGCG CCAACGAAGC AGTGCTCAAG ATGCTCGAAC GGATCGGAAG CGTCGATAAC ATCCCGAGCT TTTTGGCCGG GGTGAAGGAA AAGCGATACG TGATGTTTGG TTTCGGACAC CGCGTTTACA AGAACTTTGA TCCTCGCGCG AAGATTATTC GCGACATCGC GAACGACGTC TTCGAGCTCG TCGGACGCGA CCCGTTGATC GACATCGCCA TCGAGCTCGA GAAGGCGGCG CGGGCGGACG AATACTTCGT CAAGCGTAAG CTTTACCCGA ATGTAGACTT CTACAGCGGG CTGGTGTATC GCGCCATGGG GTTTCCCCCC GAATTTTTCA CCGTGCTCTT CGCCATCCCG CGCGCGACGG GTTATCTTGC GCACTGGCGC GAGAGCTTAA CGGACGCCGA CAAGAAAATC ATGCGCCCGC AGCAAATTTA CCAAGGCGAA TGGTTGCGAG ATTACGAACC GATCGCCGCG CGCTCGCGCT CGCTCACCGA CGCCATGGAG GATATTCAAC CATCCAACGC GGCGCGGCGC CGCATGGCTG GAGACCCGCC ATCCGGCACG GCGTGGGTCG GGAAAGGCGT GGAAATGTCC ACGCCGGCGT GGCAGTCGGG CGCGTCAGTC GGCGACGCGA CGAGCGGCGT GGAGAACTAT CTCGGACGAA AATGA
|
Protein sequence | MGDSRDVANR RLGQINDHVR GDAKRANGGA RAGGGDGSDA RATPSGGAGA LHVRDSRTGK EYEIAVSSDG AVDASAFKQI TAGGDGRGLV MYDPGYMNTA PCKSKISYID GERGILRYRG YAIESLAKSS TYLETAFALV YGDMPNASQL MEWERTIARH SALPVQVVHA IEALPHDAHP MAVMLAGLNS LSAMHPEQNP AIAGGAIYNS HTVQDKQIVR IIGKMTTLAA HAYHRNTGRS PAAPNTRMSY AENFLYMLDA GLDARHRPHP KLAKALDVMF LLHAEHEMNC STAACRHLAS SGVDVFCAVA GAVGALYGPL HGGANEAVLK MLERIGSVDN IPSFLAGVKE KRYVMFGFGH RVYKNFDPRA KIIRDIANDV FELVGRDPLI DIAIELEKAA RADEYFVKRK LYPNVDFYSG LVYRAMGFPP EFFTVLFAIP RATGYLAHWR ESLTDADKKI MRPQQIYQGE WLRDYEPIAA RSRSLTDAME DIQPSNAARR RMAGDPPSGT AWVGKGVEMS TPAWQSGASV GDATSGVENY LGRK
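
The gene and protein sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the coding sequence. It assumes the standard genetic code, because the Translation table field of this record is blank; all names are hypothetical and not part of any database API.

```python
# Standard genetic code (assumed), built in the conventional TCAG codon order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate frame 1 up to, but not including, the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
prefix = "ATGGGAGATT CGCGCGACGT GGCGAATCGG"
print(translate(prefix))  # MGDSRDVANR
```

The translated prefix matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow a comparison against the GC content and Protein sequence fields.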