Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_51965 |
Symbol | |
ID | 5006620 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | - |
Start bp | 348418 |
End bp | 353638 |
Gene Length | 5221 bp |
Protein Length | 1562 aa |
Translation table | |
GC content | 59% |
IMG OID | 640422041 |
Product | predicted protein |
Protein accession | XP_001422721 |
Protein GI | 145357021 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0069] Glutamate synthase domain 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.738666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0653658 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGTCGCGATC GTGCGCGCGC CGGAAGGGCG CGCGAGCGAG CGAAGGCGAT GCAAGCGATG GGCGCGCGCG TAAGCCCCGT CGCGGTGACG GATGCGCGCG CGCGTCGCCA TCGCGCGCGC GTGGGGGCGA AAAATGGTGC ACACGCGCAG GCGATGCGTA ACAAAAACGT CGCGAGCGCG AGCGCGGGCA CGATGCGCGC GTTGGGGAAC CAAGCGGTGC GTAAGACGCG CGCGACGACG CGCGAAGGGG GTTAGGGCTT GTTTTCCAAG AGTCACCCCG CGACGAATCA CGAACGGCGA ATGCGCGTTT AAAAGACGGG CCGAAGGCGA GGCGCGAAGG CGTCGATCGA ACGTCGAACG AAGAAGACTG ACGGTAGAGT GTGATCATTT ACATCGCGCA GAAGTTTACT CCGTCCCCGG TGACGTTGCG TAAGCCCGCG GCGTTGCCCA AGTGCTCGAC GGTGCGCGTG CCGGGACGCG ACCGCGCGAA GACGGTGTGC CTGGCGGAGC GAGGGTCGGT GTACGGCGAT TGGCCCGTGT ATGAAGGCGG TGCGGTGGAT AACACGCAGC TTTTGGAAGA GCATGATGCG TGCGGGGTCG GTTTCATCGC TTCGTTGAAG GGTGAGCGCA CGCACAAGAC TGTGAAAGAT TCCTTGATGG CTGTCGGGTG CATGGAGCAC CGCGGGGCGT GCTCGGCGGA CAACGACTCT GGTGATGGTG TCGGTGTCAT GACGCACATT CCGTGGAAGC TTTTGGACAA GTGGTGCGCG GCGAATGGTA TCAGTGGATT CTCTGAAGGC TCCTCGGCGG TCGGTATGGT TATGTTGCCC ACCGACGGTG CCAAGGCGGC TGAGGCGAAG AAGATTCTTG AGGCGAGCTG CGTCGCCGAA GGCCTCAAGG TGCTCGGTTG GCGTGCGGTT CCGGTGGACA ACTCCGTCGT CGGTCCGCTC GCGAAGATGA CGTGCCCGGT GCACGAACAG ATTCTCGTCG ACGGGGCGGG CTTGGAACGC GAAGAGCTCG AGCGCAAGTT GTTCATCGCG CGCAAGACGT GCGAAAAGTC GGCGAGCTCC GACGCCGTGT TAGCTGAGAG CTTCTACATT TGCACCTTGA GCTCCCGCAC GATTGTGTAC AAGGGTATGC TTCGATCCGC GGTGTTGGGC AAGTTCTACA AGGATTTGGA AGACCCCGAC TACGAGTCGC AGTTCTCGAT TTACCACCGC CGTTTCTCCA CGAACACGAC GCCGAAGTGG CCACTCTCGC AACCGATGCG TTTCTTGGGA CACAACGGTG AGATCAACAC CCTTCAAGGT AACTTGAACT GGATGGCGTC TAAGGAAGCT GATATGGAGA ACCCGATCTG GGGCGGTCGT GAACCGGAAT TCCGCCCGAT CTGTAATCCG GCTGCCTCCG ATTCCGCCAA CCTCGACAGA GTCGCGGAGC TCCTGGTGAG AACCGGCCGC GCGCCGGCGG AGACTATGAT GTTGCTTGTG CCAGAAGCGC ACCGCAACCA CCCCGAACTC GATGCGACGT TCCCGGAAGT TCATGATTTC TACGATTATT ACGCTGGGAT GCAAGAAGCG TGGGATGGTC CAGCGTTGCT CGTCTTCTCC GATGGCAAGC AGCTCGGCGC CCGCCTCGAC CGCAACGGGT TGCGCCCGGC GCGCTTCTGG CGCACGTCCG ATGATTACAT CTACGTCGCC TCGGAAGTTG GTGTCCTCGG TGATGTCATG TCCAACGCGT CCAATGTCGT CTCCAAGGGC CGTCTCGGCC CGGGTATGAT GATTTACGCT GATTTGGAGA CGGGCGAGTT CAAGGAAAAT ACGGAAATCG CGAAGGAAGT CTCCGCGCGC CTCCCGTACG GCGAATGGAT GAAGGCCATC GATCGCGTCA AGGGCATCGA ACCCATTGGC GCGACGCAAC TGGACCCGAT CCAACTCATC GAGTGCCAAG CTCGCGCCGG TTACGCCGCC GAAGACATCA CGATGATCAT TGAATCGATG GCGTCCGATG CCATCGAACC CACTTGGTCG ATGGGCGACG ACACCCCGAT GCCCGTCTTG TCTGGCCGAC CGCGCTTGCT TTATGACTAC TTCAAGCAAC GCTTCGCGCA AGTTACCAAC CCCGCCATCG ACCCTCTTCG CGAGGGTCTC GTCATGTCCT TGGCCATGAC TCTTGGTGCG AAGGGCAACT TGCTCGACAC GCAAGGCAAG GAAACGCCGC CGGTCATGCT CGACTCCCCG GTCCTCTTCG ACTCTGAGTT GGAGCACATT AAGAACCACC CGAAGCTGAA GACGCAAACT ATTGCCGCGC GTTACGCCGC TGGTGGTGCC GCTGGTGCCC TCAAGGCTGG CCTTGACAAG CTTTGCGAAG AGGCCGCCGC GGCGATTCGC GCCGGCAGCG AGTGCATCGT CATCACGGAT CGTCCGGATC AAGGTCCGGA CTCGCCCGCG ATTCCCTCGC TTCTCGCTGT TGGTACCGTG CACCACTACT TGATCGCGCA AGGTCTTCGA ACCCGCGCGT CTATCGTCGT GGAGTCTGCT TCGGCGTTCA GCACGCACCA CATTGCCACC TTGGTTGGTT TCGGCGCACA CGCTGTGTGC CCGTGGTTGG CTTTGGAAAC CTGCCGGTCA TGGAGAAAGT CCCCGAAGGT CGAGACCGCC ATCCAGCGCG GTAAGATGGG TGATGTCTCT GTGGAAGGTG TGCAAGTCAA CTTCAAGAAT GCCCTCAACA AGGGTCTCAA GAAGATCTTG TCTAAGATGG GTATCTCTTT GATCACCTCG TACCAAGGCG CGCAAATTTT CGAGTGCTAC GGTCTTGGGC CTGAAGTCAT CAACACCGCC TTCAAGGGCA CCGTTTCCCG CATCGGTGGT CTCACCATGG ATGAAGTTGC CGCGGAGACG CACATGTTTG TCCAGTCCGC TTTCCCGGGT GAGGCTGAAG AGATGGCCAA GGTTGAGGCG CGCGGTATGT TCCAAGTCAA GCCGGGATTG GAATACCACG GCAACAACCA AGAGATGTCT AAGCTTCTTC ACAAGGCTGT TGGCCTCGGT GGTGGTGAAA AGAATGATGA GTTCTGGAGC GCCTACCAAG CGCACCGCAA CGATCGTCCG TACACGTGCT TGCGCGATCA ACTCGAAATC AAGTCTGACC GCCAACCGAT CTCCGTCGAT GAGGTCGAAT CCGTCGCTGA CATTTGCACG CGCTTCTGCA CGGGTGGTAT GTCTCTCGGT GCTATCTCCC AAGAGTGCCA CGAATCTATC GCCATCGCGA TGAACCGCAT CGGTGGTAAA TCCAACTCTG GTGAAGGTGG CGAAGACCCG AAGCGATTCG AAACCATCAC TGACGCCACC GCGGATGGCA AGTCTGAAAC GTTCCCGTAC CTTCGAGGCA TGGAGAATGG CGACGTCGCG TCTTCCGCTA TCAAGCAAGT CGCTTCCGGT CGCTTTGGTG TCACGACGTC GTTCTTGATG TCTGCCAACC AGACCGAAAT CAAGGTGGCG CAAGGTGCCA AGCCGGGAGA AGGTGGTCAG CTTCCGGGTA AGAAGGTTTC CCCGTACATT GCCTGGCTCC GCCGATCCAA GGCTGGTGTC CCGCTCATCT CCCCGCCGCC GCATCACGAC ATCTACTCCA TTGAGGATCT CGCGCAGCTC ATCTATGACT TGCACATGGT CAACAAGAAC TCGAAGGTGT CCGTGAAGCT CGTGTCCCAA GCGGGCATCG GCACGGTGGC GTCCGGCGTC GCCAAGGCGA ACGCCGACAT CATCCAAATT TCGGGTGGCG ATGGTGGTAC CGGCGCGTCT CCTTTGTCGT CCATCAAGCA CTGCGGTGGT CCGTTGGAGA TGGGTCTCGT CGAATCGCAC AGAACTCTCG TTGAGAATGG CCTTCGCGAG CGCGTCGTCT TGCGCGCCGA TGGTGGCTGC CGCTCCGGTC TTGACGTCAT CCAAACCGCT CTCATGGGTG CCGATGAATA CGGTTTCGGT ACCGTTGCGA TGATTGCCAC TGGCTGCGTC ATGGCTCGTA TTTGCCACAC CAACAACTGC CCCGTTGGTG TTGCGTCCCA GCGCGAAGAG CTTCGCGCGC GCTTCCCCGG TGCGCCAAGC GATCTTGTCA ACTTCTTCAT GTACGCCGCG CAAGAAGTGC GCGAGATCCT CGCCCAAATG GGTTACAGAT CTCTCGATGA GATCATCGGT CGCAACGACT TGCTCAGCCA AATTGACAAG GCGCCGGCGA AGACTTCGTC TCTCGACTTG TCCTTCCTCA CCACGTCCTC TGGCGAGGCT GGCGCTTCCT CGGACCGCAT CGCGCAACCG GTGCACAACG ACGGTATCGT TCTCGATGAC AAGATCCTTA GCGATCCGGA AGTCCAAAAG TGCATCGAAA CCGAAGGCAC GTACACGAAG AAGGTGGAGA TTGTCAACGT CGACCGTTGC GCGACGGCGC GCGTCGCCGG TCAAATCGCC AAGAAGTACG GCGACAATGG CTTCGCTGGT TCTCTCACCT TAGACATCGA GGGTTCCAGC GGTCAATCTT TCGGTGCTTT CGTTGTCGGT GGCCTGAAAG TGCGACTTGT GGGTGAAGCG AACGATTACG TGGCGAAGAG CATGAGTGGC GGTGAGATTG CCATCATGCC TCCTCCGAAC TCTCCGTTCG CGCCGGAGTC GGCGAGCATC GCGGGTAACG CGTGCTTGTA CGGCGCCACT GGTGGTCAAG TGTTTATCAG CGGTCGCGCT GGTGAACGCT TCGCCGTCCG TAACTCGCTC GGTGAAGCGG TCGTTGAAGG CACTGGCGAC CACTGCTGCG AATACATGAC GGGTGGTTGC GTCGTCGCGA TCGGCAAGGT TGGCCGCAAC GTTGGCGCGG GTATGACTGG TGGCATCGGT TACTTCCTCG ACGAAGACGG TACGTTCGAA TCCAAGGTGA ACGGCGAGAT TGTCGCCATG CAGCGCGTGA TCACGCCGGC GGGTGAGGCC CAACTCAAGG GTCTCATCTC CGCGCACGCC GAGAAGACGA ACTCGCCGAA GGCGAAGGCT ATCCTCGCTG ACTGGGCCAA CTATTTGCCC AAGTTCTGGC AGTTAGTTCC GCCGTCTGAG GCGAACACGC CGGAGGCGAC GAACGATGTT AAGGCTGGAG TTGAAGCCAC TGCTTAAATC CATAGCGCGG CGCGGCGTCG TACGCAAAAA ATGCTTGCAA GCGTTCATGA GTTGAGGAAT TTAGAAACAG T
|
Protein sequence | MQAMGARVSP VAVTDARARR HRARVGAKNG GAVDNTQLLE EHDACGVGFI ASLKGERTHK TVKDSLMAVG CMEHRGACSA DNDSGDGVGV MTHIPWKLLD KWCAANGISG FSEGSSAVGM VMLPTDGAKA AEAKKILEAS CVAEGLKVLG WRAVPVDNSV VGPLAKMTCP VHEQILVDGA GLEREELERK LFIARKTCEK SASSDAVLAE SFYICTLSSR TIVYKGMLRS AVLGKFYKDL EDPDYESQFS IYHRRFSTNT TPKWPLSQPM RFLGHNGEIN TLQGNLNWMA SKEADMENPI WGGREPEFRP ICNPAASDSA NLDRVAELLV RTGRAPAETM MLLVPEAHRN HPELDATFPE VHDFYDYYAG MQEAWDGPAL LVFSDGKQLG ARLDRNGLRP ARFWRTSDDY IYVASEVGVL GDVMSNASNV VSKGRLGPGM MIYADLETGE FKENTEIAKE VSARLPYGEW MKAIDRVKGI EPIGATQLDP IQLIECQARA GYAAEDITMI IESMASDAIE PTWSMGDDTP MPVLSGRPRL LYDYFKQRFA QVTNPAIDPL REGLVMSLAM TLGAKGNLLD TQGKETPPVM LDSPVLFDSE LEHIKNHPKL KTQTIAARYA AGGAAGALKA GLDKLCEEAA AAIRAGSECI VITDRPDQGP DSPAIPSLLA VGTVHHYLIA QGLRTRASIV VESASAFSTH HIATLVGFGA HAVCPWLALE TCRSWRKSPK VETAIQRGKM GDVSVEGVQV NFKNALNKGL KKILSKMGIS LITSYQGAQI FECYGLGPEV INTAFKGTVS RIGGLTMDEV AAETHMFVQS AFPGEAEEMA KVEARGMFQV KPGLEYHGNN QEMSKLLHKA VGLGGGEKND EFWSAYQAHR NDRPYTCLRD QLEIKSDRQP ISVDEVESVA DICTRFCTGG MSLGAISQEC HESIAIAMNR IGGKSNSGEG GEDPKRFETI TDATADGKSE TFPYLRGMEN GDVASSAIKQ VASGRFGVTT SFLMSANQTE IKVAQGAKPG EGGQLPGKKV SPYIAWLRRS KAGVPLISPP PHHDIYSIED LAQLIYDLHM VNKNSKVSVK LVSQAGIGTV ASGVAKANAD IIQISGGDGG TGASPLSSIK HCGGPLEMGL VESHRTLVEN GLRERVVLRA DGGCRSGLDV IQTALMGADE YGFGTVAMIA TGCVMARICH TNNCPVGVAS QREELRARFP GAPSDLVNFF MYAAQEVREI LAQMGYRSLD EIIGRNDLLS QIDKAPAKTS SLDLSFLTTS SGEAGASSDR IAQPVHNDGI VLDDKILSDP EVQKCIETEG TYTKKVEIVN VDRCATARVA GQIAKKYGDN GFAGSLTLDI EGSSGQSFGA FVVGGLKVRL VGEANDYVAK SMSGGEIAIM PPPNSPFAPE SASIAGNACL YGATGGQVFI SGRAGERFAV RNSLGEAVVE GTGDHCCEYM TGGCVVAIGK VGRNVGAGMT GGIGYFLDED GTFESKVNGE IVAMQRVITP AGEAQLKGLI SAHAEKTNSP KAKAILADWA NYLPKFWQLV PPSEANTPEA TNDVKAGVEA TA
|
| |