Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_24245 |
Symbol | |
ID | 5000841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 520899 |
End bp | 528035 |
Gene Length | 7137 bp |
Protein Length | 2378 aa |
Translation table | |
GC content | 57% |
IMG OID | 640416262 |
Product | NCS1 family transporter: cytosine/purines/uracil/thiamine/allantoin |
Protein accession | XP_001416972 |
Protein GI | 145344920 |
COG category | [F] Nucleotide transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG1953] Cytosine/uracil/thiamine/allantoin permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.175257 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00602767 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCGCTGC GCGCCGCCGC GAGCGCGGCG GCGGAGCGGC GGAGCGGCGC GCGCGAGAGC GGGCGACGGG CGGGGTCGAT CGGGACATTT GCGAAATCAA TCGTTGGGCA ATCGCATGCG GCAACGGCGG GGGGCGGGGC GCGCGCGCGG GGCGCGACGG TCGTCGCGCG CGTGGTCGGT GGAGGAAAGA AACGATGTGG ATTGGCGTGG GAGGCCGCGT GCGCGGGTGG GACGCGAAGG CGAGGCGTGG GGATGAGGGA GGCGTCGACG CGCGCGAACG CCGTCGGATT GGGGTACGCC GATCAAACCG GCGACGACGA TAAGGATGAC GAAGGGTTCA TCGTCGCGCC TGGGGACGAG GATTTGATCA TCCCGCCGAC GACGGAGGAG AATGCGATCG AGAAGGGCAA GGAACTCTCG CTGTGGGCGG CGCTGGCGAC GTGCGCGGTG ATGTATTCGG TGCCGTACGG TTTCGTTCGG TGGAACTTGC ACACGCTCGC GCCGTTGCTC TGGTATGAAC TCGGTTTGGC GACAGCGCTC GGTACTTTGA TGGCGATGCC ATTTATCTCT GCAATCGGAT ACCCCGCGGC GAAGTACGAA CTGAATATGC CGATCCTGGC GCGTTCGAGT TTCGGGATTT GGGGGGCGGA GTTCACCGAC GCCTGTCGCG CGACGCTTGG ATTCTTGCTC TACGCCGTGC AAACTTTGGT CGGGGGCGAA GCGGTGTTCG GATTGTTCAA AACGACGTTT ATCGCCGCGA GTATCGAAAG CGATGTAGTC GTGCGAGCGA TGTCGTACGC GATGTTTTGG GTGGTGCAGC TCGCAATTTC CTTGCGCACC GTCCCTTCAT CGAGACTTCT CGCGTCTGGT CAGACCGTCT TAGCGCTCGT GGCAGGTTTC ATCGCTTGGA CTTTGCGTCA AGGTATTCCT CAGACCGCGA TCGGTTCGTT GATTTCCGCG GGTATGCCGG GCGCCTTGCC GAACGAGTTC TGGAAGCACG CTTGGCTTAT GGTGGGCGTT TGGGTGACGC TCGCGAGCGT GTATCCAGAT TACACAGCGC ACATCAAGGG CTCTAGAACT GCACCTATTG GTCAGTTTTT CTGGCTTCCC TTGCTGTCCG GTTTGTTTGC TATCGTCGCC GCGGCCATGG CGCAAGCGCC GAAGCCGATT TTATTTGCGT CCATTGCCAT CGCCTCGCTC GTGACGAACA GTGCCGCAAA CGTCGTTGGT CCGCAGGAGT ATATGCAAGA ATTCTCGCTT CCGTGGGCAA AGACCAAGAA CTATCTTGGT GGTACGATCT CGTCCAAGCA AGCAGCGCTC ATCATCACCT GCGCTGTCGG CCTCTTTGCG CCGGCTCAGT TGATGTGGCA ACAAGTCGTC GCCGCTGCCT CGTGGATCAT TGGCGTTGGT TCGTTGTTAG TCTCTCCCGT GCTCGGTGTG ATTTTGTGTG ACTTCTGGCT TATGCGTGCT GGCAAATTAT CGCGACGAGC AATGCAGACG CGCGATACTA AGGGTCCATA TTGGTTCTTT AAGGGCGTGA ATCCACGCGC CATGATTGCC ATTGCCGTCG CGGCGGCGCC CAACTTGCTT TCACTCATTT CTGGTGTTTC CTATTTGATC GAAGCCGGTG CCTTCTCGGG TTCTGGCGCG TTCTACACGT ACGTAGTGAA CATGGAGTAC GCAAGTATTA TCGGTGCTGC GATTGCTTTC ATCGTCTATC TCTTCATTCA CTTGCTTGAG ATGAAGCCGG TGAGGCTCGG TGAAGTCGCG GCATCCATCG GAGACGCCGC AGGCACCGTC ACAACATCGG TAGCGTACAG CTTTGATGGA CGAACGAAGA AGACGATAAT CACGACGCGA GAGAAGGTGA ACACGCGTCT CGCCCGTAGG CGTGAAGAAT TGCCGGACGA GGAATACCAC GACGAAATCG ATGCTCGCGA GCGTAAACTT CGCAAGGAGG AAGATAGACG CCGCAAGGAA GCCGAGGAGG CTCTTCGAAA GGCTGTCGAG GATGAAAACA ACGAGCGGCG TAAAGAGAAC AGACGTCGTC GACTCGAGAA GGAGAGGATC GATAGGCTGT ACGACGAATG TGAAGCCATT TACTCGGAAG AGATCATCAC GACTAAGGTC ATCACTGAGA CACAAAAGGG TGACCTCGAG GACCTCATGG GTGGCGATGA TGGCGCACCC GTTTCAGTCG CGAAATTGCG CGAAGAAATC GAACGCCGCC GAAAGGAAAT CGATGCCCTT CGCAACTCTA TTCAACTACG CCGCGCGCAA GGTGAACAAG ATATCAACTT GGCGAAGAAT AACTTTGAGA AGGCGAAGGG ACTCGAGGCC GATCTTCGAG CAAGATTCGA CACCGAGACG CGATCTCTTT TGGAGAAGGA GGCTCGTTTG CAAGATGAAG AAAATCGGCG TCGTGCGGAA GCTGAGGCTC GAATTCAACG TATGCGAGAT GCTCTCGCCG CCGATCGTTC CGCTGAAGAA GCTCGACGAC GAGCGTGGGA TGACGAACAC GCGCGTGAGC TTGAAGAAGA GAGGCGTCTC CAGGAAACGG AGGATACCCG GCGAGAGCAG GCTCGTGAAG ACATCAGACG CGCAGTTGAT GATGAACGTG CGCGTCTTGA ACGCGAAGAA GCCGATCGTC AAGCTGCGGA GGAACGTCGT CGTGATGATG CTGCGCGTAA GCTTCTACAA ATGGAGGATG ACGAGGCAAA GCGTCGCGAC GACGAGGACA AGCGCCGCCG CGAAGCCGAA GCTGCCATCA ACGCTCTCCG GGAAGCGATT GTCAAGTTCG AACGCGATAC AAAAGATCGA CGCATCGCTG CCGAAAATGA CGCTGATCGC AGAAAGCAAG ACATCGAAGC TGCGCTTAAG GCTGAAGAAA TTCGCGCCGC GAACGCGCAA GCTGCGATGC GAGCCGCTGC AAAGGCAGAA GGCGATCGGC GAAAGGCAGA GGACGATGAT CTCGCCGCTA AGCGTCGAGC CCTTGAAGAG TTCAAGCAGC TTGGTATTCG TCTTAAGGAT ACGGAAGAAA ACCGACGGCA AAAGTATGCT CAAGAGATCG CTCGAATCAA CGCTGAAGAG GTGCGACGCA TCGCCACTGC TGAGGCCAAC CACAAATCTC GACTTGAAGA AATCAAGATC CTCGAAGAGG CTGAAAAGCG CAAAATTGCT GATGAAGATC AACGGCGCGC TCGTGTCGAG GCGCAAGCTG AGGCCGCTGA GGATGCTGAA CGCCGCAAAA GAGAAGCCGA AGATGCGCGT CGACGTGCTT ACGACGATGC CGCTCGAGCT GCGCGAGAAG CTGAAAACCG ACTTCGTGCC GAAGAGGATC AACGTCGTGC CGAAAATAAG CGTCATGAAG AAGAACTTGC TCGAGAAGAA GCGGCTAAGA AGGAAGAAGA AACACGTCGT CAGCAAGAAC AAGACAAGGC GAATTTGATT TTCAATTCCC AAGAAGGTCT GCGACGCACT TCTGAAGATT CTCGTCGCAT CGATTTCATC ATCAACAAGG ATGAATTCAA GACGAAGGAA CAGCTTCAAC GTGAAGCGGA AGAAGCGAGA CGTAAACAAG CCAAGCAAGA CCTTTTGGCT GCTGCTGAAG CAGAGCGTAG AAAGCGTGAA GAGGAAGAGC GCCGTCGACG TGAAGCCGAT GCTGCTGCTG CCGAGGCTGT GCGCAAGGAA GCCGAAAGAG TGGAAGCTGA AAATCGCCGT CGCGTCGCCG CAGAGCAAAA GTTGCGGGAC ATGGAGCAAC AAGAGCAACG GTTACGTATG GAGGCCGAGA ATAAACGTCG CGCGCAAGAA GGCGAGAGTC GCACGCGAAA AGAAGCGCTC GAGCGTGCTA TCGTTGCGGA GGAGAATCGC CGAGCCCGCG CAGCTGACGA GCTTCGTCGT CGCGAAGAAG ACGAGGCACG CACTCGAGCT CGTGAAGATC AACGCCGTGC CGATTGGGAT CGCGAACAAG CGCGTCGCGA CGCAGAAGAA CAAGCTCGTC GCGCCAGAGA AGATGCTAGA GCTAAAGAAG CGGAAGATGT TATTCGTGCT GCTGCGGAAG CCGAGAGAAA GAAGATGGCC GAGGAAGACG CTCGTCGCAA CGCCGAGGAA GAACGCATTA CAAAGATGCG CTCCGACGAA GACGCTGCGT GGGCCTCCGA GCTTGCTCGT CGTGAAGCTT TCAAGGCTGA ACTCCAAGCA AAACAAAGAG CTGAAGAAGA TCTCCGCGCT CAGGAAGAAC GCAGACGTGA AGATCATGCG AATGAGGTGT CCAGAAAGCG TCAAGAAATT GACGATTTCA TTCGCGACTT GGAACGTCGT CGACGCGAAG CTGATGCCGC CGCCAAGGCT GCACTTGAGG CTGAAGCGCA AAGAGTCACG GAGTTCAGAA AGTCTCGTGA AGAAGCTATA GCCGAAGATG AACGTCTTCG CGCGGAAGAT GCGGCCGAGC GCCTGAGAGA AGACGCCAAG CTGCTGCAAA TGCGTGAAGA GCTCAAGAAG CTTGTGCGCG ACGAAGCCAT GCGCCGTGCC GAAGAAGAAG CCCGACGCAA GGCGATGGAC AAGAAGATTG CCGATCAAAA GGCCATCGAC GACGATATTA TCGCGAAAGA GGACGCTAGA CGCGACCGAT TCCGTGATGA ACTGCCGGAC ATTCTCGACA AGTTCCGAGC GGAACTTGAA GCCAAGGAGG AGTTCTTACA GACTCAAGAG GAGATGCGCC GTAAGCAAGC TGCCGACGCC ATCGCGCAAA AGGAGGCCGC TGAGAAACAA CGAAGAGCTG ATTTTGACGC GCAGTGCGAG CGCAAGCAGA AAGAGATTGA CGACTTTGCG GCGAAGATCA AGGCTTTGTT CAACGCCGAG GAAGCTCGTC GCGCCGAGTT CAGGAAGATG ATGGAGGAGA GACAACGTAA GGAAGATGCG CGGCGACTTG CCGCTGCTTC GCCTATTGCT ATCGACAACA TGATGCAGAA TCTCAACGCT ACTATTACGC AGATGGAGTC GCAACTTCAA ATCCTCATTC CCGAATTGCA AGCTCTCGAA AAGACTGCAG CGGCCAACTA CGAAGAGTTT AAGGCGCGCC GTGCTCAGTC CCAGATGTCC ATTGAACGCA CCGTGACGGA GGAGCAGACT AGAACTCACA ATGAAGATGG ACGTCACGCT GCTGTCTGCG CGGAAGCTGA CGCAGCTGTT CGTGCCGAGC GAGCTCTACG TGAAGAGGAA GACGCCCGTA GAAAGGTGGC TGAGATGCGT ATCAAGCAGC TCGAAGCCGA GCTCGCCGCC ATCATCGCTG AAATTGATGC TCGACGTCTT GCATGGAACG CAGAAACTTT GAGCAGGAGA GATTCAGAGT GGGCCGCTAG ACAAGACGAG GACACTCGTA GATTTTTCAA CCAGACCGAA ACTGAGGGCG TTTTGTTCAA AGAATCTTCG ACGATCAGCT TCGAAGAATC CCGTCGTCTT GAAGCCGTCG CAGCTGCTGA ACGCGCGGAG GAGTTACGAA TTGCAACTGC AGCGGCGGAG GCCGAGCGCG TGGCTGCTTC GGAAAGACGC AAGCAATTGG CCCTCGACGA AGAGGAAAGC CGTCGCAAGG CGGCTGAAGC AGCGAAGAAG GACTCCAAAA AGCAAGCAGA AGACTTGAAG CGTCAAGCTG AAGAGCAGCG CCGCGCCAAA GAACAAGCTG AGAAGGAAGC TGCGGCAAAG GCCAAGCAAG CTGAGGAGGA AGCTGCGCGC GCCAAGAAAA AAGCTGACGA GGACGCTAAG GCGGCTGCCA AACGAGCTGA GGAGGAGAAG AAGCGTCTCG AGAAGGAACA AGAAGAGAAG AGAAAGCAAG CTGAGAAGGA GGCTGCCGAG GCGAAGCGTC GCGCTGAGAA GGAGGCGGAA GAACGTCGCA AGGCGGAAGC CAAGGCGCAG GAGGAGGCTG CGAAAGCTCA GAAGGCTGCT GAGGAAGCGA AGCGAAAGGA AGCCGAATTA GCGAAGCGTA AGGCTGAAGA AGAGGCGAAG GCGGCTAAGG CAAAGGCGGA TGCTGAAGCC AAGGCAAAGG CGGATGCCGA GGCCAAAGCC AAGGCGGACG CCAAGGCCAA GGCAGAGGCA GAGGCCAAAG CCAAGGCAGA GGCAGAGGCC AAAGCCAAGG CGGACGCCAA GGCAGAGGCA GAGGCCAAGG CAAAGGCGGA TGCAGAGGCC AAGGCAAAGG CGGATGCAGA GGCCAAGGCA AAGGCGGATG CAGAGGCCAA GGCAAAGGCG GATGCACAGG CCAAGACCAA GGCGGATGCA CAGGCCAAGG CAAAGGCGGA AGCCGTAGCC GCTGAGGCAA AGGCAAAGGC GGACGCTGCA GCCGCCGCGG AGCGCGCGGC CGAACAAGCT GAAGCCGCGT CAAAGCCGGA CAACGAAGTC ACCGATGAGC GTGCATCTCG TTTGCAAAAG CGACTTGAAA AGAGCGCATC CAAGGAGGAA ATCGAAGAGG TTGTCCTCGA GCTCTCTGAA GACGAAGATC CGATCGCCGC AACGTTGATC CTTCAAGGAG CAGCCAACGT GGAAGCTGCG GCGACAGCCC TAGCTGCTTT GTGGCGCGCA AAAGATACCC GAGGCGCTGA AGCGCTCATG GGTCTCGATT TGACCCGTGC TGCGACGTTG ATTGACATTA TGGTGGAGAA CCTTGGAGAC GCCGAAGCCG CCATCGGTTT GGTTAACATG TCCGAGCCCG GTCGCATGGT TGGTGTCGCT GAATTCGTCC TTCCGTACTA CCTGCGAAGC GTTGTGTACA ACGGGGTCGA AATCCTCCGC GTCGTTGCAA ACCGCAACGT CAAGACTGCG AAGGTTATTT ATGCTGGATT GGAGATTGAA GAACAAGTGA GCATTGTTCG CCGTGCGAGC CTCGGGTTCG GCGCGCGCCC GGGCGTGGAG AAGCCGGAAG ACAAACCCGA ACCTGATGTG AAACTCGCCG CGACTCTCTT GACTGGGCTC ACGCCATCCG CCGCTGTGGA TGTCTTGCGC ACCTTCAGCA CCACGGGTGT GAACAAGGGA AGCCCGAAGT GGAAGCGCCG CCAAGACGTC ATCGTTGAAG TTCTCGACGC CTTCAAGGAT ATCGGTCCCG GAGAGGACTC CATCGCTGAA CTCATTCGCG ATAGGCTCAA GCTGTGA
|
Protein sequence | MALRAAASAA AERRSGARES GRRAGSIGTF AKSIVGQSHA ATAGGGARAR GATVVARVVG GGKKRCGLAW EAACAGGTRR RGVGMREAST RANAVGLGYA DQTGDDDKDD EGFIVAPGDE DLIIPPTTEE NAIEKGKELS LWAALATCAV MYSVPYGFVR WNLHTLAPLL WYELGLATAL GTLMAMPFIS AIGYPAAKYE LNMPILARSS FGIWGAEFTD ACRATLGFLL YAVQTLVGGE AVFGLFKTTF IAASIESDVV VRAMSYAMFW VVQLAISLRT VPSSRLLASG QTVLALVAGF IAWTLRQGIP QTAIGSLISA GMPGALPNEF WKHAWLMVGV WVTLASVYPD YTAHIKGSRT APIGQFFWLP LLSGLFAIVA AAMAQAPKPI LFASIAIASL VTNSAANVVG PQEYMQEFSL PWAKTKNYLG GTISSKQAAL IITCAVGLFA PAQLMWQQVV AAASWIIGVG SLLVSPVLGV ILCDFWLMRA GKLSRRAMQT RDTKGPYWFF KGVNPRAMIA IAVAAAPNLL SLISGVSYLI EAGAFSGSGA FYTYVVNMEY ASIIGAAIAF IVYLFIHLLE MKPVRLGEVA ASIGDAAGTV TTSVAYSFDG RTKKTIITTR EKVNTRLARR REELPDEEYH DEIDARERKL RKEEDRRRKE AEEALRKAVE DENNERRKEN RRRRLEKERI DRLYDECEAI YSEEIITTKV ITETQKGDLE DLMGGDDGAP VSVAKLREEI ERRRKEIDAL RNSIQLRRAQ GEQDINLAKN NFEKAKGLEA DLRARFDTET RSLLEKEARL QDEENRRRAE AEARIQRMRD ALAADRSAEE ARRRAWDDEH ARELEEERRL QETEDTRREQ AREDIRRAVD DERARLEREE ADRQAAEERR RDDAARKLLQ MEDDEAKRRD DEDKRRREAE AAINALREAI VKFERDTKDR RIAAENDADR RKQDIEAALK AEEIRAANAQ AAMRAAAKAE GDRRKAEDDD LAAKRRALEE FKQLGIRLKD TEENRRQKYA QEIARINAEE VRRIATAEAN HKSRLEEIKI LEEAEKRKIA DEDQRRARVE AQAEAAEDAE RRKREAEDAR RRAYDDAARA AREAENRLRA EEDQRRAENK RHEEELAREE AAKKEEETRR QQEQDKANLI FNSQEGLRRT SEDSRRIDFI INKDEFKTKE QLQREAEEAR RKQAKQDLLA AAEAERRKRE EEERRRREAD AAAAEAVRKE AERVEAENRR RVAAEQKLRD MEQQEQRLRM EAENKRRAQE GESRTRKEAL ERAIVAEENR RARAADELRR REEDEARTRA REDQRRADWD REQARRDAEE QARRAREDAR AKEAEDVIRA AAEAERKKMA EEDARRNAEE ERITKMRSDE DAAWASELAR REAFKAELQA KQRAEEDLRA QEERRREDHA NEVSRKRQEI DDFIRDLERR RREADAAAKA ALEAEAQRVT EFRKSREEAI AEDERLRAED AAERLREDAK LLQMREELKK LVRDEAMRRA EEEARRKAMD KKIADQKAID DDIIAKEDAR RDRFRDELPD ILDKFRAELE AKEEFLQTQE EMRRKQAADA IAQKEAAEKQ RRADFDAQCE RKQKEIDDFA AKIKALFNAE EARRAEFRKM MEERQRKEDA RRLAAASPIA IDNMMQNLNA TITQMESQLQ ILIPELQALE KTAAANYEEF KARRAQSQMS IERTVTEEQT RTHNEDGRHA AVCAEADAAV RAERALREEE DARRKVAEMR IKQLEAELAA IIAEIDARRL AWNAETLSRR DSEWAARQDE DTRRFFNQTE TEGVLFKESS TISFEESRRL EAVAAAERAE ELRIATAAAE AERVAASERR KQLALDEEES RRKAAEAAKK DSKKQAEDLK RQAEEQRRAK EQAEKEAAAK AKQAEEEAAR AKKKADEDAK AAAKRAEEEK KRLEKEQEEK RKQAEKEAAE AKRRAEKEAE ERRKAEAKAQ EEAAKAQKAA EEAKRKEAEL AKRKAEEEAK AAKAKADAEA KAKADAEAKA KADAKAKAEA EAKAKAEAEA KAKADAKAEA EAKAKADAEA KAKADAEAKA KADAEAKAKA DAQAKTKADA QAKAKAEAVA AEAKAKADAA AAAERAAEQA EAASKPDNEV TDERASRLQK RLEKSASKEE IEEVVLELSE DEDPIAATLI LQGAANVEAA ATALAALWRA KDTRGAEALM GLDLTRAATL IDIMVENLGD AEAAIGLVNM SEPGRMVGVA EFVLPYYLRS VVYNGVEILR VVANRNVKTA KVIYAGLEIE EQVSIVRRAS LGFGARPGVE KPEDKPEPDV KLAATLLTGL TPSAAVDVLR TFSTTGVNKG SPKWKRRQDV IVEVLDAFKD IGPGEDSIAE LIRDRLKL
|
| |