Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_119536 |
Symbol | Pho4 |
ID | 5000286 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 506629 |
End bp | 508564 |
Gene Length | 1936 bp |
Protein Length | 600 aa |
Translation table | |
GC content | 55% |
IMG OID | 640415707 |
Product | high affinity phosphate transporter, probable |
Protein accession | XP_001416412 |
Protein GI | 145343615 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0306] Phosphate/sulphate permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0681687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTTC TCCCGAGCCT GGATTCTGGA CTCGATTGGA ACCAATCGCG TCTTCTTTCG AGGTTCACGA ACCGTGAATT CGTTCCGCAA GGAATATTCT CGGAATCTTG CCTTTTCAAA TTCCCGCGTG GAAATATGCG CGGTCGCCGC CAGAACACGT CGCGCGCCGT CTCGTCACGC ACAACGATGA GCTCTTCGGA TACGCTCGCT CAAGCGTCCC GCCACCTTTT GGCCGGCGCT ACTGAAACCA AGAGATTCGA ATGGATCGTC GTTTGCGGCA GTTTCCTCGC CTTCTTCGCC GCCTTTGGCA TCGGTACGCA CACGATGCGC GCACTTTTTA CCTTTCGCGT TCATAGGCTT TAATGACACG GCTTGGTCGC ACACGTGCGA TTGGAGTGTA TTAGAAGCAT CACTGACGTC GTGCGATCAA TCGTATTTAC CTACAGGTGC CAACGACGTC GCGAATGCGT TCGCCACCTC TGTCGGGTCC GGCGCGTTGA CCATTAAAAA CGCTGTAGTC CTCGCCGCGA TCTTTGAATT CTGTGGTGCC ATGTTCATGG GCGGTCACGT CGTCAAGACA ATTCGCAAGG GCATTGCTAA CCAGAAATGC TTCGCCGGTA CCGGTGGTGC GAACGATCCC GGCCTCTTGA TGTACGGCTG CCTATGCGTC ATTTTTGCCG TCGCCATCTG GCTCGTCATC GCATCTGCCT TCGAAATGGC CGTCTCCACG ACGCACTCGT GCGTTGGTGG TATGATTGGA ATGACTTTGG TCGCGCGTGG TAGCGAGTGT GTCATTTGGA CCAAGAAAGC GGACGAATTC CCGTACGTCA AAGGCGTCGT CGCCGTCATC ATCTCCTGGC TCTTGTCCCC GGTCATCTCT GGCGCCTTTG CGTTCGTCTT CTTCGTGACC CTGCGTACTC TCGTCATGCG TTCCGAACAT TCTTATTCTC GTACTGCCGT CGCGTTCCCA GTGTTGCTTG CGTGCACTCT CATCATCAAC ATCTTCTTCA TCGTGTACAA GGGCGCCAAG TTCCTTGAGC TCGACGACAC ACCGGTCGGT ACGGCGTGCG CCATCGCGTT CGGCATCGGC GGAGGATGTG GCATTATTGC GTACTTCTTC GTTACCCCTT ACATCCTCAA GACGACCGAT GAGCTCTTCG AGAAGCAACA GCTCGAAAAG GCGGAGCGTG GTTCGGGCAA GAAGGCCGAA GAGAAGGTCG TCCGTCAACC GCGCGAGTAT CCTGTCGGTG TCTTTGGCGC GCCGCGTCGC ATGTGGTACG CTCTGCAAGA TCATCTTGAA TCTTCACTCG CCCACAAGGC TGAAGACATT CTTGACGAAG ACATGGCTGT ATTGGCGATC CACGAAAACG CCGAAAAGTT TGACGAAAAG ACTGAACTTT GCATGCGATA CTTGCAAATC CTCACCGCGT GCTGCGACTC GTTCGCGCAC GGCGCCAACG ATGTGGCCAA CTCTATCGGT CCCTTCGCCT CTATGGTAGT CGTCTTCAAG AGCGGTAAAG TTTCGAAGGA AGCTGAAATG GGCGACGATT CATATTGGAT TCTCGGTCTT GGCGCTGCCG GCATCGTCTG TGGCCTCGCC TTGTACGGCT ACAAGATTCT TCACGCTCTC GGTACCAAGA TTGCCAAGCT CACCCCGAGT CGCGGTATCT GCATCGAGCT TGGTGCGGCT TGCGTCATCA TCATGGGATC CCGTCTCGGC TGGCCGCTGT CTACCACTCA CTGCCAAGTT GGTGCCACCG TCGGCGTCGC CCTACTCGAA GGCCGCAAGG GCATCAACTG GTTCATCATC GGTAAAACTG TGTTCGGCTG GATCATCACC CTCGTCATCG TCGGCTTCTC CACGGCGGCG TTCTTCGCAC AAGGTGCCTA CGCTCCGATG AAGAGCTACC CGTGCTACAT CACAGGTAGT TGCTAA
|
Protein sequence | MNFLPSLDSG LDWNQSRLLS RFTNREFVPQ GIFSESCLFK FPRGNMRGRR QNTSRAVSSR TTMSSSDTLA QASRHLLAGA TETKRFEWIV VCGSFLAFFA AFGIGANDVA NAFATSVGSG ALTIKNAVVL AAIFEFCGAM FMGGHVVKTI RKGIANQKCF AGTGGANDPG LLMYGCLCVI FAVAIWLVIA SAFEMAVSTT HSCVGGMIGM TLVARGSECV IWTKKADEFP YVKGVVAVII SWLLSPVISG AFAFVFFVTL RTLVMRSEHS YSRTAVAFPV LLACTLIINI FFIVYKGAKF LELDDTPVGT ACAIAFGIGG GCGIIAYFFV TPYILKTTDE LFEKQQLEKA ERGSGKKAEE KVVRQPREYP VGVFGAPRRM WYALQDHLES SLAHKAEDIL DEDMAVLAIH ENAEKFDEKT ELCMRYLQIL TACCDSFAHG ANDVANSIGP FASMVVVFKS GKVSKEAEMG DDSYWILGLG AAGIVCGLAL YGYKILHALG TKIAKLTPSR GICIELGAAC VIIMGSRLGW PLSTTHCQVG ATVGVALLEG RKGINWFIIG KTVFGWIITL VIVGFSTAAF FAQGAYAPMK SYPCYITGSC
|
| |