Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_29184 |
Symbol | |
ID | 4999878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 957222 |
End bp | 960972 |
Gene Length | 3751 bp |
Protein Length | 1247 aa |
Translation table | |
GC content | 54% |
IMG OID | 640415299 |
Product | CPA1 family transporter: sodium ion/proton |
Protein accession | XP_001415987 |
Protein GI | 145341792 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0025] NhaP-type Na+/H+ and K+/H+ antiporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCCTC GCGTCGGTCG CACGCTGCTC AGCGCGGCGG CGTCGAACGC CACGGCTGTC AACGCCGCAG CGCACGACGA TGCGACGGCA CCGCTGTGGT TCATATTCAT CTCTCTAGCC GTCGGCAGCG CGGCGAGGAC GGCGCTTCAC GGGACGTCGG TACCGTACAC GGTTGTGCTG TTCATCATCG GCATGGTCAT GGCGGTGGTG TATTACTACG CAGACTTGGG ACTGTTAGAG GCGTCGCTGA GCGCGTGGGG TGGGATCAAC CCGCACGTGT TGTTGGCGTG TTTCCTACCT GCGTTGATTT TTGAGTCGTC GTTTTCGATG GAGTGGCACA CGCTGAAAAA GGTGCTGCCC AAGGTGTTGA TCTTGGCCGG GCCCGGAGTG TTGATGAGCA TGGCGCTCAC TGGGTGCGTG TTACGCGTGT TTCCGTTCGA TTTTAGCTGG GCATTTTGTA TGATGGTGGG GTCGATTTTG GCGGCGACAG ATCCCGTCGC CGTCGTCGCC ATCTTGAAGG AGGTCGGGGC GAGTAAAGTT TTGGGACATT TGATCGAAGG CGAGTCGCTC GTGAATGATG GTACCGCCAT CGTCGTCTTC AACGTTTTTT TGGCGATCGC AAAAGGCGAG CATCTCTCGT TCGGAGATGT GATGAAGGAT TTAATCACTC AACCCATCTT ATCAGTCATC ACCGGCGTCG TCGTCGGCGA ACTGTGCTTG ATTTGGTTGC GAAATTCCCA TAAAGACCAT TTAGTAGATA TTACCATCAC TCTTTTTGCT GGTTATATTT CTTTCATGCT CGCCGAAATC GTCATCAAGT GCAGCGGCGT GCTTGCCGTC GTCGTTCTCG GATGGTACCT GAGCGCCAAG GGCAGAACGT TTTTCAAGGG CAACATACAA CACTCTATGC ATCACTTTTG GGAGGTGCTA ACATACGCGG CTAACACCAT CGTATTCATT CTTGCCGGTT TTCTGATCGT CGCAGATATT TTGCTCGGCG AACTGGACAT CAAAGCCAGC GACCTGGGGT GGGGTATTCT CCTCTACTTG TGCCTCAACA TAATTCGTTT TGTCGTCACG GCGATGCTGT TTCCGTGGAT GAACTACTAC AAAGGACACG TCTTAAGCAT GCGTGAGACC TTTATAGCCA TTTGGAGCGG TCTACGTGGC GCTGTCGGTT TGGCGCTCGC GCTCGTCGTG TTGGAGGATG ATGCGCATTT CACAAAGCGA GAGCGTGGAT TGACGCTCGT AATGGTTTCC GCCGCAGTGA TTGGGACGCT GTTAATCAAT GCATCTTCTG CGGCAGCTAT GCTTCGCATT GTCGGCCTGC TTACGCCCGG GCTTAGTCAC GACCGAACGT TATTGGCTGC TCGCAAAGCC ATTCACGAGC GCGCGCTCGC GCACTACGAC GAAAAGCTGG ACCAAACTGG TGCGCAGTAT CTCGGCGCGC CCGACTTCAC GGCGGTTAAG CAGCTCGTGC CATTCCTTCG CACACAAACA GACGAAACGG AAATCGAGTT CAAGCACGAG ATCGCGAAAA TGGACGCTCA GCTCATGCAC GAGATGCGTG ATCGTTTCTT GGCCGGGCTC TGTAAAGAAG TCTGGTCGAT GCTTGAAGAC GACTTGATTG ACGCAAAAGT TGCTCACGGG TTGCTTATCG CCTGCGAGCG AGCAACTGAT AAGTCAAAAT CGAACGTATT GTGTACGTTC GACGAGTTCA ACCGCTTCAT GAAGCAGAGC CGCGTGAAGA GCGCCGTCGT CAAGGCGTTT GAACGTCGAG CGTGGGGTAT GCGAATCATC AAAACCTTGC GTTACCTCAG GCTCGTCGAA GAAGGCTGGA TGGGCACGAG ACGTCAAATG CAAGGTTTAC ATGCCCTAAT TAGTGCCCAT CGCTTGACTT CAGAGCAATT CAAATCTTTA CTAGGCAAAA ACGAAGAGGA AGAGCAAGCC AAGGAGGAAG CGCAATCGGA AGTCGCCGAG CTGGCAAAGA GTTATACACT CACAAAGGCT GGTGTCGGCG AAAATCTAGC AAACGAAACG TCCATCAAGT CACATCATGT GCGAGATGTT CTCTTCGAGT CGGATACCGA GGTCGTCCAA GCCGTTGAAA TGCTGAAAGA GATGCAGCAC GCTCACAACG CTGTGGCGCG TTCGCTGAGA TCCGAGGCAC TCGCGCGTGA AATTCTGGAG GTGACGATGG AAGAAACAAA ACTCTTGACG CATACGGGAT TGCTGGACGA ATCGGAAGGA TCAATGTTAC AGGCCGAGCA CGACCTCTAC CTTCAAAGGC TGATTATGCA TCCGCTCCCA CCATCGCGTA CGTCACCTGG CCGCATCCTC GCTGCGATTC CGCTCTTTGA TTTCACGCAA CGCGTGGGCA TCAAGTCGCG AGAGATTCGT AAACACTTCC TTCGAGAAGC AGAAGTTGTA GAGTTTAACG TGGGTGACAT CATCATCGAC CCCAATGTCA AGTTGAACGA CGGAATTATG CTTGTCGGCA ACGGCGTCGT ACTCATCGAG TCTGCGGCGG ATGCGACGCA ACAAATTACG GCAGGCTGTA AGTCGCATCC GCACGCGTGG GCTTTGGCCT TGTACGAAGC GCTGGCAAAT GACGCGAATG AAGAAGAACC AAATCGTAGT GGTATTCGTG CGACGGCGCA GACGACTATG TTTGGTTTCG TCCTTCCGCG CTCACTTTTG GACAGCCTAC GCGGCGATCT TCGTGAGGCG GAAACGCTCG AGTTTATGTG GCGAGTGGTG CTCGGTTTGT TGGCGGAGAC CGTGTTGCGC AGCGAACTAG ATAGCGTCTT ACCGGATCTT CCCGTGAATA AACTCGTACG CGCGGGTAAA CTCGTGAAAA TTTCGCAAGG ATCAAAACTC GAGCACGACG ACAGACAAGT GGTGATTTTG CTTCGTGGAT CTCTGTACAA CAATACGTGC GGCAACATCC CAGCGCCGTG CGTCGTGACT TTCAAACGGA CGGGCATGCT ATCCAACGCA ACAACGTCGG CTTCGAGCGC GCTTGAGAGT AAGCTTGAGT CGCGCTTGTC GGCGTTGAAA TTCGGGAAAA ATGGTCTCGT GCGCGGCTCG TCTCAAAATC TCGCCGCGCT CGCGGCGAAG ATGTCCGTGG ACGAATCCGC ATCTGAATCA CTTCCCGTGG AGCCCGACAG GGATGGTTAC TGCACGTTTG AGAGCGTCAA TGATTTGCTC ATTTACGTCA TCCCGACCGA CAAGCTCTTG CAATCGCCGA CCTCTGGTAA ATCGAGAACT TCGCTCAGGG CGCAAAAGAA GAGGGAATTT TTACGAGGCT TGTCGAACAA GGTTTCGCGA CAACGCTCCA TTTTCATCGG CCCAGACAAC CGATTGAACG CTCAAAAGTC GACGATGCCG CGACGATCGA TCGACTCTGA TCGACGATCG ATCGAGTTCG ACATCGAGTC TGGCGTCGCG AACGCGACGT CGCGTTCGCC GAATCGTCGC TTGTCGTTGC AACTACCCCG ACAGCCCACG GTGGCCGAAG ATGCGGACAA CCCAATCGGT GAAGATCCGC GCACGCAACT CACGTCCAAT TCCATGATCT CATTGGGCAC GGACGACGAC AACGCGCAAG ACACGCACCG ATCGGACGAC TTGGTGCCTT TACGATCACG ATCGCGAGTG TGGGACGATT CGCCGATTCC GGCGCCTCGC GTCTTACCGA GCCGCGACTA CGAGGATCCG TCTACGCAAC ACACGATTTC CTAGTACTAG C
|
Protein sequence | MTPRVGRTLL SAAASNATAV NAAAHDDATA PLWFIFISLA VGSAARTALH GTSVPYTVVL FIIGMVMAVV YYYADLGLLE ASLSAWGGIN PHVLLACFLP ALIFESSFSM EWHTLKKVLP KVLILAGPGV LMSMALTGCV LRVFPFDFSW AFCMMVGSIL AATDPVAVVA ILKEVGASKV LGHLIEGESL VNDGTAIVVF NVFLAIAKGE HLSFGDVMKD LITQPILSVI TGVVVGELCL IWLRNSHKDH LVDITITLFA GYISFMLAEI VIKCSGVLAV VVLGWYLSAK GRTFFKGNIQ HSMHHFWEVL TYAANTIVFI LAGFLIVADI LLGELDIKAS DLGWGILLYL CLNIIRFVVT AMLFPWMNYY KGHVLSMRET FIAIWSGLRG AVGLALALVV LEDDAHFTKR ERGLTLVMVS AAVIGTLLIN ASSAAAMLRI VGLLTPGLSH DRTLLAARKA IHERALAHYD EKLDQTGAQY LGAPDFTAVK QLVPFLRTQT DETEIEFKHE IAKMDAQLMH EMRDRFLAGL CKEVWSMLED DLIDAKVAHG LLIACERATD KSKSNVLCTF DEFNRFMKQS RVKSAVVKAF ERRAWGMRII KTLRYLRLVE EGWMGTRRQM QGLHALISAH RLTSEQFKSL LGKNEEEEQA KEEAQSEVAE LAKSYTLTKA GVGENLANET SIKSHHVRDV LFESDTEVVQ AVEMLKEMQH AHNAVARSLR SEALAREILE VTMEETKLLT HTGLLDESEG SMLQAEHDLY LQRLIMHPLP PSRTSPGRIL AAIPLFDFTQ RVGIKSREIR KHFLREAEVV EFNVGDIIID PNVKLNDGIM LVGNGVVLIE SAADATQQIT AGCKSHPHAW ALALYEALAN DANEEEPNRS GIRATAQTTM FGFVLPRSLL DSLRGDLREA ETLEFMWRVV LGLLAETVLR SELDSVLPDL PVNKLVRAGK LVKISQGSKL EHDDRQVVIL LRGSLYNNTC GNIPAPCVVT FKRTGMLSNA TTSASSALES KLESRLSALK FGKNGLVRGS SQNLAALAAK MSVDESASES LPVEPDRDGY CTFESVNDLL IYVIPTDKLL QSPTSGKSRT SLRAQKKREF LRGLSNKVSR QRSIFIGPDN RLNAQKSTMP RRSIDSDRRS IEFDIESGVA NATSRSPNRR LSLQLPRQPT VAEDADNPIG EDPRTQLTSN SMISLGTDDD NAQDTHRSDD LVPLRSRSRV WDDSPIPAPR VLPSRDYEDP STQHTIS
|
| |