Gene OSTLU_29184 details


Gene Information

Locus tag: OSTLU_29184
Symbol:
ID: 4999878
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009355
Strand:
Start bp: 957222
End bp: 960972
Gene Length: 3751 bp
Protein Length: 1247 aa
Translation table:
GC content: 54%
IMG OID: 640415299
Product: CPA1 family transporter: sodium ion/proton
Protein accession: XP_001415987
Protein GI: 145341792
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0025] NhaP-type Na+/H+ and K+/H+ antiporters
TIGRFAM ID:
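
The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The names `span_length`, `gene_length_bp`, and `cds_length_bp` are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 957222
end_bp = 960972
protein_length_aa = 1247


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


gene_length_bp = span_length(start_bp, end_bp)

# Coding bases implied by the protein length: one codon per residue plus one stop codon
# (assumption: the reported protein length does not count the stop).
cds_length_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)                  # 3751, matching the Gene Length field
print(cds_length_bp)                   # 3744
print(gene_length_bp - cds_length_bp)  # 7 bases beyond the implied coding region
```

Under these assumptions the gene span is 7 bp longer than the coding region implied by the protein length, which is consistent with the few bases that follow the stop codon at the end of the gene sequence listed in this record.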


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCCTC GCGTCGGTCG CACGCTGCTC AGCGCGGCGG CGTCGAACGC CACGGCTGTC 
AACGCCGCAG CGCACGACGA TGCGACGGCA CCGCTGTGGT TCATATTCAT CTCTCTAGCC
GTCGGCAGCG CGGCGAGGAC GGCGCTTCAC GGGACGTCGG TACCGTACAC GGTTGTGCTG
TTCATCATCG GCATGGTCAT GGCGGTGGTG TATTACTACG CAGACTTGGG ACTGTTAGAG
GCGTCGCTGA GCGCGTGGGG TGGGATCAAC CCGCACGTGT TGTTGGCGTG TTTCCTACCT
GCGTTGATTT TTGAGTCGTC GTTTTCGATG GAGTGGCACA CGCTGAAAAA GGTGCTGCCC
AAGGTGTTGA TCTTGGCCGG GCCCGGAGTG TTGATGAGCA TGGCGCTCAC TGGGTGCGTG
TTACGCGTGT TTCCGTTCGA TTTTAGCTGG GCATTTTGTA TGATGGTGGG GTCGATTTTG
GCGGCGACAG ATCCCGTCGC CGTCGTCGCC ATCTTGAAGG AGGTCGGGGC GAGTAAAGTT
TTGGGACATT TGATCGAAGG CGAGTCGCTC GTGAATGATG GTACCGCCAT CGTCGTCTTC
AACGTTTTTT TGGCGATCGC AAAAGGCGAG CATCTCTCGT TCGGAGATGT GATGAAGGAT
TTAATCACTC AACCCATCTT ATCAGTCATC ACCGGCGTCG TCGTCGGCGA ACTGTGCTTG
ATTTGGTTGC GAAATTCCCA TAAAGACCAT TTAGTAGATA TTACCATCAC TCTTTTTGCT
GGTTATATTT CTTTCATGCT CGCCGAAATC GTCATCAAGT GCAGCGGCGT GCTTGCCGTC
GTCGTTCTCG GATGGTACCT GAGCGCCAAG GGCAGAACGT TTTTCAAGGG CAACATACAA
CACTCTATGC ATCACTTTTG GGAGGTGCTA ACATACGCGG CTAACACCAT CGTATTCATT
CTTGCCGGTT TTCTGATCGT CGCAGATATT TTGCTCGGCG AACTGGACAT CAAAGCCAGC
GACCTGGGGT GGGGTATTCT CCTCTACTTG TGCCTCAACA TAATTCGTTT TGTCGTCACG
GCGATGCTGT TTCCGTGGAT GAACTACTAC AAAGGACACG TCTTAAGCAT GCGTGAGACC
TTTATAGCCA TTTGGAGCGG TCTACGTGGC GCTGTCGGTT TGGCGCTCGC GCTCGTCGTG
TTGGAGGATG ATGCGCATTT CACAAAGCGA GAGCGTGGAT TGACGCTCGT AATGGTTTCC
GCCGCAGTGA TTGGGACGCT GTTAATCAAT GCATCTTCTG CGGCAGCTAT GCTTCGCATT
GTCGGCCTGC TTACGCCCGG GCTTAGTCAC GACCGAACGT TATTGGCTGC TCGCAAAGCC
ATTCACGAGC GCGCGCTCGC GCACTACGAC GAAAAGCTGG ACCAAACTGG TGCGCAGTAT
CTCGGCGCGC CCGACTTCAC GGCGGTTAAG CAGCTCGTGC CATTCCTTCG CACACAAACA
GACGAAACGG AAATCGAGTT CAAGCACGAG ATCGCGAAAA TGGACGCTCA GCTCATGCAC
GAGATGCGTG ATCGTTTCTT GGCCGGGCTC TGTAAAGAAG TCTGGTCGAT GCTTGAAGAC
GACTTGATTG ACGCAAAAGT TGCTCACGGG TTGCTTATCG CCTGCGAGCG AGCAACTGAT
AAGTCAAAAT CGAACGTATT GTGTACGTTC GACGAGTTCA ACCGCTTCAT GAAGCAGAGC
CGCGTGAAGA GCGCCGTCGT CAAGGCGTTT GAACGTCGAG CGTGGGGTAT GCGAATCATC
AAAACCTTGC GTTACCTCAG GCTCGTCGAA GAAGGCTGGA TGGGCACGAG ACGTCAAATG
CAAGGTTTAC ATGCCCTAAT TAGTGCCCAT CGCTTGACTT CAGAGCAATT CAAATCTTTA
CTAGGCAAAA ACGAAGAGGA AGAGCAAGCC AAGGAGGAAG CGCAATCGGA AGTCGCCGAG
CTGGCAAAGA GTTATACACT CACAAAGGCT GGTGTCGGCG AAAATCTAGC AAACGAAACG
TCCATCAAGT CACATCATGT GCGAGATGTT CTCTTCGAGT CGGATACCGA GGTCGTCCAA
GCCGTTGAAA TGCTGAAAGA GATGCAGCAC GCTCACAACG CTGTGGCGCG TTCGCTGAGA
TCCGAGGCAC TCGCGCGTGA AATTCTGGAG GTGACGATGG AAGAAACAAA ACTCTTGACG
CATACGGGAT TGCTGGACGA ATCGGAAGGA TCAATGTTAC AGGCCGAGCA CGACCTCTAC
CTTCAAAGGC TGATTATGCA TCCGCTCCCA CCATCGCGTA CGTCACCTGG CCGCATCCTC
GCTGCGATTC CGCTCTTTGA TTTCACGCAA CGCGTGGGCA TCAAGTCGCG AGAGATTCGT
AAACACTTCC TTCGAGAAGC AGAAGTTGTA GAGTTTAACG TGGGTGACAT CATCATCGAC
CCCAATGTCA AGTTGAACGA CGGAATTATG CTTGTCGGCA ACGGCGTCGT ACTCATCGAG
TCTGCGGCGG ATGCGACGCA ACAAATTACG GCAGGCTGTA AGTCGCATCC GCACGCGTGG
GCTTTGGCCT TGTACGAAGC GCTGGCAAAT GACGCGAATG AAGAAGAACC AAATCGTAGT
GGTATTCGTG CGACGGCGCA GACGACTATG TTTGGTTTCG TCCTTCCGCG CTCACTTTTG
GACAGCCTAC GCGGCGATCT TCGTGAGGCG GAAACGCTCG AGTTTATGTG GCGAGTGGTG
CTCGGTTTGT TGGCGGAGAC CGTGTTGCGC AGCGAACTAG ATAGCGTCTT ACCGGATCTT
CCCGTGAATA AACTCGTACG CGCGGGTAAA CTCGTGAAAA TTTCGCAAGG ATCAAAACTC
GAGCACGACG ACAGACAAGT GGTGATTTTG CTTCGTGGAT CTCTGTACAA CAATACGTGC
GGCAACATCC CAGCGCCGTG CGTCGTGACT TTCAAACGGA CGGGCATGCT ATCCAACGCA
ACAACGTCGG CTTCGAGCGC GCTTGAGAGT AAGCTTGAGT CGCGCTTGTC GGCGTTGAAA
TTCGGGAAAA ATGGTCTCGT GCGCGGCTCG TCTCAAAATC TCGCCGCGCT CGCGGCGAAG
ATGTCCGTGG ACGAATCCGC ATCTGAATCA CTTCCCGTGG AGCCCGACAG GGATGGTTAC
TGCACGTTTG AGAGCGTCAA TGATTTGCTC ATTTACGTCA TCCCGACCGA CAAGCTCTTG
CAATCGCCGA CCTCTGGTAA ATCGAGAACT TCGCTCAGGG CGCAAAAGAA GAGGGAATTT
TTACGAGGCT TGTCGAACAA GGTTTCGCGA CAACGCTCCA TTTTCATCGG CCCAGACAAC
CGATTGAACG CTCAAAAGTC GACGATGCCG CGACGATCGA TCGACTCTGA TCGACGATCG
ATCGAGTTCG ACATCGAGTC TGGCGTCGCG AACGCGACGT CGCGTTCGCC GAATCGTCGC
TTGTCGTTGC AACTACCCCG ACAGCCCACG GTGGCCGAAG ATGCGGACAA CCCAATCGGT
GAAGATCCGC GCACGCAACT CACGTCCAAT TCCATGATCT CATTGGGCAC GGACGACGAC
AACGCGCAAG ACACGCACCG ATCGGACGAC TTGGTGCCTT TACGATCACG ATCGCGAGTG
TGGGACGATT CGCCGATTCC GGCGCCTCGC GTCTTACCGA GCCGCGACTA CGAGGATCCG
TCTACGCAAC ACACGATTTC CTAGTACTAG C
 
Protein sequence
MTPRVGRTLL SAAASNATAV NAAAHDDATA PLWFIFISLA VGSAARTALH GTSVPYTVVL 
FIIGMVMAVV YYYADLGLLE ASLSAWGGIN PHVLLACFLP ALIFESSFSM EWHTLKKVLP
KVLILAGPGV LMSMALTGCV LRVFPFDFSW AFCMMVGSIL AATDPVAVVA ILKEVGASKV
LGHLIEGESL VNDGTAIVVF NVFLAIAKGE HLSFGDVMKD LITQPILSVI TGVVVGELCL
IWLRNSHKDH LVDITITLFA GYISFMLAEI VIKCSGVLAV VVLGWYLSAK GRTFFKGNIQ
HSMHHFWEVL TYAANTIVFI LAGFLIVADI LLGELDIKAS DLGWGILLYL CLNIIRFVVT
AMLFPWMNYY KGHVLSMRET FIAIWSGLRG AVGLALALVV LEDDAHFTKR ERGLTLVMVS
AAVIGTLLIN ASSAAAMLRI VGLLTPGLSH DRTLLAARKA IHERALAHYD EKLDQTGAQY
LGAPDFTAVK QLVPFLRTQT DETEIEFKHE IAKMDAQLMH EMRDRFLAGL CKEVWSMLED
DLIDAKVAHG LLIACERATD KSKSNVLCTF DEFNRFMKQS RVKSAVVKAF ERRAWGMRII
KTLRYLRLVE EGWMGTRRQM QGLHALISAH RLTSEQFKSL LGKNEEEEQA KEEAQSEVAE
LAKSYTLTKA GVGENLANET SIKSHHVRDV LFESDTEVVQ AVEMLKEMQH AHNAVARSLR
SEALAREILE VTMEETKLLT HTGLLDESEG SMLQAEHDLY LQRLIMHPLP PSRTSPGRIL
AAIPLFDFTQ RVGIKSREIR KHFLREAEVV EFNVGDIIID PNVKLNDGIM LVGNGVVLIE
SAADATQQIT AGCKSHPHAW ALALYEALAN DANEEEPNRS GIRATAQTTM FGFVLPRSLL
DSLRGDLREA ETLEFMWRVV LGLLAETVLR SELDSVLPDL PVNKLVRAGK LVKISQGSKL
EHDDRQVVIL LRGSLYNNTC GNIPAPCVVT FKRTGMLSNA TTSASSALES KLESRLSALK
FGKNGLVRGS SQNLAALAAK MSVDESASES LPVEPDRDGY CTFESVNDLL IYVIPTDKLL
QSPTSGKSRT SLRAQKKREF LRGLSNKVSR QRSIFIGPDN RLNAQKSTMP RRSIDSDRRS
IEFDIESGVA NATSRSPNRR LSLQLPRQPT VAEDADNPIG EDPRTQLTSN SMISLGTDDD
NAQDTHRSDD LVPLRSRSRV WDDSPIPAPR VLPSRDYEDP STQHTIS
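
The relationship between the two sequence blocks can be spot-checked by translating the start of the gene sequence and comparing it with the start of the protein sequence. The sketch below is illustrative only. The Translation table field of this record is empty, so the standard genetic code is assumed here, and the helper `translate` and the constant names are hypothetical, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (an assumption: the record leaves "Translation table" empty).
# Codons are enumerated in TCAG order, matching the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored and a trailing partial codon is dropped.

    Raises KeyError on characters other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence and first two blocks of the protein sequence,
# copied from this record.
first_gene_line = "ATGACGCCTC GCGTCGGTCG CACGCTGCTC AGCGCGGCGG CGTCGAACGC CACGGCTGTC"
first_protein_blocks = "MTPRVGRTLL SAAASNATAV"

translated = translate(first_gene_line)
print(translated)                                           # MTPRVGRTLLSAAASNATAV
print(translated == first_protein_blocks.replace(" ", ""))  # True
```

The same function can be applied to the full gene sequence after joining its lines; a `*` in the output marks a stop codon.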