Gene OSTLU_40732 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_40732 
Symbol 
ID5005769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp422440 
End bp423612 
Gene Length1173 bp 
Protein Length391 aa 
Translation table 
GC content56% 
IMG OID640421190 
ProductCPA1 family transporter: sodium ion/proton 
Protein accessionXP_001421659 
Protein GI145354790 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0025] NhaP-type Na+/H+ and K+/H+ antiporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00131155 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTGTGGGC TACCGTCGTT GACTGGAATG TTGTTGTGCG GGTTGCTGCT GAGAAACGTG 
CCGGGAGGCC TGGTGAACGA TTTGCCAGAG CGGTGGTCGA GCGACATTCG CGCCGCCGGT
TTGAGCGTGA TTTTAATGCG GAGCGGGTTA GAGTTAGATT TGGACGCGTT TCGCTCGATA
GGATGGATGG CGAGTCGATT GACCGTCATG CCAGGGCTCT CTGAAGCGAT CGCGTGCGGG
TTATTTTCGA TGCTGATATT TAAAATGTCG TTCCCGCTCG GGATGTGCTT GGGGTTCATC
CTGGGCGCGG TGTCGCCCGC GGTGGTGGTG CTCGGCATGT TTGAGTTACA ATCGCTCGGT
TACGGCGTGG CGAAGGGAAT TCCGTCGTTA GTCGTCGCGG CGGCGTCGTT TGATGACGTC
GTAGCGATCA CGGGGTACAC GATCTTCAAG TCGTTCGCGC TAGGCAGCAA GGGACACATG
GCGTGGACGA TTTTGCACGG TCCCGTGGAC GTGTTGGCGG GTCTGTGCGT CGGCACGCTC
GGGGGGGTTA TTTGCGGTAT GACGAAAATT TGGAACGAGC GTTGGAAGAG ATCGTCCATG
GTTATGATTT TAGGGTTGTT TACGATGTTC TTAGGTCGAC ATTACGGGTT CAACGGTGGT
GGCGCCATGT CCGCGCTCGC TTTGGGCATC TCGGCCAATA AGTGTTGGCG CACGAACAAG
CCGCACCCGG CGCTGACAAA CGGACCTTCC GACGATCACG CGCACGGTGT GGAGACTGAT
TTGGCCAAGT TATGGCGTTT CATATTTCAG CCCTTGCTGT TCGGCGTCAT CGGCACGGCG
GTGAGTTTCA AAGACGTCAC GCCGTCGACG ATTCCCAAAT CCATCGGTCT TCTCCTCATC
GGTATTTGCA TTCGTTTGCC CATGGCTTAT GTAGCCGTGG GCGGCGGTGA GCTATCGCGA
ATCGAGCGCG CGTTTGTGTC TTTGGCATGG ATCCCCAAAG CCACCGTCCA AGCTGCGCTG
GCATCTGATC CGCTGGATTA CATCATAGAG AAGAAAAAAT CCGCCGAGTA CGTGTCGTGG
GGCAACGACA TTCTCACTAC GGCGGTATTT TCAATCATTT TAACCGCGCC GCTCGGAATG
GTCATCATCG CCACGCTCGG GCCGAAGTGG TTA
 
Protein sequence
MCGLPSLTGM LLCGLLLRNV PGGLVNDLPE RWSSDIRAAG LSVILMRSGL ELDLDAFRSI 
GWMASRLTVM PGLSEAIACG LFSMLIFKMS FPLGMCLGFI LGAVSPAVVV LGMFELQSLG
YGVAKGIPSL VVAAASFDDV VAITGYTIFK SFALGSKGHM AWTILHGPVD VLAGLCVGTL
GGVICGMTKI WNERWKRSSM VMILGLFTMF LGRHYGFNGG GAMSALALGI SANKCWRTNK
PHPALTNGPS DDHAHGVETD LAKLWRFIFQ PLLFGVIGTA VSFKDVTPST IPKSIGLLLI
GICIRLPMAY VAVGGGELSR IERAFVSLAW IPKATVQAAL ASDPLDYIIE KKKSAEYVSW
GNDILTTAVF SIILTAPLGM VIIATLGPKW L