Gene PICST_78097 details

Gene Information

Locus tag: PICST_78097
Symbol: NHX1
ID: 4839370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 152675
End bp: 154875
Gene Length: 2201 bp
Protein Length: 653 aa
Translation table: 12
GC content: 44%
IMG OID: 640390685
Product: monovalent cation:H+ antiporter, CPA1 family (NHX1)
Protein accession: XP_001385049
Protein GI: 150865717
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0025] NhaP-type Na+/H+ and K+/H+ antiporters
TIGRFAM ID: [TIGR00840] sodium/hydrogen exchanger 3
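
The coordinate and length fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 152675
end_bp = 154875
protein_length_aa = 653

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: one codon per residue plus a stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene record that fall outside the coding region.
noncoding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, noncoding_bp)
```

Under that assumption the span equals the listed gene length of 2201 bp. The remaining 239 bp suggests that the gene sequence includes flanking bases around the coding region, since the gene is listed as unspliced.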


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.147084
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.966822
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CCTTGTTTCA CGCGTAATCC ACAGTCTTGG ATCGTACATT ACTTCATTGT CATCATCACT 
GTCTACTTAC TTTCGACCCT CAAAAATTTT GAATAAAACT AAACACTGTA TTCTTCAGCT
TTATAATGTC CAGTATAGTA CTATTCGCTG CAAAACAGTT GGTAAGGCGA GCTCTTCCGT
CCGCTGATTT GGACGAAGAC ACTCCTGATA TGTCTACCCC AGACGACCAT CTCGATGATA
CAAATCCCGT CACCGAAGAA ATCTTCTCTT CGTGGGCACT TTTCATCTTG CTCCTGTTGC
TCTGTGCTGC GTTGTGGTCC TCGTACTTTC TCCAACAGAG ACGTATCAAA GCCATCCACG
AAACAGTCTT GTCCATCTTC TATGGAATGA TCGTAGGTCT TATCATCCGT ATCAGTCCGG
GCCACTACAT CCAAGACGCT GTCAAGTTCA ACTCAGGCTA TTTCTTCAAC ATCTTGTTGC
CTCCCATTAT CTTGAACTCC GGATATGAGC TTCACCAGGC GAACTTCTTC CGTAATCTCG
GCTCCATCTT GACGTTTGCC ATCCCGGGAA CGTTTCTTTC AGCTCTTGTG TTGGGAACCA
TATTGTACAT ATGGACATCG TTGGGTCTCG ACGGAATTTC ACTAGAGTTT GTAGATGCTC
TTGCTGTAGG TGCCACCCTT TCTGCTACAG ACCCTGTGAC CATCTTATCC ATCTTCAACG
CATACAAGGT CGATCCCAAG CTTTATACCA TTATCTTTGG CGAATCGTTG TTGAACGACG
CTATTTCTAT TGTTATGTTC GAGACTTGCC AGAAGTTCCA CGGCCACCCA GTACATTTCT
CGTCGTTCTT TGAAGGTATC GGCTTGTTCT TGATGTCCTT CACCATCTCC ACCATCATTG
GTATTCTCAT TGGTATCTTT GTAGCCTTGA TTCTTAAGCA TTCGCACATT AGACGATACC
CTCAGATTGA AACCTGTTTG GTGCTCTTAT TTGCTTATGA GTCGTACTTC TTCTCCAACG
GTGCCCATAT GTCAGGTATT GTCTCATTGC TATTCTGTGG AATCACCTTG AAGCACTACG
CTTACTTCAA CATGTCGAGA AGAACCCAAA TCGCTACCAA ATATATCTTC CAGTTGTTGG
CACAATTGTC AGAGAATTTC ATCTTTATCT ACTTGGGTTT GTCGCTTTTC ACAGAGGTAG
AATTGGTGTT TAAACCCTTA TTGATTATCA TTACTTTCAT CTCTATCTGT ATCGCCAGAT
GGTCTGCCGT ATTTCCTCTT TCTCGTTTCC TTAATTTCTT CTACAAGGCT AGATTCGAGA
AGTTCAACAC CAGAAACGCT CTCAACGGTA ACATTTCTGC CCAATCTTTG CCGGATGAGA
TCAGCCATTC GTACCAGATG ATGATTTTCT GGGCTGGCTT AAGAGGGGCA GTCGGTGTTG
CTTTAGCCAT GGGTATCCAG GGTGAGGCTA AGTGGACATT ATTGGCCACA GTGTTAGTAG
TTGTAGTATT AACCGTGATT TTGTTCGGTG GTACTACGGC TTCAATGTTG GAAATCTTGG
GAATTAAGAT TGGCTGTATC GATGAAAGTA ACGACTCTGA CGATGACTTC GATATTGAAG
CTCCTCGTTT ACCCTTGTCT AATACACCTT CGGCTTTAGG CAATAGACGC TACAACAACC
CAAGCAGAGT ACCCAAGACA CCTTATAAGG ATGATCTCAG AGGTATCTCG AACCCTAGCT
TGGCTGGCCA ATATCCTAAC TCAAACTCCA ATAGTGCCAA CAGCTTGATC GACAACAACC
TCGCCAACGA TGATGAAGAC GAAGACGAAT TTGTCACAAG TGATGTAGAC GATTTGTTGA
CTGGTTCCAA CAATATTCCG TTGTCTTCAA ACGGAGATAA CAACTTTGGT GGTGTATTGG
GAGCTTTCTT GAGTGCTGAA GAACACGCCA AATGGTTCAC TCGTTTTGAC GAACAGGTGT
TGAAACCAGT GTTGTTGGAT ACTTTGCCCA ATGGATCCAG CAACGGCAAC AACGGAAGTG
GTAGCAATAA TAGAAGAAAC CATAGAGATG GCGACGGCCA GCCTTAGTTA GAAAGTGACG
TATCATCACA TCAATTCAAG ACACGGCAGA ATGTCCATCA GCAGGAATGT CGCTTCAGCT
CGTTCTTACA TTTGTACAAT AGAGAATATA CATTAAACAT T
 
Protein sequence
MSSIVLFAAK QLVRRALPSA DLDEDTPDMS TPDDHLDDTN PVTEEIFSSW ALFILLSLLC 
AALWSSYFLQ QRRIKAIHET VLSIFYGMIV GLIIRISPGH YIQDAVKFNS GYFFNILLPP
IILNSGYELH QANFFRNLGS ILTFAIPGTF LSALVLGTIL YIWTSLGLDG ISLEFVDALA
VGATLSATDP VTILSIFNAY KVDPKLYTII FGESLLNDAI SIVMFETCQK FHGHPVHFSS
FFEGIGLFLM SFTISTIIGI LIGIFVALIL KHSHIRRYPQ IETCLVLLFA YESYFFSNGA
HMSGIVSLLF CGITLKHYAY FNMSRRTQIA TKYIFQLLAQ LSENFIFIYL GLSLFTEVEL
VFKPLLIIIT FISICIARWS AVFPLSRFLN FFYKARFEKF NTRNALNGNI SAQSLPDEIS
HSYQMMIFWA GLRGAVGVAL AMGIQGEAKW TLLATVLVVV VLTVILFGGT TASMLEILGI
KIGCIDESND SDDDFDIEAP RLPLSNTPSA LGNRRYNNPS RVPKTPYKDD LRGISNPSLA
GQYPNSNSNS ANSLIDNNLA NDDEDEDEFV TSDVDDLLTG SNNIPLSSNG DNNFGGVLGA
FLSAEEHAKW FTRFDEQVLK PVLLDTLPNG SSNGNNGSGS NNRRNHRDGD GQP
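
Translation table 12 differs from the standard genetic code in reading CTG as serine rather than leucine, and the difference is visible in this record. The sketch below translates a 30-base fragment copied from the gene sequence (the bases encoding residues 51 to 60 of the protein) under both codes. The two amino-acid strings follow the TCAG codon ordering used for NCBI genetic code tables; the `translate` function and the variable names are illustrative and not part of any library.

```python
from itertools import product

# Codons in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]

# Amino acids for each codon, in the same order ('*' marks a stop codon).
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna, amino_acids):
    """Translate dna in frame 0 using the given 64-letter code string."""
    table = dict(zip(CODONS, amino_acids))
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Fragment of the gene sequence covering protein residues 51-60.
fragment = "GCACTTTTCATCTTGCTCCTGTTGCTCTGT"

with_table_12 = translate(fragment, TABLE_12)
with_standard = translate(fragment, STANDARD)
print(with_table_12)
print(with_standard)
```

With table 12 the fragment gives ALFILLSLLC, matching the listed protein sequence; the standard code would give ALFILLLLLC, placing a leucine where the record has a serine.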