Gene CNA00670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA00670 
Symbol 
ID3253894 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp191885 
End bp194717 
Gene Length2833 bp 
Protein Length835 aa 
Translation table 
GC content50% 
IMG OID638252400 
Productsulfate transporter, putative 
Protein accessionXP_566490 
Protein GI58258155 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.122332 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGACAAAAA ATCGCTTCTT GGTGAATTCA CCTTTCAGTC AATCCCGCCA TATCATCTCC 
CATCTTAATC TGAAGACGTG CTGGCTCCCG CTGTCTTCTT CACCTTTTTA GACAAATAGA
TTCGATAGAT GGGTCTTATA AAGAGTAGCC TTGGGAATTC GAATAAAGGA AAATTCGTAG
ATTGTTCAAC ACCACCACTA CACAACTCGT CCTTCATTCG TATTCAGGAA GATAGTTCAT
CTACGGCGGC AGCGAGAACA TCGACTCAGG AAGATCTCCA CGCCTCCATG TCTCCCGATA
TCAAGCCTAG CTTCCGCCCC GAGGTGGTCA AGAGTAAAAT CAAGCACTAT TTTGGTTATA
CGGAAACAAC ACCAGAGACT GTCTCCGTCT TTGACTGGGC AAGGAGTCAG ACTCCAGCTC
TTGGGCCAGG AGTGAGTGAT GCATTCACTT CCCAGATGAC TGTTCTGACT ATTTTTCAGA
TCAAAGCTTA TATCCTCTCT CTTTTCCCCT TCATTCAATG GGTTCCCCGA TACAACTTGA
CGTGGTTGTT TGGTGACCTC GTGGCGTGAG TAGGCTTCTT CTTTCCGTGG TAATGGCTGA
CCAATCATCC ACAGTGGTAT CACCGTTGGT ATGGTCCTTG TTCCCCAATC GCTTTCTTAT
GCCAAAATTG CCGAACTCGA GCCCCAATAC GGTTTATACT CTTCTTTCAT CGGTGTCCTT
ACCTATGCCT TTTTTGCCAC ATCCAAGGAT GTTTCCATCG GTCCGGTCGC CGTCATGTCT
CTCGAGACTG GTAACATCAT TCTCAGTGTG CAAGACAAGT ACGGCGATCT TTACTCCAAG
CCTGTTATCG CTACTGCTTT GGCGTTCATC TGTGGATTCA TCGTCTTGGG TATCGGATTA
TTGAGGATTG GATGGCTCGT TGAGTTCAGT GAGTTCGGAG TTTTTTTCGA GTCGCCAGGA
GCCGCAGTTT GCTGACATAA TCCATAGTTC CTCAACCTGC CGTCTCTGGT TTCATGACTG
GTAGTGCTCT TAACATCGCC GCTGGCCAAT TCCCGGCTGT TTTTGGTCTG TCCAAGAAGT
TTGACACTCG TGCTGCCACA TACAAGGTTA TCATCAACAC TCTCAAGTAC TTGCCCCAGG
CATCGCTCGA CACCGCTTTC GGTATGACTG CACTTGCTGC CCTTTACGGA ATCAAGTGGG
GATTTACCTG GCTCGGTAAA CGATACCCTC GCTATGGCCG AATCACTTTC TTCTGCCAAT
CTCTTCGACA CGCCTTTGTC ATCATCATCT GGACCATCAT CTCTTGGCGA GTCAACGTTC
ACGCCGCCTC GCCTCGCATT TCCCTCGTCG GCAATGTCCC TTCGGGTTTG CAACACGTAG
GCCGACCTTT CATTGATAGC CAACTGCTTT CTGCCATCGG TCCCCACATT CCTGTTGCCA
CTATCATCCT TCTTCTCGAG CACATCTCCA TTGCCAAATC TTTCGGTCGG TTGAACGGTT
ATAAGATTAA CCCTAACCAA GAGCTTATTG CTATCGGTGT TAACAACACC ATCGGTACTC
TTTTCTCTGC GTACCCCTCC ACCGGTTCTT TCTCTCGATC TGCCCTCAAG TCTAAGGCTG
GTGTGCGCAC CCCTGCTGCG GGTCTCGCCA CCGGTGTTGT CGTCATCGTT GCCTTGTACG
CAGTCGCACC GGCCTTTTAC TGGATTCCCA ACGCGGCTCT TTCTGCCTTG ATTATTCACG
CCGTCGCCGA CCTTGTCGCT TCTCCCAAGC ACTCTTACAG CTTCTGGCGA GTTGCCCCCA
TTGAATACGT GATCTTCGTT GGTGCGGTTC TTTGGTCCGT TTTTTACACC ATCGAGTCAG
GTATCTATTG GTCTCTTGCC ACCTCTGTCG TTCTCTTGCT TCTTCGTATC GCTCGACCCA
AAGGTCACTT CCTCGGGCGT GTACGAATCA AGCCTGAGGC TGGTAACACC CTTGAGCACA
TCCGAGATGT CTACGTTCCC CTTGATGAAG AATCTTCTGG GGAAGATGTC AAGGTTGAGA
ACCCTCCTGC CGGTGTCATA ATCTACCGAT TCGAAGAGTC TTTCCTCTAC CCTAACGCTT
CTTATATCAA TGACCGACTT ATCGAACAGG CCAAGAAGGT AACCAGGCGA GGTGGTGACT
ACTCCAAGGT TGCAGCGGGC GACCGACCTT GGAATGACCC AGGACCCAGC AAGAAGAACG
CGGCGGCGGT TATAGAGGCT GACATGGTCA AGCCTGTACT CAAGGCTGTT ATCCTTGACT
TTGCTGCTGT TGCCAACCTT GATACCACTG GTGTGCAAAA TTTGATCGAC ACCAAGACGG
AGATGGAGAA ATGGGCCGAT GGTCCTGTCG AGTTCCACTT TTGCGGTATT CTTTCACCTT
GGATCCGACG TGCTCTTGTT GCTGGTGGAT TTGGTCAAGG CCGTGCCAAG GAAGGTGCCG
CTCTTGAAGT AGCTCCTGCT GTCATCGAGA ATCTTGAGAA CGCTGCTTCC CCGGGAATGG
AGAGGGAGAG GTATAATGAG CATGAAGTCT CTTTCATTCA CGAGGTCGGT CAAGCTTCCA
ACTCAACTTC CGCCGGTACT TCTTTCAGCG AGGAGGAGAA GAGGATTGGC TCTGGAGCCA
CCACACCATC TCCTCAATTG GATGGGGTTA ACGAAAGAAG ACCGAGCGGT GTATCAACCA
AGACTGTTCC ATTGTTAGAC AGGTCAACAC CTTTCTTCCA CTTTGACCTG GCCGATGCCT
TGAACTCGTT GAACTTACCT GAGAATGAGT GATTTGTATG ACGTGTAAAG GGGTCTGGGG
CTTATTTGTA GTG
 
Protein sequence
MGLIKSSLGN SNKGKFVDCS TPPLHNSSFI RIQEDSSSTA AARTSTQEDL HASMSPDIKP 
SFRPEVVKSK IKHYFGYTET TPETVSVFDW ARSQTPALGP GIKAYILSLF PFIQWVPRYN
LTWLFGDLVA GITVGMVLVP QSLSYAKIAE LEPQYGLYSS FIGVLTYAFF ATSKDVSIGP
VAVMSLETGN IILSVQDKYG DLYSKPVIAT ALAFICGFIV LGIGLLRIGW LVEFIPQPAV
SGFMTGSALN IAAGQFPAVF GLSKKFDTRA ATYKVIINTL KYLPQASLDT AFGMTALAAL
YGIKWGFTWL GKRYPRYGRI TFFCQSLRHA FVIIIWTIIS WRVNVHAASP RISLVGNVPS
GLQHVGRPFI DSQLLSAIGP HIPVATIILL LEHISIAKSF GRLNGYKINP NQELIAIGVN
NTIGTLFSAY PSTGSFSRSA LKSKAGVRTP AAGLATGVVV IVALYAVAPA FYWIPNAALS
ALIIHAVADL VASPKHSYSF WRVAPIEYVI FVGAVLWSVF YTIESGIYWS LATSVVLLLL
RIARPKGHFL GRVRIKPEAG NTLEHIRDVY VPLDEESSGE DVKVENPPAG VIIYRFEESF
LYPNASYIND RLIEQAKKVT RRGGDYSKVA AGDRPWNDPG PSKKNAAAVI EADMVKPVLK
AVILDFAAVA NLDTTGVQNL IDTKTEMEKW ADGPVEFHFC GILSPWIRRA LVAGGFGQGR
AKEGAALEVA PAVIENLENA ASPGMERERY NEHEVSFIHE VGQASNSTSA GTSFSEEEKR
IGSGATTPSP QLDGVNERRP SGVSTKTVPL LDRSTPFFHF DLADALNSLN LPENE