Gene Information |
Locus tag | CNA00670 |
Symbol | |
ID | 3253894 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 191885 |
End bp | 194717 |
Gene Length | 2833 bp |
Protein Length | 835 aa |
Translation table | |
GC content | 50% |
IMG OID | 638252400 |
Product | sulfate transporter, putative |
Protein accession | XP_566490 |
Protein GI | 58258155 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | [TIGR00815] high affinity sulphate transporter 1 |
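The coordinate fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed gene length), and the helper names `gene_length` and `coding_length` are hypothetical, not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 191885
end_bp = 194717
protein_length_aa = 835


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


length = gene_length(start_bp, end_bp)
cds = coding_length(protein_length_aa)

print(length)        # 2833, matching the "Gene Length" field
print(cds)           # 2508
print(length - cds)  # 325 bases in the gene span that do not encode protein
```

The 325 bp difference fits the "Is gene spliced | Yes" entry: the listed span is longer than the coding sequence alone, so part of it is presumably intronic or untranslated.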
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.122332 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGACAAAAA ATCGCTTCTT GGTGAATTCA CCTTTCAGTC AATCCCGCCA TATCATCTCC CATCTTAATC TGAAGACGTG CTGGCTCCCG CTGTCTTCTT CACCTTTTTA GACAAATAGA TTCGATAGAT GGGTCTTATA AAGAGTAGCC TTGGGAATTC GAATAAAGGA AAATTCGTAG ATTGTTCAAC ACCACCACTA CACAACTCGT CCTTCATTCG TATTCAGGAA GATAGTTCAT CTACGGCGGC AGCGAGAACA TCGACTCAGG AAGATCTCCA CGCCTCCATG TCTCCCGATA TCAAGCCTAG CTTCCGCCCC GAGGTGGTCA AGAGTAAAAT CAAGCACTAT TTTGGTTATA CGGAAACAAC ACCAGAGACT GTCTCCGTCT TTGACTGGGC AAGGAGTCAG ACTCCAGCTC TTGGGCCAGG AGTGAGTGAT GCATTCACTT CCCAGATGAC TGTTCTGACT ATTTTTCAGA TCAAAGCTTA TATCCTCTCT CTTTTCCCCT TCATTCAATG GGTTCCCCGA TACAACTTGA CGTGGTTGTT TGGTGACCTC GTGGCGTGAG TAGGCTTCTT CTTTCCGTGG TAATGGCTGA CCAATCATCC ACAGTGGTAT CACCGTTGGT ATGGTCCTTG TTCCCCAATC GCTTTCTTAT GCCAAAATTG CCGAACTCGA GCCCCAATAC GGTTTATACT CTTCTTTCAT CGGTGTCCTT ACCTATGCCT TTTTTGCCAC ATCCAAGGAT GTTTCCATCG GTCCGGTCGC CGTCATGTCT CTCGAGACTG GTAACATCAT TCTCAGTGTG CAAGACAAGT ACGGCGATCT TTACTCCAAG CCTGTTATCG CTACTGCTTT GGCGTTCATC TGTGGATTCA TCGTCTTGGG TATCGGATTA TTGAGGATTG GATGGCTCGT TGAGTTCAGT GAGTTCGGAG TTTTTTTCGA GTCGCCAGGA GCCGCAGTTT GCTGACATAA TCCATAGTTC CTCAACCTGC CGTCTCTGGT TTCATGACTG GTAGTGCTCT TAACATCGCC GCTGGCCAAT TCCCGGCTGT TTTTGGTCTG TCCAAGAAGT TTGACACTCG TGCTGCCACA TACAAGGTTA TCATCAACAC TCTCAAGTAC TTGCCCCAGG CATCGCTCGA CACCGCTTTC GGTATGACTG CACTTGCTGC CCTTTACGGA ATCAAGTGGG GATTTACCTG GCTCGGTAAA CGATACCCTC GCTATGGCCG AATCACTTTC TTCTGCCAAT CTCTTCGACA CGCCTTTGTC ATCATCATCT GGACCATCAT CTCTTGGCGA GTCAACGTTC ACGCCGCCTC GCCTCGCATT TCCCTCGTCG GCAATGTCCC TTCGGGTTTG CAACACGTAG GCCGACCTTT CATTGATAGC CAACTGCTTT CTGCCATCGG TCCCCACATT CCTGTTGCCA CTATCATCCT TCTTCTCGAG CACATCTCCA TTGCCAAATC TTTCGGTCGG TTGAACGGTT ATAAGATTAA CCCTAACCAA GAGCTTATTG CTATCGGTGT TAACAACACC ATCGGTACTC TTTTCTCTGC GTACCCCTCC ACCGGTTCTT TCTCTCGATC TGCCCTCAAG TCTAAGGCTG GTGTGCGCAC CCCTGCTGCG GGTCTCGCCA CCGGTGTTGT CGTCATCGTT GCCTTGTACG CAGTCGCACC GGCCTTTTAC TGGATTCCCA ACGCGGCTCT TTCTGCCTTG ATTATTCACG CCGTCGCCGA CCTTGTCGCT TCTCCCAAGC ACTCTTACAG CTTCTGGCGA GTTGCCCCCA 
TTGAATACGT GATCTTCGTT GGTGCGGTTC TTTGGTCCGT TTTTTACACC ATCGAGTCAG GTATCTATTG GTCTCTTGCC ACCTCTGTCG TTCTCTTGCT TCTTCGTATC GCTCGACCCA AAGGTCACTT CCTCGGGCGT GTACGAATCA AGCCTGAGGC TGGTAACACC CTTGAGCACA TCCGAGATGT CTACGTTCCC CTTGATGAAG AATCTTCTGG GGAAGATGTC AAGGTTGAGA ACCCTCCTGC CGGTGTCATA ATCTACCGAT TCGAAGAGTC TTTCCTCTAC CCTAACGCTT CTTATATCAA TGACCGACTT ATCGAACAGG CCAAGAAGGT AACCAGGCGA GGTGGTGACT ACTCCAAGGT TGCAGCGGGC GACCGACCTT GGAATGACCC AGGACCCAGC AAGAAGAACG CGGCGGCGGT TATAGAGGCT GACATGGTCA AGCCTGTACT CAAGGCTGTT ATCCTTGACT TTGCTGCTGT TGCCAACCTT GATACCACTG GTGTGCAAAA TTTGATCGAC ACCAAGACGG AGATGGAGAA ATGGGCCGAT GGTCCTGTCG AGTTCCACTT TTGCGGTATT CTTTCACCTT GGATCCGACG TGCTCTTGTT GCTGGTGGAT TTGGTCAAGG CCGTGCCAAG GAAGGTGCCG CTCTTGAAGT AGCTCCTGCT GTCATCGAGA ATCTTGAGAA CGCTGCTTCC CCGGGAATGG AGAGGGAGAG GTATAATGAG CATGAAGTCT CTTTCATTCA CGAGGTCGGT CAAGCTTCCA ACTCAACTTC CGCCGGTACT TCTTTCAGCG AGGAGGAGAA GAGGATTGGC TCTGGAGCCA CCACACCATC TCCTCAATTG GATGGGGTTA ACGAAAGAAG ACCGAGCGGT GTATCAACCA AGACTGTTCC ATTGTTAGAC AGGTCAACAC CTTTCTTCCA CTTTGACCTG GCCGATGCCT TGAACTCGTT GAACTTACCT GAGAATGAGT GATTTGTATG ACGTGTAAAG GGGTCTGGGG CTTATTTGTA GTG
|
Protein sequence | MGLIKSSLGN SNKGKFVDCS TPPLHNSSFI RIQEDSSSTA AARTSTQEDL HASMSPDIKP SFRPEVVKSK IKHYFGYTET TPETVSVFDW ARSQTPALGP GIKAYILSLF PFIQWVPRYN LTWLFGDLVA GITVGMVLVP QSLSYAKIAE LEPQYGLYSS FIGVLTYAFF ATSKDVSIGP VAVMSLETGN IILSVQDKYG DLYSKPVIAT ALAFICGFIV LGIGLLRIGW LVEFIPQPAV SGFMTGSALN IAAGQFPAVF GLSKKFDTRA ATYKVIINTL KYLPQASLDT AFGMTALAAL YGIKWGFTWL GKRYPRYGRI TFFCQSLRHA FVIIIWTIIS WRVNVHAASP RISLVGNVPS GLQHVGRPFI DSQLLSAIGP HIPVATIILL LEHISIAKSF GRLNGYKINP NQELIAIGVN NTIGTLFSAY PSTGSFSRSA LKSKAGVRTP AAGLATGVVV IVALYAVAPA FYWIPNAALS ALIIHAVADL VASPKHSYSF WRVAPIEYVI FVGAVLWSVF YTIESGIYWS LATSVVLLLL RIARPKGHFL GRVRIKPEAG NTLEHIRDVY VPLDEESSGE DVKVENPPAG VIIYRFEESF LYPNASYIND RLIEQAKKVT RRGGDYSKVA AGDRPWNDPG PSKKNAAAVI EADMVKPVLK AVILDFAAVA NLDTTGVQNL IDTKTEMEKW ADGPVEFHFC GILSPWIRRA LVAGGFGQGR AKEGAALEVA PAVIENLENA ASPGMERERY NEHEVSFIHE VGQASNSTSA GTSFSEEEKR IGSGATTPSP QLDGVNERRP SGVSTKTVPL LDRSTPFFHF DLADALNSLN LPENE
|
| |