Gene CNC01920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC01920 
Symbol 
ID3256131 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp521279 
End bp523892 
Gene Length2614 bp 
Protein Length524 aa 
Translation table 
GC content47% 
IMG OID638255412 
Productcytosine-purine permease, putative 
Protein accessionXP_569440 
Protein GI58264568 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1457] Purine-cytosine permease and related proteins 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TACGGTTTTA CTCTCATCTG ATCTCCAGAT AAGAAAAAGT GTAGTATCGT AACTATCTTG 
ACTTTTGATC CGATCGTGTC GGAACTCTAG TTCTGATCTT ACTAGTTATC TCTCTCTTCA
CCGCCCTTAT CTCGACGCAC ATTTGGCAGA CAATTACCTG CTTCACTCGA CGGAGCGGCG
ATACAGTGTT ACGATCCACA GGATATCAGC TTGAACACAG AAGGACAGCG CTTGCCTAAT
ATAAAGAAGG TGATTACTTT AGGCTTCCCT CACGTCTTTT CATGCTTACA ATCGATCTTT
TGGCCGTTCC ATAGCTCAAA TACGGCATTG CATATATCGC GCCATTTCAG ACCTCAACAT
GTCAGATATC GAGAAGGGAC TCCAGCCTCT TGATGCCCCT AAGGTTTCAG ACTCATACGA
GGGTCTCCCT ACCGTTGATG CGGGAGTTTA TTCTGGAAAG CACAACGCTG GAGAACCGAC
CAGTCGCTGG GCGAAATTTG ATGAGCTGAA TAGGAAGTTG GAGCACAAGA TGGGTATTGA
GTCGGTGAGT ATACGAAAAC GAGGCCAGCG ATGTATAGTC CATGGTCTGA CTCCACGATA
AGAGAGGCAT CGAACGCGTT TCTGAGTCTG ATCGTACGGA TACGCGACTT GTGAGTTACA
CTTTGCGATT GTTACTGAAA TCCTTCTTAA CCATCAATTT CCCGGTTTAG CATGGAAACC
TTTTCATATG GGCAAGTGCC AACACGGTGC TGCCTACGCT AGGTAAGGTC ATTCAGCTTT
GAGGCTCCAT AAGGCTGATC AGGACTGGGC AGGCGTCGGT ATCCTTGGTC CACTCCTTTT
TGGCCTTGGC CTGGGAGATT CGATGTTGTC AGTCTTTTTC TTCAACGCTG CTACCGCCTG
CATTCCGGCT TTCATGTCTA CTTTCGGACC CAAACTTGGC CTTCGTCAAA TGACTTCAGC
GAGGTACTCG TGGGGTTTCT GGGGGGCAAA GTTAGTAGCC TTACTCAACT GCATCGCGTG
TGTTGGCTGG TCCATCGTAA ACACCATATC TGGCGCTCAA ACCCTTGTGG CGGTCTCCGA
ATACAAGATC TCCGCTGCGG TGGGCGTTGT AATTATTGCC CTCGTCACAC TTTTCATTGG
CCTCTTTGGC TATCGATTTG TACACCAATA TGAAAGATAC TCTTGGTTAC CTACTTTCAT
CACATTCCTT GTCATGCTTG GTGTGTCGGC GAAGCATTTG GCAAATGTGC CTTGGGGAGT
TGGTCAGGCA GAAGCCGCCG GAGTGTTATC CTTCGGTGGA ACCATATGGG GTTTTGCCAT
CGGTTGGTCT TCACTTTCAA GCGATTTCAA TGTCTATATG CCAGCGGAAG CCAAGAGCTG
GAAGGTTTTC GCCTGGACGT ATACGGGATT GATCTTTCCT CTTGTGCTGG TTGAATGGCT
CGGTGCTGCT ATAGGTTGCG CTGCTTTGGT CGTCACCGAC TGGAGTGACG CCTATCACGA
GCATGAACTT GGTGGTCTTG TTGGTGCCGT ATTCAGTAAG TGACATATCT AAGAAGACGA
TGCTCGCTGA CTGTATATTA GTCCCTTCTA TGCACAACGG AGGGAAGTTT TTCATGACTC
TTCTAGTGCT ATCTGTCGTC GCCAATAAGT ATGTAGTCTC TTTTTTATAT CATCCTTTCC
ACTGAGCTAA CTGTACAATG AAAGTACCGT GAATGTGTAC TCTATGGGTT TGAGTGTATC
TGTGATTTCC AACTGGTTGG CTGCTATTCC TCGACTTGTG TGGCCATGCG TGATTACTGC
CATTTATATC CCAGTAGCTA TCGTGGGTGC AAGCTCATTT GCTACCTCTC TCGAAAACTT
CCTCAACGTC CTCGGATACT GGCTTTCTAT CTATGCTACT GTGGTAATTG AAGAACATTT
CATCTTCCGC AAAGGGCGGT ATGAAAATTA CGAAGCGGCC AGCACTTGGA ACAGATCTGA
CAGATTGCCT GTGGGATTTG CCGCTATCGC TGCCGGCTGC TGTGGAGCTG CTGGTGCTGT
TTTAGGCATG GCTCAGGCGT GGTTCACTGG TCCCAGTAAG TTCTTTCATA TTGTTAAGAC
TTGACTAACA TCTCGTATCC CACAGTTGGT AAAAAGGTCG GCGGCACGGC TGACCCGTCC
GGTGGTGATA TTGGATGGCT CTTAGCATTC GTGAGTCGCA AGCTATCTGG GCGAACTAAT
AACAATCAAT GCTTATATGT TCGTATAGGC CTTCACTGGT GTTACTTATC CTGCGTTCCG
AGTACTCGAA AAGAAATGGC TCCGTCGATA AGGTCTGCCG ACTATAGATT TCAATTTCCG
TAATTTTGCT TTAGATTGTT TTACAGTCAT TAGATCAGTA CTTATCCTGC GTTCCCAATA
CTCAAATAGA AAAGGCCCCG TCGACAAGAC CCGCCGACTA TGTCTTCAAT TTTGGTAGTT
TTGTTTTAGA TTTGTTTTAC CATTATTAGA TCAGTTGTTG AAGGGTGAGA TGCCTGCCTG
TGAAGGCCTA ATGACTTTAC GGATATCCAG GGATTGAGGA GTATAGGGTG GGGATGTCGA
AGCATAAATC TGATATGCAT TGACATAAGC TCTT
 
Protein sequence
MSDIEKGLQP LDAPKVSDSY EGLPTVDAGV YSGKHNAGEP TSRWAKFDEL NRKLEHKMGI 
ESRGIERVSE SDRTDTRLHG NLFIWASANT VLPTLGVGIL GPLLFGLGLG DSMLSVFFFN
AATACIPAFM STFGPKLGLR QMTSARYSWG FWGAKLVALL NCIACVGWSI VNTISGAQTL
VAVSEYKISA AVGVVIIALV TLFIGLFGYR FVHQYERYSW LPTFITFLVM LGVSAKHLAN
VPWGVGQAEA AGVLSFGGTI WGFAIGWSSL SSDFNVYMPA EAKSWKVFAW TYTGLIFPLV
LVEWLGAAIG CAALVVTDWS DAYHEHELGG LVGAVFIPSM HNGGKFFMTL LVLSVVANNT
VNVYSMGLSV SVISNWLAAI PRLVWPCVIT AIYIPVAIVG ASSFATSLEN FLNVLGYWLS
IYATVVIEEH FIFRKGRYEN YEAASTWNRS DRLPVGFAAI AAGCCGAAGA VLGMAQAWFT
GPIGKKVGGT ADPSGGDIGW LLAFAFTGVT YPAFRVLEKK WLRR