Gene CNI04150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNI04150 
Symbol 
ID3259627 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006694 
Strand
Start bp1100841 
End bp1102685 
Gene Length1845 bp 
Protein Length473 aa 
Translation table 
GC content49% 
IMG OID638258910 
Productcytosine-purine permease, putative 
Protein accessionXP_572604 
Protein GI58270896 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1457] Purine-cytosine permease and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.240841 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATTC TGGTAGAGCA CGGTGTAGAA GAACGGGGTA TTGATCCTAG ACCAGAAAAT 
GTGAGGCATT TGACATGATG TCCCCGTTCA AGATGCTGAT CCACGCAGGA ACGGGACGAG
TTGAACAAAT GGACCTATTT GCCACAATTC ACCCTTTGGG CTGCTTGGAA CACCAACATT
CTATCAGTAT GTCTGCCGCT CAAACGTCAA TAGCTGCTCT AACCGCGCTC TTCAGTTTTC
AGAGGGTGTC ATCGGACCTT CGTTGTTTGG CCTTGACTGG AAAACCAGCT GCCTGTGCAT
TGTGTTTTTT ACGGCGGCCT CCGCGCTGCC GGTTGCCTAC TGTGCAACCA ATGGTCCCAA
GACGGGTATG AGGCAAATGG TGCAAGCTCG TTACGGTATG GGGTAGGTCC TATTCTTTTT
CAATGAAGGA TATTGGCTGA CACATATGTC ACAGCTATGG CTTGGCTCTT ATCTATGGTA
TACTCAACTG CGCCACCATG ATCGGCTTCA TGGCTCTCAC TGCTATCCTT GCTGGACAGT
GTCTCGCCTT GGCTTCTAAC TCTACCATGA GCTGGGATGT CGGAATTGTT ATTGCTGCTC
TCATCGCTCT CATCGTACGT AAATTACATT CCTCTTGTCT ATCGCAGCAT TAACCCATTT
TTCCCCTAGC TTTCGTTTGT CGGACTCAAC GCTTTACACA TTGTTTCCCT TGCCTCTTTC
CCTGTCATGG TCATCCTCTA TGTCGTGCTT GCAGGTGTTG TTGGTGACAA ACTTCACCTC
GTCCAGTCCG ATGTGGCCAA GGCTGCGACC GCCGTGACAG CTAGCGGTGT TTTGGGTTAT
GGTGCCAGTT TGATCGGTTT TTCTATCACA TATACCAGTT TAGCTAGTGA CTTTGTAGGT
CTCAAGGTCC ATTATTGCCC AAGCGTCGCT AATCAGTTTT GGTCTGAAGA CAACCAGCTT
GCCTCCCCAG ACTCCAGGTT GGAAGCTTTT CCTCTGTGTC TATGTTGGCA TGGTTGTCCC
TATGATCCTT TGCCAGATGT TCGGTGCCGC CTGTCAGCTC GCAGCGTACT CCATCCCCGA
CTGGGAAACA GCGTCCAACG TTGGTGTCCC TAATCTCATC TATACCATGA CTGGCAACGG
TAACGGCGCA TCTCGATTCG TGATGGTACT TTTCAGTCTG AGTGTTGTCG CCAATACCGC
TCCTACTATT TACAGCGCCG GTCTCAGTGG TCAGGTCGCT ATCCCATGGC TTGTCCGAGG
TGCGTAATTT CACCCCTCAA ATTATTAACT GCTGTCTAAC CATCTCCTTA TAGTGCCTCG
ATACTTCCTC GCTCTCGTCG TATCTGCCAT CTACCTCCCC ATCGCCATCT GCGGCGCATC
CAAGTTCTAT TCCGCCTTGG AAAATTTTTC ATCTGTCCTT TCCTACTGGA GTGCATTGTA
CATCCCTCCG ACACTTATCG AGCCCATCCT CTTCCGAGGA CCAGTGAGTA GGAAAACTTA
TCCTGTGGAG ATCTGGAATC AGATTGGAAA GTTGCCAATC GGACTTGCCG CCATTTTCGC
CGCCATCTGT GTGAGTAAAG CTGTCATTTG TGGAAAGTGG CTGCGAACTG ATCGGATAAT
ATAGGGTATC CCTGTGGTGA CCGGTGGTAT GGCTCAGAGT TGGTGGACTG GATGGATTGC
TAGGAAGATT GAGGGAACGT GGGTACCCTT TCTAGAAGGT AGTTGTTATG CAAGGAAGCT
GACATCTTCA TTAGCGGCGA TATTGCGTTC GAGATTGGTT TCGTCGTCGT CGGTCTCATC
TACATCCCTG CTCGTTATCT CGAGAGGAAA TTTACCGGTC GATAA
 
Protein sequence
MDILVEHGVE ERGIDPRPEN ERDELNKWTY LPQFTLWAAW NTNILSFSEG VIGPSLFGLD 
WKTSCLCIVF FTAASALPVA YCATNGPKTG MRQMVQARYG MGYGLALIYG ILNCATMIGF
MALTAILAGQ CLALASNSTM SWDVGIVIAA LIALILSFVG LNALHIVSLA SFPVMVILYV
VLAGVVGDKL HLVQSDVAKA ATAVTASGVL GYGASLIGFS ITYTSLASDF TTSLPPQTPG
WKLFLCVYVG MVVPMILCQM FGAACQLAAY SIPDWETASN VGVPNLIYTM TGNGNGASRF
VMVLFSLSVV ANTAPTIYSA GLSGQVAIPW LVRVPRYFLA LVVSAIYLPI AICGASKFYS
ALENFSSVLS YWSALYIPPT LIEPILFRGP VSRKTYPVEI WNQIGKLPIG LAAIFAAICG
IPVVTGGMAQ SWWTGWIARK IEGTGDIAFE IGFVVVGLIY IPARYLERKF TGR