Gene CNC01940 details

Gene Information

Locus tag: CNC01940
Symbol:
ID: 3256266
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 533401
End bp: 536011
Gene Length: 2611 bp
Protein Length: 599 aa
Translation table:
GC content: 48%
IMG OID: 638255414
Product: monosaccharide transporter, putative
Protein accession: XP_569444
Protein GI: 58264576
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00879] MFS transporter, sugar porter (SP) family
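
The Start bp, End bp and Gene Length fields agree with each other if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the arithmetic, using a hypothetical helper named span_length:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates.

    span_length is a hypothetical helper for illustration only;
    it is not part of any database API.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(533401, 536011)
print(gene_length)  # 2611, matching the Gene Length field
```

Under a 0-based, half-open convention the same coordinates would give 2610, so the listed length of 2611 bp supports the inclusive reading.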


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GGAAGAGAAG CGGAGAAACG TCTCGTCAGG GTCTGACGAA AAGTGGGGTT TACCTGTATC 
TGAGCATGCG GGGAAGAGCT AGCGGCGAGC TCGAAAACAA TCGATTGCTT TCCCCCTGTC
AACATATAAA CACCCAAAGC CAGCAAAATA CTTTTGTTTT TTCTTCTTCA TCTGTTTGGT
TTTTCTATAT CTTTACTCTT GTTCATCTTA ATTTCAGCCT AGTAAAAAAG TTTTGAGGAC
ATCCTTTCGC CTATCGCATC TCAACCCACG CAACATGTCG AGCAGCGAGA AAGGCCCAGT
AGTCGGGCCA GGCAAGATTG TTACTTATGG TCGCCTTCAC GGTAATGCCT TGCTCTATGC
CATTGTTGGC ACCGCCACTA CAGGCTTCAG TCTGTTTGGC TATGTACGTC CTTCTCTTTT
ATGATACCAC ATGAATGTGA TGACTGACTA TTATGTTTAG GACCAAGGTA TGTATCACTG
TGGATGGACT CTTGGACTTG GGGGCTTAAC TGAACTTACC CCGCATAGGT CTCATGTCTG
GTATCATTGC CTCAGATCAG TTCAACACCG AATTTCCTGC CGTACGTCAT ATTCTGATTA
AGCCGACGTT ATGCTAACTT TTGTTGGTGA TAGACTTTTC AACACGACGT CAACGATGTG
CATGCCGGTA CCGTTCAAGG TAGTGTGACC TCCTGTTACG AAGTTGGTTG CTTTCTGGGT
GCCCTTTTCG CCTACTTCTT TGGCGAACGT ATGGGCCGTA GACGGGTCAT GCTTATGGGC
GCTGTTATTA TGATCATTGG AACCATCATC TCTGTGTGCG CCTTTGGTCC TGGAGATCCC
CGAGGCAATG TCGGCGGTTT CGTTCAGTTC ATCGTCGGTC GTGTGATCAC TGGTGTCGGG
AATGGTGCGA ATACAGCGAC CATTCCTTCC TGGGTTGCCG AAAGTTCCAA AGCGCATAAC
CGTGGCTTCC TGATTTGTAT GGAAGCTTCT ACAGTCGCTG TTGGTACTGT CATCGCCTAT
TGGATCGATT TCGGTCTTTC TTTCGTGAAC GTGAGTGGGA TCTTGCCGTG ACGAATGTTT
TCAAGTGTTT CGCCGCTAAC TTCACGCCAT GATTTTAGAG TTCCGTGTCT TGGCGATTCC
CCATTGCCTT GCAAATCCTT TTCGCTCTTG TCCTTATCGG TGGTGTTATG GTTCTTCCCG
AGTCGCCCCG TTGGTTGATC GCCCATGGCT ACGACCACGA GGGTTTAAGA GTCATTGCTG
CTCTTGACTC TAAAGCTGAA GATGACCCCG TCGCTATTGC GGACAAGAAC AAGGGTAAGC
ACAATGCAGT AGGACGCTAC GTGGATCAGG CTGATATTTT GAAGTTTCCG ATGCCATTGC
TGCCCAACAA AATGCCAAGG CTAGCAGGAA GAGGGACATC TTGAAAGGTG GCAAGAACCA
ACACTTACGA AGAGCCATGG TTGGTGCCTC CACGCAGCTT TTCCAACAAA TTGGTGGTTG
TAATGCGTAA GTGACTCCTC CACAAGTAGC ATGTTTACCT AACGGCGTTT AATATAGTGT
CATTTATTAT TCTACAGTCT TGTTTGAGAA CCAAATCGGT CTTGATAGCA CCCTCGCTCT
CATTTTAGGC GGTGTTCTAT CAATCGTTTA CGCCATATTT GCCCTCACGT CCTTCTTCCT
TGTTGAGCGA GTTGGTCGTC GAAAGCTTTT CCTTATTGGT ACTTTTGGAC AAGCCGCTGC
CATGTTCATC ACTTTTGGAT GTCTCCTTCC TGGTGATGCT CAATCGGCCA AGGGTGGCGC
TTTCGGCCTT TATCTTTTCA TTGCGTTCTT CGGTGCCACT TGGCTTCCTC TCCCTTGGCT
TTACCCTGCC GAACTCAACT CTATGGCGGT TAGGACCCAG GCAAACGCTA TTTCAACTAT
GGTCAACTGG CTTTTCAACT TCACTGTCGT ACAGGTTTTG CCAACAATGA CGGCCTCCAT
TGGCGCTTAC ACTTTCCTGT TCTTCGCGTG TATTAACTGT GTATTCTTGC CATTCATTTA
CTTATTCTAT CCAGGTATGT GACTCCCTTC TCCTCGTAAG GTTAGTCCTA ACGTCATTAC
AGAGACTACT GGACGAACTT TGGAAGAGCT CGATGTTATC TTTGCTCATG CTCATCTTAC
GTATGTTGAG TTCATCGGTC ATCGATTGCC GGCTAACCTG AGAAATAGCG AGCGTCGTCC
TACTCTTGTG GCTGCTGAGC TCCCCAAGCT CACTGATTTC CAAGTCCAAG AAATGACTGA
CCGGTATGAC ATCCATGGCG GTGCCGCCGA CACTGAAAAC CCCTCTGCCT ATGGCGCTCC
TATTAATGCC GGTGCCCCTG ACACCTCCCT CCCTCCCAAG CACCCTCAGG ATGACCCCAA
CTACTATCCT GATGGTAGTC GCAGACCTTC AGGGGGCACC GCTGAGAGCG GCGAGCAGAC
TAGGGTGACT ACTCCTTCCG GCGAGAAGGC TGGTGCCACT CCACCATCGT AAGGGAATAC
TGTATAGAGA TGTAAAGAAG ATGCGAGAGT GGGCTTCATA TAATAGAGAT CAAACTAGAA
AGTAAGATCT GTCACTGTAG CACTTTATCG A
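
The gene sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence can be measured or compared with the Gene Length and GC content fields. The following is a rough sketch of one way to do that. The function names clean_sequence and gc_fraction are hypothetical and chosen only for this illustration, and the snippet uses just the first line of the sequence as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C (seq must be non-empty)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record as sample input.
first_line = "GGAAGAGAAG CGGAGAAACG TCTCGTCAGG GTCTGACGAA AAGTGGGGTT TACCTGTATC"
seq = clean_sequence(first_line)
print(len(seq))  # 60 (six blocks of 10 bases)
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Applying the same two functions to the full listing would give a length and a GC fraction to set against the Gene Length and GC content values in the Gene Information section.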
 
Protein sequence
MSSSEKGPVV GPGKIVTYGR LHGNALLYAI VGTATTGFSL FGYVRLMSGI IASDQFNTEF 
PATFQHDVND VHAGTVQGSV TSCYEVGCFL GALFAYFFGE RMGRRRVMLM GAVIMIIGTI
ISVCAFGPGD PRGNVGGFVQ FIVGRVITGV GNGANTATIP SWVAESSKAH NRGFLICMEA
STVAVGTVIA YWIDFGLSFV NSSVSWRFPI ALQILFALVL IGGVMVLPES PRWLIAHGYD
HEGLRVIAAL DSKAEDDPVA IADKNKVSDA IAAQQNAKAS RKRDILKGGK NQHLRRAMVG
ASTQLFQQIG GCNAVIYYST VLFENQIGLD STLALILGGV LSIVYAIFAL TSFFLVERVG
RRKLFLIGTF GQAAAMFITF GCLLPGDAQS AKGGAFGLYL FIAFFGATWL PLPWLYPAEL
NSMAVRTQAN AISTMVNWLF NFTVVQVLPT MTASIGAYTF LFFACINCVF LPFIYLFYPE
TTGRTLEELD VIFAHAHLTE RRPTLVAAEL PKLTDFQVQE MTDRYDIHGG AADTENPSAY
GAPINAGAPD TSLPPKHPQD DPNYYPDGSR RPSGGTAESG EQTRVTTPSG EKAGATPPS