Gene CNN02420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN02420 
Symbol 
ID3255373 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp747918 
End bp750536 
Gene Length2619 bp 
Protein Length550 aa 
Translation table 
GC content49% 
IMG OID638254651 
Productreceptor, putative 
Protein accessionXP_568727 
Protein GI58262634 
COG category 
COG ID 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.915792 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCACTTTTGG TGGGTAAAAT AGGGCGCCAT GGGGGCCTGA GCAGGCATCT GTCATCGACA 
TTCGGGATGC TCTCCGGACG GACAGATCAC ATCCGGCTCT CTCGGCTGGA CCTCTATGAT
AAGTTCAAGG CTCGAGGGCG GATCACTTCG CCTTCCCTCT GTCAACGTAC ATTAACAGAT
ATAAGAGATA TAACAGATAT ATTTGTTGCG GCCCAATCTT CCAACCACCT ATCAACAATC
TAAATCGTCT CAGGCAACCT AGCTCTAGCT TGACAATACA TCTTATACAC ACGTAAATTT
TATCCACTAC TACCAAACCA AAACAACAAC AATGGGCGGT GGTGCTGCTG TTGGTGGAGG
CTTTGATGCT CTCCTCACTC AAAACCAGGA TCGAGGCTTT CGAGGTCTCT TCAAGAACCG
CCGAGCCCTT GGTCTGGCTT GTTTCGCCTC TCTCGGTGGT GTCCTCTATG GTTACAACCA
GGTATGTTCA ACCATTACTC GATCGCCTGC TATTTGCCCA TACTCTCTAT TGCAGGGCGT
TTTCGGTCAA GTCCAAGTCA TGTACAGCTT TAAGGAGCGA TACACCGCTA CTGTAAGTTT
CGTTACCTCC TTTTACTCTG CATTTTTTGA CTTTCCCTCT GCAGTTGACA AACACCGACA
CCAAGGGTCT TTTGACTGCT ATTCTCGAAC TTGGTGCTTT CCTCGGTGCC CTTATGGCTG
GTCCTTTATC TGACAAGTTT TCCCGAAAAG TAAGCTAATG TTCCTTTTCC TCCTTCCTTT
TCCGCTGCTA ACATGTTCCT CAGTACTCCA TTTCCGCTTG GTGTATCGTC TTTATGATGG
GTACTGCTGT TCAAACTGGT GCCAACTACA ATATCGCGTG CATTTACGGT AAGTAAAATT
CTGCGCGTTT ACTCTGGTGA TGACGTCAGT GCTCACACTT GGGTTCAGCT GGTCGATGGT
TCGCAGGTAT GGGTGTTGGC GCTCTTTCTA TGCTTGTTCC TATGTTCAAT GCCGAGTTGG
CACCTCCTGG TATTCGGGGT TCTTTGGTCG CTCTTCAACA ACTGGCCATT ACTTTCGGTA
TCATGATTTC TTACTGGATC GGTTACGGTA CTAACTGTGA GTAACTTTTG TCCAAGTATA
GTGCGAGTAC TAGCTGACAC GGAAACTAGA CATTGGCGGT ACTGGTGCTG GCCAGAGTAC
CGCCGCTTGG CGAGTTCCCC TCGGTATCCA GCTCGTTCCT GCTATTGTTC TTTGTATCGG
TTCTTGCTTC CTTCCCTTCT CCCCTCGATG GCTCATGCTT AAGGGTAAGC GCTTGTCTCC
TGCTTTCAAT CGACATATTG ATCCTCGTAT AGGTCGTGAG GAGGAATGTC TTACGAACTT
GGCTAAGCTC CGAAACTCTA CCGAGGATGC CCCCGAAATC CAGTACGAGT TCCGTGCCCT
TCAGGCTGAG CGTCTTGTTG AGCGTGAGGC CGCCAAGGAG CGATACGGCC AGGAAGACGT
CAACTTCCGA GTCACATTGG CCGAGTACAA GAGGTTGTTC ACTACCAAGC CCCTTCTTCA
CCGACTGATG CTTGGTGCTG GTTGCCAGAC TTTGCAACAA TGGACTGGTA TGAACGCCAT
CACCTATTAC GCTCCTACCA TCTTCGAGCA GATCGGTCTT AGTGGTGCCG GTGCTGGTGG
TACTATCAGT CTTTTGGCCA CTGGTATCAT TGGTATCGTC AAATTTGTCT TCACTATCCC
TGCCGTCCTT TTCGTCGACA ACGTAAGTAC TTCTTTTGCA ATGACTCATC ACATTTGCAG
TAACTGACAG CCCGTTCTAG TTCGGTCGAA AGCCTCTCCT TGCCTGGGGT GAAGCCAACA
TGGCCATCTC TCACGCTATC ATTGCCGCTA TAGTTGCCGT CTACGGTGAC AAGTTCGATA
CCCACAAATC GGCTGGTAAT GCCGCTGTTT TCTTCGTAAG TCGCATGATA CCAACAGATA
ATCGAGCAAG GGCTAACGTT TCGTTTCAGA TTTACTGGAT ATCTGCCAAT TTCGCCTGCA
CCTGGGGTCC CTTGGCGTGG GTTGTATCCT CTGAAGTCTT CCCTCTCGAC ATGCGTGGTA
AGGGTATGAG TGTCTCTTCC GGTGCCAATT GGATTGTAAG TCCAAGTCTT CTTCAGTATG
TCGCTTTTGC AAACCTCGCT AATCGAATCG CCATAATAGA TGAACTTCAC GGTCGCCATG
ATTACCCCTC ACATGATCGG AAGCATTGGC TATAAGACCT ACATCGTCTT CATGTGCTTC
TGTATTGTCG GCTTCTTTTT CTCCATATTC ATCCTTCCCG AACTCAAGGG TCTCTCTCTC
GAAGAGATCG ACAACGTCTT CAACGACGAT TCTGGTGTTG AGGACCGAGC ACGACGAGAG
CGTATTGCAG CCCAGATTGG TTTGGACAAG GTCGCCGACC AGATCCAGCA TCAGGAAAAG
GTCGATGATT CTGAAGTTTA AAGAGGTGTT GTTTTATGCT CCCCTTCTTT TCCGAGCTTC
GCAGTTATAG TGGCATCTTA TCAAGTTGGG GTAGGGTCTT CATACTAGTA ATGTATCATT
AGCATCTACG TTCTCATTAT CTCAATCCGC GCAGCCATG
 
Protein sequence
MGGGAAVGGG FDALLTQNQD RGFRGLFKNR RALGLACFAS LGGVLYGYNQ GVFGQVQVMY 
SFKERYTATL TNTDTKGLLT AILELGAFLG ALMAGPLSDK FSRKYSISAW CIVFMMGTAV
QTGANYNIAC IYAGRWFAGM GVGALSMLVP MFNAELAPPG IRGSLVALQQ LAITFGIMIS
YWIGYGTNYI GGTGAGQSTA AWRVPLGIQL VPAIVLCIGS CFLPFSPRWL MLKGREEECL
TNLAKLRNST EDAPEIQYEF RALQAERLVE REAAKERYGQ EDVNFRVTLA EYKRLFTTKP
LLHRLMLGAG CQTLQQWTGM NAITYYAPTI FEQIGLSGAG AGGTISLLAT GIIGIVKFVF
TIPAVLFVDN FGRKPLLAWG EANMAISHAI IAAIVAVYGD KFDTHKSAGN AAVFFIYWIS
ANFACTWGPL AWVVSSEVFP LDMRGKGMSV SSGANWIMNF TVAMITPHMI GSIGYKTYIV
FMCFCIVGFF FSIFILPELK GLSLEEIDNV FNDDSGVEDR ARRERIAAQI GLDKVADQIQ
HQEKVDDSEV