Gene CNB05710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB05710 
Symbol 
ID3255917 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp1601208 
End bp1604158 
Gene Length2951 bp 
Protein Length553 aa 
Translation table 
GC content47% 
IMG OID638255213 
Productsugar transporter, putative 
Protein accessionXP_569312 
Protein GI58264312 
COG category 
COG ID 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.19394 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAACCCACTG GGATCCCCAG ATAGGCATCG GGTTGTTGTA GGGTTTTGCT TCAAATTTGC 
GGGGTTTTAC GCATTTACAC AGGAACATGA CAGGAACGAG CAGCTGGTCT TCCACCTTTT
GGCCTGCTGA GGGTTCGAGC TGTACAAGAG GAACGACTCT GGCACAGGGA GGTGAGTTGC
GAAATACCCT TGATTGGCGA GAGGCGAAAG AACAAACCAA GCGGCTTAAT TTTCCACCTG
CCCTTCGGGA TGATCCGTCG GGTAGGAGGG CGATATTTAC TTTTGCTTTC AAACAATGAC
ATGATGCTGC TCCAAACCCA AACTGGCCAT ATATACCGCG GCCTCCAAGA GTCAATACTT
CCTTTGCTGG TCCTCTTCAC ATTTCCAAAC TTCTCCCTCG CATTCAAAGG CCTTGACTTT
TCAATTCCAC ATTTTAATCA TGACCTCCGA CCAACAGCAC CCTCACTCAG GGTTCCACGA
TATTCAAGCT ATCCCTCTCG AGATGAACGA GAAAGACAAT GAGCTCATTC ACACTGAACG
TCTTCAAACC GCCGAAGAGA TTCGAAAGAC CATGTCCAAG GGCGAGGCGG TGCAGATGAG
GTCCAAGTTT GCCGACTTGG GTACCATCCA AGCCCTCAAG GAATACAAGC TCTTAGGTTT
CGTCGCAATG GCAGCGGCCT TCAGTGCTTC ACTTGACGGC TATCGTAAGT CTTAGACCTA
AGGTTGAGCC GCAGTACCAC TAAGGTACAT CAGAAATCAA CTTGAACGGT GGTATCGTCT
CCAACAAAGG TTTCATTCAA CAGATGGCAG CCCCAGGCAC TGAAGCTATC GATGGAAAGT
ACGTCTCGGC ATGGGGTGGT ATCCAATCTG CCGGCCAAAC CTTAGGGCAA ATCGTACGTC
TCCTATTGCA TCTATCAGTT TTCTGGATCT CGAATGAAAG CTGATTCAAT ATCAGTTTCT
TCAATACGCT ACAGACGCTC TCGGTCGAAA ATACGCCCTC TATATCCTAT GGGTCTTCCT
CGTAGCTTCC ATCTTCGCCG AAACCTTTGC TAGCCACTGG TCCCACTGGC TTGTCGCCAA
ACTCTTCTCT GGTATGGGTG TCGGTATGTT GCAAGCAACG ATGCCGGTCT ATTTGAGTGA
AGTAGCGCCT TCTCAGCTTC GAGGTTTCTT CATCAACGCC TACTCATTGT GAATTTTGTA
TTTTTTTATT CCATTTACCC CATTTACCCT TGGTAAACGA AGGAAGAAGC TGACAAACTA
GTTAGTTGGT TTTGCCTCGG TCAACTTGCA GCTTCTATCG CGCTCAATGA GCTCAATGAT
ATGAAGCCTT ACGATTTCCG AACTGCCATT TACACTCAGG TATGTCCCTC TTCCATCAAT
TCTGTCAAAG TACCTTCTTT CTCAAAGCTA ACCAAGTAAT TAACGTGTAG TGGCCAATGG
TCGGTGTCAT GGGTATCGTC TTCCTCTTAC TTCCTGAATC TCCTTGGTGG CTTGTCAGTA
AAGGCAAGCT TGAAAAGGGT TCCAAAATGT TGTCAAGATA TCAAGGTCAT CTCCAAGGCT
ACTCCGTTGA AGAAGAAGTT GTCAGTATGA TTCCTAATCG AATCGTGTGA ATCGTATCTG
ATACGAGTAT GAACATAGGC CATCATGACA GCTACACTGG AAGAAGCCAA GCTCATCGCC
AAACGTCAAG GACAAGAAGG TCAATTGGCA GTATTCAAGG GAAGCAACTT ACTCCGTCTG
TTCATTGCTT CTTGGCCGAA AATGATTCAG CAATTCGTTG GGTACGTTCA TACCAATTAC
CTCTTTATGC CGAATCACTT AGATCGTTTA TCAATAGTCT CTCTGTCTTC AACACTTACG
CTACCTACTT CTGTAAGCTT AATCATCGTC TCGCTCTCTT TCCCCTCTCC CTCCATGGCA
ACCACCCTTC AATACGAAAC CGCTAACTCT CTCAAACAGT CCAACTTGCG GGTAACAGCA
ACCCCTTCCT CGTTACCGTC ATCCTCTCCT GTGTCCAGCT CATTTCTATG CTCATCACCG
TCTCCCTCAG TGACAATATC GGCCGTCGAC CGCTTACTGT CTACCCATAT GGAATCACTG
TCCTCTCTGT TCTTTCGCTT GGTATTATCG GCTGTTTCGA CTACACCAGC AAATCTCTGG
GTTCTCTCCT CGTATGTCCA GTCCTTCTCT CCATCTTTAT GTTGAAATGA AGAACCCTTC
CCTAATCAAG CTTTATCATA GATCTTCTTT GCATGTCTCG CCACCTTCTC AACTACCGGC
GCTTCCGCTA TCGGCTACGC CTATGCTGCT GAAATCCCTA CTCAACGACT TCGAGCCCAA
ACTGCAGGCT GGGGTCTTGC GTTGTCCAAC ATGGTTGCCA TCATGTTCTC TTTCTGCACT
CCGCTCATGC TTAATGGTAA GGCTAAATGG AATGTCAAGA CGGGATTCTT GTAAGTATTT
CTCCATCTGC CCTCCTTTTG ATGAAGACCA TATCTCGAAG GGACCGTAAT CGGAAAGTCA
AATTTGGTAC TGACAAAAAG GTATATAGCT TTGCCGGAAC AGGCTCGGTC GCTACTGTCG
TTGGGTGGTT TATCCTTCCT GAAGTTGCCC GTAGGACACC TGCCGAGATC GACGAGCTGT
AAGTCACCCC TTCTTCACCC TCTTGCATTT AAGCAATAGA AGCAATAGCT TGACTGACCT
CCTGGGTTTC ATTTTGTTTC GTTCAGGTTT GAGAAGAAGG TCAATCCACG CAAGTTCAAA
GGTTACGTTA CAGACGTCGA GATCTCTTAC CGAGAGGCAA ACGAGACTTC TGCTTAAGGT
TTGCTCAGTT GTTAAGATTT GAGGGAATGT AAAGATTACT AGGTCTATGT CATTGTGTAT
ATGCTTTAGG GGGTTGGGTA TAATCGAAAG TAGGGACCTT ATAGGGCTAC TATCATCATC
TTTTGTTAAT C
 
Protein sequence
MTSDQQHPHS GFHDIQAIPL EMNEKDNELI HTERLQTAEE IRKTMSKGEA VQMRSKFADL 
GTIQALKEYK LLGFVAMAAA FSASLDGYQI NLNGGIVSNK GFIQQMAAPG TEAIDGKYVS
AWGGIQSAGQ TLGQIFLQYA TDALGRKYAL YILWVFLVAS IFAETFASHW SHWLVAKLFS
GMGVGMLQAT MPVYLSEVAP SQLRGFFINA YSFWFCLGQL AASIALNELN DMKPYDFRTA
IYTQWPMVGV MGIVFLLLPE SPWWLVSKGK LEKGSKMLSR YQGHLQGYSV EEEVAIMTAT
LEEAKLIAKR QGQEGQLAVF KGSNLLRLFI ASWPKMIQQF VGLSVFNTYA TYFFQLAGNS
NPFLVTVILS CVQLISMLIT VSLSDNIGRR PLTVYPYGIT VLSVLSLGII GCFDYTSKSL
GSLLIFFACL ATFSTTGASA IGYAYAAEIP TQRLRAQTAG WGLALSNMVA IMFSFCTPLM
LNGKAKWNVK TGFFFAGTGS VATVVGWFIL PEVARRTPAE IDELFEKKVN PRKFKGYVTD
VEISYREANE TSA