Gene CNK02020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK02020 
Symbol 
ID3254521 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp602518 
End bp604451 
Gene Length1934 bp 
Protein Length535 aa 
Translation table 
GC content47% 
IMG OID638253695 
Producthexose transport-related protein, putative 
Protein accessionXP_567678 
Protein GI58260536 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.50413 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGGAG GAAATTTTGC TTTCAACACT GTACCAAACA ACACAGCGGC TTCATGGTGG 
AATGACCCGG GATTGCGGAA ACTTGTCTTC TTCATGGCCA TATGCACTAT CTCTGCCGTG
AGTCCATTCG ATCGGCTCAG GACGTCAACA TTGACATTTT GTCAGGTAGG CAGTGGTGTC
GATGGTTCTT TAATCAATGG CCTTCAAATC ATTGATTCCT GTAGGCTTCA TACCTTTCAT
ATTTATCTTC CATATCTATA CACATGTGCT AACTCCCTAA ACAGTCATGG ACAACCTGGG
AGATATATCT GACAACTACC TCGGCCTTGT TATCGCGTCT TTCTCTCTTG GTGCCCTCCC
AGCCTTACCC ATCTCTGCCT ACTGTATGGA CCGCTTCGGT CGTCGTTCGT CTCTCCTATT
AGGTTGTATC TCTGTCATCG TGGGAGGCAT TGCCCAGTCT TTTACCCATG GTGCCAATGC
GTTCTTGGGC ACAAGATTCG TCATTGGCTT TGGTATTGCC CTTACAGGTA CATCAGCACC
AACACTTTTG ATGGAGCTTG CACACCCTAG GATGAGAGGT CAAGCAAGTC TGTTGTACAA
TTGTAGCTGG TATATCGGTG CTGTCATCAT TGGATGGATC ACGTATGGAA CCTTGGTAGG
GATGACTGGT AACTGGAGCT GGAGGTTGCC GTGTCTCCTT CAGATCACCC CAGAGATCTT
GCTAGCTGTT ATGTGAGGCC ATTCTCGGTT TCAAGTAAAA TGATATATTA ACGTATCCTT
AGCACTGTTA CACTGATGCC CGAATCCCCT CGATGGCTGA TTTCCAAGGG CCGCAACGAG
GAGGCTCTCA ACACTCTTGC CAAGTACCAT GTTAGTGTCT CAAATCCCCT GCAATGTGCC
ACTATTAAGT GAAACACTTT ATAGGCCAAT GGAGATAAGG AAGATGCTTT AGTCAAGCTG
GAGTATGACG AGATCGTAGA GGCCTTGGAG CTCGAGAAGA CAAGTCAACA TGGCACATCC
TACCTGACTC TTCTCAAAAC CCCAGGTAAT CGTCACCGTC TTATCATTTG TATCCTTGTC
GGTTTCATGT GTCAATGGTC TGGAGTGAGT GTTTCCATAA ACTATCATCC CCGCGAGCAT
TTGTTTATGA GTGTATTAGA ACGGTATTGT GACATACTAT CTTTCCCCCA TCCTGACCAC
TGCCGGTGTC ACATCTTCCG CCACTCAAGC GCTCATCAAT GTTTGCATGA ACATTTGGAA
CTATCCGTGG GCCATTGCTG GGGCTCTTTC AGCCAACAAG CTCGGTCGTC GACCACTGTT
CTTCATTTCT ACCGGGGGCA TGTTGATTTG CTACATTATC ATCACCGCTT TGGCCGCCGA
GTTCACCAAG ACCGGTAAAC CAGCGGTTGG GTATGCTCAG GTAGCATTCT TGTTTTTCTT
CTTCGGTGAG TAACCCATGG GATGGAACGG CTCGGTATCT GACACGTGAT AATCAGCATC
ATACGACTTT GGTTTCACAG GTCTTCAGTC GGCTTATCCG ATTGAAGTAC TTCCGTATTC
ACTACGCGCC AAGGGTTACT CTATTACCCA ATTCTGTATC TATCTTGCAA TGTTCTTCAA
CCAGTTCGTC AACCCCATTG GATTATCAAA CATCTCTTGG AAATACTACA TCGTGTATGA
TTGCATTCTG GTGCTTTTCC TGTTTCTTCA ATGGCTGCTC TTGTGAGTCG GACAAGTTTC
TTGACGATCG ACTTTAAACT CATGGTGTCT TTAGCCCAGA GACGAAAGGC CGTACCCTGG
AAGAGATTCG AGAAATCTTT GACAAAGAAC CACTAAGCTC CACGGAAGGT ATGGGCGCCA
ATGGGTCAGA AGCTTTGGAT GATGGTGGGA AGGCAGAAGA AGCATATGTC GAGAACGTCG
GCTCGCGCGT GTAA
 
Protein sequence
MAGGNFAFNT VPNNTAASWW NDPGLRKLVF FMAICTISAV SPFDRLRTST LTFCQVGSGV 
DGSLINGLQI IDSFMDNLGD ISDNYLGLVI ASFSLGALPA LPISAYCMDR FGRRSSLLLG
CISVIVGGIA QSFTHGANAF LGTRFVIGFG IALTGTSAPT LLMELAHPRM RGQASLLYNC
SWYIGAVIIG WITYGTLVGM TGNWSWRLPC LLQITPEILL AVITVTLMPE SPRWLISKGR
NEEALNTLAK YHANGDKEDA LVKLEYDEIV EALELEKTSQ HGTSYLTLLK TPGNRHRLII
CILVGFMCQW SGNGIVTYYL SPILTTAGVT SSATQALINV CMNIWNYPWA IAGALSANKL
GRRPLFFIST GGMLICYIII TALAAEFTKT GKPAVGYAQV AFLFFFFASY DFGFTGLQSA
YPIEVLPYSL RAKGYSITQF CIYLAMFFNQ FVNPIGLSNI SWKYYIVYDC ILVLFLFLQW
LLFPETKGRT LEEIREIFDK EPLSSTEGMG ANGSEALDDG GKAEEAYVEN VGSRV