Gene CNC03220 details

Gene Information

Locus tag: CNC03220
Symbol:
ID: 3256199
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 1015066
End bp: 1018523
Gene Length: 3458 bp
Protein Length: 561 aa
Translation table:
GC content: 46%
IMG OID: 638255545
Product: receptor, putative
Protein accession: XP_569994
Protein GI: 58265676
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00879] MFS transporter, sugar porter (SP) family
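
The Start bp, End bp, and Gene Length values above can be cross-checked with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state but which matches the listed length; `gene_length` is a hypothetical helper name, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record for locus CNC03220.
length = gene_length(1015066, 1018523)
print(length)  # 3458, matching the listed Gene Length
```

Because the gene is marked as spliced, this genomic span is expected to be longer than the coding sequence implied by the 561 aa protein.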


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.630871
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GGTGGGCCCT CGATCCCTCA TTCTCTCGTT CCCTCGCGTC GTCTGATAGC ATAGTCGGTC 
TACTCCTTGT ACAATTGTCG ATGGTCCGCC GTCTACATGT CCTTTCGCAC TTCAAATAAC
CAGTAATAGA CCAACTGCAC CAGAAGACGC CTTCTCCTCC TCCCCATTGT GTATGCGTGC
CACACTCGCA TAGTTATAAC AGTCCCAAGT CAAGCGCTGC TCAGATTAGT ACAAAGGGAC
AGCACACTGC GTGTTCGACA CGTCTGAGAA GAGCACAACC ATGCCCGATC CCTCTATTCC
GGTAGTCGGT CACAAGACTC AGCGTAAGCT GGTCGGACAT AACCTACTGT ACAGTGTTTC
AGTGTTTCTT AGCATAGGAG TCTGGTTATT TGGGTAGGTG TTTTGTGAAT CCGGATGGTG
TTGTGCTGAT ATGATAACCG GTTAAGATAT GACCAGGGGT AAGTTCACTT CAAGCGCTTA
TCCTTAAGCT TCGCTGATCA TGGCCTTCAA ACTGTGAAAG AGTAATGTCC GGTAAGTTGG
TATCACCATC ACATATCTCG ATAGCTAGTT GACTTTCTAT GTAAGGAATT ATTACCGGCC
CATACTTTAA GTAGGTATGA AGTAAATACT GAGTATCTTT CATTAACATT TATGGCACGT
GTCTACAGAG CTTATTGTGA GTAAAGCCCT AATCAAGCAA ACTAAGCTGA CGTTTTTAGT
CAACCAACCA ACGTCAACGC AGATTGGCAA GTAAGTTAGT CATTCAGTGC CAGTAGCAAT
GTCTGAACAT TCGATAGTAT GGTGGCCGTT TTGGAGATTG GTGCCTTCAG TAAGATTATT
CCATGTCATG TCATTCTTTC AAGAACTCAC GGCAATCACG CAGTTACTTC TCTGGCTGCC
GCTCATATTG CAGATAATTA TGGAAGGCGT ATGACCCTTC GCACAGGTGC AATAGTCTTC
ACCATTGGAG GTGCTATACA GACTTTTTGC GTTGGATATA ATTCCATGGT ACTTGGAAGA
ATTGTCAGCG GCTTTGGGGT AGGGATGCTG AGTATGGTCG TGCCAATCTA TCAGGTATGT
GGTTTACAAT AGTAAAGGCG CGGAGCATAA GCCAATGTGT GTCACCCTCG CGCAGTCCGA
AATATCTCCT GCAGACCATG TAAGAAACTC TTCAATATTT CCAGAAGATT TTTCTAATAA
AACACGATGA TAGCGAGGCC TTTTGGGCTC TGTCGAATTC ACAGGTAATA TCATTGGCTA
TGCCTCCTCT GTTGTACGAT GTGTCACCTG CCTATGTCAA GCACCATTCT CACTCAACGT
TCATAGTGGA TCGACTATGC CTGTTCATTC TTCCAGTCTG ACTGGTCTTG GCGCCTCCCG
CTTTCTGTTC AATGTATAGG CGGCTCTGTT CTCTTTATCG GCAGCTTCGT CACACCAGAG
TCTCCCCGGT AAGCCTTCTA TATGTGCATC ATATGTAGGT GCAGAACTAA GAGCTGTTCA
AAGGTATCTT GTCGATACAG ACCAAGAGGT GGAAGGTTTA GCAGTCATCG CTGATTTTCA
AGGGAAAGCG CTGGACGATA TTTCAGTGCA AGCCGAGTAC AAAGAAATTC GAGATGCTGT
TCTAGCCGAC GTGAGACAAT CCTCTTAACG TTATCCCCAT ACACTTGCTT ACTCTGTTTT
GTTTTTTTTT ACTAGAGAGC TGTCGGAGAT AGAAGCTATA GGGCTTTATG GAGGAGATAC
AAAGGACGAG TTCTGATTGC AATGAGCAGT CAATTGTTTG CTCAACTGGT GAGTCAATCT
TTGCAAAAGT CAAAGAAACA TGAAAATAAA GCGGTCTGTA CTAATGATTA TTGGCAGAAT
GGCATCAATG GTGAGCTTAG AAGAGCGCTG CAAACAATTA CTAACCATTT GGCCAGTCAT
CTCATATTAT GCACGTGCGT CCCATCTCAT TCTGCCGATC ATTCTTGACA GCTGATGTGG
ATTACAGCTC TTGTCTTTGA ACGTTAGTCT CGCTGAAGAC CTAATACTGC CGTGTTACTA
ATGGTGATGA AGAGGCGGGG TGGATTGGGC GTGACGCTAT CCTTATGACA GGTATCAATG
CCTTATTTTA TGTGGCAAGC TCACTTCCGC CGTAAGTTCA GGTCCATCCG AAATTGTGCA
CATTCTCAAA TTATTACTAG ATGGTATCTC ATGGATCGAG CGGGTCGAAG GCCCATTTTG
CTCTCGGGAG CAGTGGCCAT GGCGATTGCA CTGACGGCTA CAGGATGGTG GATATATATT
GATCAAGCAA TAACACCCAA TGCTGGCTCG TCTTTTGTTC TGCCATGTCG GATGAAGCTG
ATGGTATTTG TCGATAGTGG TCATTTGCGT AGTGATTTAT AATTCCGCAT TTGGCATGAG
CTGGGGACCT GTCCCATGGT ATGTGTCATT GACATATGAC CGGTCGGAAA TGAAGTTAAT
TAATTAATTG ATCCCAGGCT TTATCCTCCG GAAATCATGC CGTTGTCATT CCGAGCAAAG
GGAGTATCCT TATCTACTGC TACAGTACGT CCAATTTTAT CCCGAATGCA CAACGATGGG
CTAATTTGAG TTATCAGAAC TGGATCTCAG TGGGTCTGGA GCGCTTTAAT CCTGCTTTTT
GCTAACGACT GTTGTATCTG CAGAATTGGT GGGTAGGGGT TTCAACACCG CTCTTTCAAG
AACTTATCGG ATGGCGATTA TATCCGATGC ACGCATTCTT TTGTGCATTA TCATTCATCC
TCGTGTACTT CCGTGAGTTG TCAGCCGAGA TTCGTAAACC ACCACTCATG CACATGGAAC
TAGTCTATCC CGAAACCCGA GGCGTACCGC TTGAAGAAAT GGACAAATTG TTTGGGGATG
AAAGTGATGA AGACGAGGTT GATTCGGACT TCGATGAAGT TGAGGAAGCC GAATCAGAAA
TATCCTCTCT AGTCAGCAAT CCTCGACACC GACGCCGCTC GGCCAGCTCT TCATTGGGCC
CATCTTTGCC GACCTCCCGA AAACCGTCAC CCATACCCTC TAGGGAGGCT TCATCTAGCC
GAGGACTGTT TGGACGTATA ACTGACTCGG TGAATGGTCT GATTGGAAGC ACAAAACAGC
AAAGCAGGAG CGTGGGGTAT ACTGCTGTCA ACGAGGAATA GGAACTCGCG AGTGAGCATC
ATCCGGACTC ATTTCAAGTG ACACTTGACA CTAAATGACC AGGGTTCGAT AAACGGAATC
CAGGGCGTCG TCATGACTTT ACGGGACAGT TCGAAGAGCT CTCTGAAAGT CACCACGAAG
ATGACTGGGA AGTAGACGTA GGAGATATAG AAATGGGGAT AGGAGAAGGG TTGCTTGCCA
GGCGTGTGGC GATTTCCAAT GTCGAGCCTC GAGAGTGACA ATATTTTGTA CGAGAAAGTG
AAGGGTGTAG CATAGTCGAA TCGTATTTGA TGGTCTTC
 
Protein sequence
MPDPSIPVVG HKTQRKLVGH NLLYSVSVFL SIGVWLFGYD QGVMSGIITG PYFKAYFNQP 
TSTQIGNMVA VLEIGAFITS LAAAHIADNY GRRMTLRTGA IVFTIGGAIQ TFCVGYNSMV
LGRIVSGFGV GMLSMVVPIY QSEISPADHR GLLGSVEFTG NIIGYASSVW IDYACSFFQS
DWSWRLPLSV QCIGGSVLFI GSFVTPESPR YLVDTDQEVE GLAVIADFQG KALDDISVQA
EYKEIRDAVL ADRAVGDRSY RALWRRYKGR VLIAMSSQLF AQLNGINVIS YYAPLVFEQA
GWIGRDAILM TGINALFYVA SSLPPWYLMD RAGRRPILLS GAVAMAIALT ATGWWIYIDQ
AITPNAVVIC VVIYNSAFGM SWGPVPWLYP PEIMPLSFRA KGVSLSTATN WISNWWVGVS
TPLFQELIGW RLYPMHAFFC ALSFILVYFL YPETRGVPLE EMDKLFGDES DEDEVDSDFD
EVEEAESEIS SLVSNPRHRR RSASSSLGPS LPTSRKPSPI PSREASSSRG LFGRITDSVN
GLIGSTKQQS RSVGYTAVNE E
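
The sequences above are displayed in space-separated blocks of ten residues, so they need to be normalized before any length or composition check. The following is a minimal sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and the GC calculation here is a plain G+C fraction, which may not be exactly how the database derived its reported GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "GGTGGGCCCT CGATCCCTCA TTCTCTCGTT CCCTCGCGTC GTCTGATAGC ATAGTCGGTC "
seq = clean_sequence(first_line)
print(len(seq))          # number of bases on this line after cleaning
print(gc_percent(seq))   # GC percentage of this line only
```

Applying the same two functions to the full gene sequence gives a total length and GC percentage that can be compared with the Gene Length and GC content fields.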