Gene CNI01350 details

Gene Information

Locus tag: CNI01350
Symbol:
ID: 3259690
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 401769
End bp: 404689
Gene Length: 2921 bp
Protein Length: 582 aa
Translation table:
GC content: 47%
IMG OID: 638258618
Product: transporter, putative
Protein accession: XP_572860
Protein GI: 58271408
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
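
The Start bp, End bp, and Gene Length fields above are related by inclusive-coordinate arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and assuming that GC content is the fraction of G and C bases in the gene sequence. The function names are hypothetical and are not part of any IMG interface.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace from the blocked layout is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of this record.
    print(gene_length(401769, 404689))  # 2921, matching the Gene Length field
```

Passing the gene sequence from the Sequence section to gc_fraction should give a value close to the listed 47%, if the assumption about how that field is computed holds.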


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.671168
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTCTGACCTT TCTACCTATC CTGTTGATAC GGGACTCCTC TTTTTAGTCC TTCCCCAGTC 
CGTTTGTCTT CTCAGCTCCT ATATTTTTTC GCTGGTCAAC AATTTGACAT TCACCCTTCC
CCGGGACAAA GTACTGGAAA CAATGACTCT CGGCGAGAGA GAACGACTCC TGCAACCGGC
TCCCGCCCCT CCTGGGACCA CGCTCTATAG CGAACCAAAC CATTCTGAAG ATATCGAGAC
TACCGATGAA CATAAATTGA GCTACAACAG GGTAGGGTTG AACGCCCGTC GATTTTGGAT
CTTGGTTGGT CATACTGCTT GTGTTCTTGT ATGAGAATAA TGGCTAGCGA AAAGCTTACA
ATGATCATCA CGTTAGTGTG CTTCGATGTG GATAGCCTCT TTCCTGAACG CCTTCGACGG
TACCGTAGGT GAGTGTGAAC ATTTTCCGCC ACATCCGTTG TTTTGCAAAG GTAGTTAGTT
GGCTGAGTGA TATGATGCAG TCGCTACTTT GCTGGGACCC ATATCTTCAT CGTTCAAAGC
GACCAACTTG GCATCATGGC TCGGTACATC ATAGTGAGTC GTATCTAAGC AAGAAGGGAC
GGCGTATGCT TATATTCCAC GCCGCCTTCG TAGTATGCTC TCAGTCTGCT GCTTCACTCC
CATTTACGGA CGATTGTGTA ACATTATTGG TCGTCAGGGT TCAATGCTGC TTGCGCTTGC
AATCTTCAGT AGGCATTCAA TGACCCGCTA ATAGCTTTTT GAGGTATCAA GCTGATTGCA
ACTGTCACTG CGTAGCAACT GGTAATCTTC TGTGCGCGTT TGCTCCTTCT ATGGAGGCTT
TGATTGCTGC TCGTGCACTA GCCGGTATGG GTGGAGGTGG TCTCAGTATA AGTGGGTTTT
CCGGGACGAA AATGGCTCCC CGAAGTGGAA TGAAGCTGAC CATTGGAATA GTTGGAAGTA
CCATCATGAG CGACATCGTC CCTATGTGAG CATCGCGTTT TCGGTAACTC TATCCTGTGA
GGGCCAATTC GTTAACTCCA TGTATGCAGC ACCCATCGAG GTATCTTCCA AGGTCTTGCC
AATCTTGCCT TCGGTAGCGG AATGGGGTAA GCTGCTCAGT TGTCATGAAA TGCGATTCTC
TGTGTGACAC TGTGATTTGT AGTCTCGGCG CTCCCATCGG CGCTCTCATC AACGATTGTC
TCAATTGGCG ATGGGCTTTT TGGGTTCAGG TACGTTTTTT CAACATATTT CTCACGCAAA
AATTCTAAGT TTACATTATC CGTCCTTAGA TTCCTGTTCT CCTCTTTGCC AGCTATCTTG
TCCATTCCAA TGTTCGATAT GATGTCCCAT CACGCCCCAG CTCAGGTGCC GCTACACCTA
ACCCTGCGGC TGTTAAGCAA ACCGCTATGC AGCTTTTCAA GCGGATCGAC TTTCTGGGAT
GTTTCCTACT TGCCGGATGG GTAGGCGCCG CTCTGATCGC CATCTCGCTC AATATTAACT
CTACTGCAAC AAATGCGTAC AACTGGTCTG ATCCGATCAT GATCGGCCTA TTCGCCACCA
GTGCTGTCTT ATTCGTCCTC TTCCTATTTG TAGAACTCAA ATGGGCAGCC GAGCCCGTCA
TGCCTTTTGA GCTACTGGTC AGTCGAACTC CGGTTGCGGT TGCTATCAAT AACTTTGTGT
TGTCTGTGGC CAACTTTGCT ATTGTAAGTA CTTCCGCCTT CGTCCCCTGA CTATGTCAGT
CCTCCATTCC TGCGCGGTAA TGATACCATT GACTAATAGT CTCTCTCGAA CAGCTATATA
GTGTCCCTCT CTACTTTACA ACTGTACGAC AAATGTCCGC TTCCAACGCC GGCGCTCATC
TTATTCCAAA CTCGTTCGTC GGCGTGATTG GCTCTCTCGG CGCTGGACTC ATTGTTCGAC
GAACTCATAA ATATTACTGG CTCAACACTT TTTGTGCATG CTTTGGAGTG ATTGGTTGCT
TCTTGATCTC CACTTGGAGA CTTGGTACAT CTGAGTGAGT TCTTTGGATC ATGGTCAACG
TTTAACACGC GAAGAAGCTG ATCACAAGTG CCACATCTCA GGTGGATGCT CTGGACGAAC
ATGTCATTCA CCAGTTTTGC CATGGGGGCT GTTACCACCT TGACCATCGT CGCTCTTATT
GCAGATGTCG GGCCTGAGCA TGTCGCCATT GCTACCAGTT GTGAGCATTT ATGGCTACGA
AAAGTATTTC CTGCCTTTGA TCTGACCTAC TTTCTCTCAG TGTCCTATGT GTTCCGTACC
ATCGGCCAAG TCTTGGGTGT AGCCTTGTCT GGAGCTTTGA CTCAGGCAGT TCTGACCTGG
GAACTGGAAA AGAGGATACG AGGTCCTAAT GCAGAAGAGG TGAGTTGCAC ACCAAAGAGA
TCGAGCTTTT TTCTGACATC TAGGACAACG ATTGTTCCAC AGATCATTGC GTCGATCCGA
GAATCGTCTG CTTCTATTCG CTATCTCCCA GAGCCTCTCA AGTCCATCGC GATTGCATCT
TATCAGAAAG GTCTACACGC TGTCTTCATT TGTACCGTAG TCCTGAGTGT GATCACTCTC
TTATCAGGCT TAGGAATCAG AGAACTTGAT ATGAAGCAGA TCATGTCCGG AGGAAAACAG
GCAAAGCAGG TACAGAACGA GAGCGAAGAG GAGGAGGCTT AAGGTCGTCG AGAAATATAT
CGAGAACATA TACCTGCTAA GGGAAGAGTA CTTTGTAAAG CGGAGGCGGC CTCGTCATTA
TACGTGTGGT GTGGGAGTGT GACACAATCA ATTGTATCTT AAAATAGAAC ATCTAGCATA
GAACATCTAG CTAATTGTTA TAGATTCATA TCTAGCCATC ATCTTGGACT TGTCATATGT
GATGCTTACT CATTGGATGC TTTGTCTGTG ATGTTGCATA A
 
Protein sequence
MTLGERERLL QPAPAPPGTT LYSEPNHSED IETTDEHKLS YNRVGLNARR FWILCASMWI 
ASFLNAFDGT VVATLLGPIS SSFKATNLAS WLGTSYMLSV CCFTPIYGRL CNIIGRQGSM
LLALAIFTTG NLLCAFAPSM EALIAARALA GMGGGGLSII GSTIMSDIVP ITHRGIFQGL
ANLAFGSGMG LGAPIGALIN DCLNWRWAFW VQIPVLLFAS YLVHSNVRYD VPSRPSSGAA
TPNPAAVKQT AMQLFKRIDF LGCFLLAGWV GAALIAISLN INSTATNAYN WSDPIMIGLF
ATSAVLFVLF LFVELKWAAE PVMPFELLVS RTPVAVAINN FVLSVANFAI LYSVPLYFTT
VRQMSASNAG AHLIPNSFVG VIGSLGAGLI VRRTHKYYWL NTFCACFGVI GCFLISTWRL
GTSEWMLWTN MSFTSFAMGA VTTLTIVALI ADVGPEHVAI ATSLSYVFRT IGQVLGVALS
GALTQAVLTW ELEKRIRGPN AEEIIASIRE SSASIRYLPE PLKSIAIASY QKGLHAVFIC
TVVLSVITLL SGLGIRELDM KQIMSGGKQA KQVQNESEEE EA