Gene CNF04810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF04810 
Symbol 
ID3258146 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp1398957 
End bp1400768 
Gene Length1812 bp 
Protein Length448 aa 
Translation table 
GC content47% 
IMG OID638257599 
Producttransporter, putative 
Protein accessionXP_571631 
Protein GI58268950 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCTGTACTTA GTCCAAAAAA GCTGCTCTAA CTTACCCAAT CAATGGCCTC CGAGATTGAG 
CTGTCCCGCC TTCCGAGCCA GCGAAATGAA TCCGCCACTG AACAACAACT CGACTATGAT
CGGGAACACG AAGTCGATGA ATCAGCTACA CATTACGCGC TGCCCCCGGT AGATGGAGGT
CGAAGGGCAT GGGCATTCCT TGCTGGTGCT ACCGTTGTGG AGATGCTTGT ATGGGGATTC
CCTTACTCGA TTGGTATCTT GCACGCGTAC TGGAGTAATA TCTTATTCAA AGGCTACGGC
GAGTCAGCAA TCACCCTGGC TTCCACTCTG CAGACCGGTT TACTCTACGT GAGTTGTGCC
ATCTTTGGGC CGTAAGTTTT CATTTCTCGA AAACCCCGAT TGATGACTGA CCATCCTTAA
GAGTGTTTAC CAGATGGCCG AGATGGGAGA AAACTCTCCA ATATTTTGCT CTTTTCGCTT
CTGCATTGTC GATGATTGGG AGTGCTTTCG CAACGAAGGT GTGCTTAACC AAACATATTG
AACTCAAGTG CTGATTATCA TCTAGCCCTG GCACCTTGTC ATAACTAACG GCTGCATCTA
TCCATTCGCA GGAGCCCTCT ATCTTCCGTG TTGTACTATT CTTTTCGAAT GGTTTGTCGC
TAAGCGGTTC GTTTTTCTTG TTCATAACCA TACTCATGCG TTCGCTAATC AAGACACAGA
GGAATTGCTA CTGGTCTTAT GTATGCAGGT ACTGGTATCG GTGGCGTAGC ATACCCGTAT
ATTATGAGTG GCCTCTTGAA CGGCGTAGGC TACAAAGCTG CTCTAGTATC GATGGGTATA
GGCTACGCCA TCCTTGGTTC CATCGCCCTC ATTCCTGTCA ACCGACGAGT CCCCCTTTCT
CGATACTATT TTACAGAGCC TGGAAGGAGA AAACAATTCA ACTTTTCATT CTTGAAAAGC
TCAGTCGCTT TGACGGGTTC GTTGATCATC CTTTTCGTCA GCATGGGTAA CTTCATCCCC
ACTGTCTGGC TGCCATGTAC GTCGGACCAT TATTCGTTTT TGGCAACGAT ACTGACACAT
ATGTAGCTTA TGCCGACGAC CTGAAACTAC GTCACCTCGA CGGTACCGCC CTCATTGCCA
TTCTCAACGC CGCCACTATC CCAGGCAATA TCCTCCTTGG CTACTTCTCT GATTTTTCTA
TCCGCGCTGT TATCGTCGTC TCTTGCGTCG GCAGTGCTTT CGGATGCGCA TTCTTGTGGG
GTTTCGGGAC GAATGCGGCC ATGCTTGTCG CTTTTGCAAT CGTTTACGGT TTTCTAGGAT
CAAGCTTCCA GTGTCTATGG TCTAACATGA TTAGTGTCAT TTCTAGTGAG TTACTGTACG
GACCGATGCC CATTCCTTTC GCTAACGCAG GATTATAGAG GACGACCCCA TTGCTCCATC
TCTAATTTTC TCAATCTTTG CCTTAATGAG GGCTATCGGT AACATCACAT CTGGTGCGTA
TATAGCGCAA TTTTCACAAT GAGTGTACTT ATACTATTGT AGGGCCCGTT TCTGGCGCGC
TCATGAAGCA TGACTCGTTC CCTGGCGCTG TTGGAGCTTA TGGTTTCCAC AACTATGTGA
GTATTCCCCC TTCATCAGGT GCCCATATGC TGACGGCGAT GTTGGAAGGG CGCCTTGTTG
GTGTATACAT CTGTAACAAT CTTTACCGGG GGAGTTACCG GTATCCTGTT CAAAGACCGT
TAGGGAAAAT CTACATCTCC GAAAAGATTT TAAACTCTAT AGAAATCGAA CGCTGTAGCA
ATAGTATGAT AC
 
Protein sequence
MASEIELSRL PSQRNESATE QQLDYDREHE VDESATHYAL PPVDGGRRAW AFLAGATVVE 
MLVWGFPYSI GILHAYWSNI LFKGYGESAI TLASTLQTGL LYVSCAIFGP VFTRWPRWEK
TLQYFALFAS ALSMIGSAFA TKPWHLVITN GCIYPFAGAL YLPCCTILFE WFVAKRGIAT
GLMYAGTGIG GVAYPYIMSG LLNGVGYKAA LVSMGIGYAI LGSIALIPVN RRVPLSRYYF
TEPGRRKQFN FSFLKSSVAL TGSLIILFVS MGNFIPTVWL PSYADDLKLR HLDGTALIAI
LNAATIPGNI LLGYFSDFSI RAVIVVSCVG SAFGCAFLWG FGTNAAMLVA FAIVYGFLGS
SFQCLWSNMI SVISKDDPIA PSLIFSIFAL MRAIGNITSG PVSGALMKHD SFPGAVGAYG
FHNYGALLVY TSVTIFTGGV TGILFKDR