Gene CNF00020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF00020 
Symbol 
ID3258335 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp3140 
End bp5110 
Gene Length1971 bp 
Protein Length539 aa 
Translation table 
GC content52% 
IMG OID638257122 
Productmaltose porter, putative 
Protein accessionXP_571466 
Protein GI58268620 
COG category 
COG ID 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0123203 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAGA TGAATTCTGA CATCGCCCCG CAAATCGAGG CCAAGTCCTA CCATGACGAG 
ACGATTCATG TCGAACAGGC GGACGACGAA ATCAAGGCGG ACATGATCCA GTTCAAGGCG
GATGCGGTGG AGGCGGAGAA TGCCGAACAC TCTATGACGG TATTGGAGGC CGTCAAGGCG
TATCCTATGG CTTGCTTCTG GGCCTTTGTT ATGTCTTTCA CCATTGTAAG CTGTCCTACT
TTGCCCTCTG AGGTACGATC AGCTGATTCA TGTAGATTAT GGAATCCTAC GATGTATTCC
TCATTGGAAA CTTCGTGGCC CTTCCCGCCT TCAGAGACCG GTTCGGTATA TTTGACGAAG
CGACTGGCGG TTATGTCATA GCGACTAAGT GGCAGTCGGC GCTTCAGATG TCCGGGCAGC
TTGGTGCGCT CATTGGCGTT TTCCTTGCAG GTCCCCTCAC TAGCCGTATT GGATACCGTT
GGGCGACTTT GGTTGGCCTC ATGCTCATGA ACGCGACCAT CTTTGTTTCA TTCTTTGCCA
ACTCGCTCCC ACTCTTCTTC ACTGGTCAGC TCCTTGAAGG CATACCATGG GGCATTTTCA
TCGCCAATGC CCCTGCATAT TGCAGCGAGA TTGTCCCCAT GCGCTTGCGT GCCCCTGCTA
CACAGGTCCT GCAAATGTTC TGGGCAATCG GCAGTATCGT CGTCGGTGCT GTCACTTACA
GGTACAACAC CAGGCCCGAC ACTGCTGCCT ACAAGTATGT ATCGTCGTCA TCTATGATTT
AAGTATGCCA TCAGCTGATG GAAGCAGAAT ACCACTCGCG CTTCAATGGA TGTTCCCCAC
TCCACTCGCT ATCCTCATGT TTCTTGCGCC CGAGTCACCT TGGTGGCTTG TCCGAAAGGG
ACGTCTCGAC CAAGCGGCAC GCTCGGTAGA ACGTCTAGGA CGCAAATCGA GGCTCAACGC
GGGTGAGGTT GTGGCTATGA TGCGACGTGT TATCGACTTG GAGACGTCCA CCTCTGCGCC
CGGTTACATA GAATTGTTCA GAAAGACAGA CCTCCGACGT ACCCTCATTG TGTGCGGTAT
CTATGGAGCA CAGAATCTTG CCGGCAACCT TATCGCTAAC CAGGCCGTTT ACTTCTTTGA
GCGTACGTTA TGGCTACACC AAGGGGAACT CTTGTGCTAA CTCAGAGTAG AGGCCGGCAT
CAAGACCAAC CTTGCGTTTG CTTTGGGCCT TATCACCTCG GCCTTGCAGA TGGTCTTCGT
TATGGCTTCA TGGTTTCTTA CAACTTATTT TGGCCGACGT ACCCTCTACC TATGGGGCAC
AGGTGTCAAT ACCGCCCTGC TTATTGCCCT GGGGGTTGCC GCCTCGTGCG GCACTTCAAC
TGCGGCTTCG TATGCCCAGG CGAGCTTAGG CCTCATCATC TCTGTTTTGT TCACCTTCGC
CGCAGCGCCC GTCTCGTGGG TCGTCATTGG AGAAACGTCG GCTATCCGGC TCCGACCCCT
CACAACAGGA ATCGGGCGAG CTACATATTA CATCGTTGAG ATACCTTGCA TTTTCCGTGA
GTCTTCTACA CTGATTACCT GCACAGCCGG CTGATGCAGA TCAGTCGCTT CGTACATGCT
TAATCCCACT GGCGGAAATC GTAAGTGATA TTTCCTTCAA CCGTTCTTAA ACGATACTGA
CAAGTCGCGC AGTTGGCGGG AAGTGTGGCT ATGTGTGGGG TGCGACTGGA TTATTTTGCT
TCGTCGTCGC GTTCTTTTGC CTGCCTGAGA TGAAGGGTCG ATCCTACCGC GAGATCGACC
TTCTGTTCAA GCGTCACACG CCGGCTCGTA AGTTTGCAAC AACGGAAATT GGAGTAGAGG
ACGATGAGTA GAAAAGACCT AGTTAAGAAG TTGCATTTGG TTTGTATTTA CAATGTTTTA
AGGGTTATGA GGTTGTGTCA GGGGCGCGTT TACTTTATTA GCTTGATGTA T
 
Protein sequence
MSQMNSDIAP QIEAKSYHDE TIHVEQADDE IKADMIQFKA DAVEAENAEH SMTVLEAVKA 
YPMACFWAFV MSFTIIMESY DVFLIGNFVA LPAFRDRFGI FDEATGGYVI ATKWQSALQM
SGQLGALIGV FLAGPLTSRI GYRWATLVGL MLMNATIFVS FFANSLPLFF TGQLLEGIPW
GIFIANAPAY CSEIVPMRLR APATQVLQMF WAIGSIVVGA VTYRYNTRPD TAAYKIPLAL
QWMFPTPLAI LMFLAPESPW WLVRKGRLDQ AARSVERLGR KSRLNAGEVV AMMRRVIDLE
TSTSAPGYIE LFRKTDLRRT LIVCGIYGAQ NLAGNLIANQ AVYFFEQAGI KTNLAFALGL
ITSALQMVFV MASWFLTTYF GRRTLYLWGT GVNTALLIAL GVAASCGTST AASYAQASLG
LIISVLFTFA AAPVSWVVIG ETSAIRLRPL TTGIGRATYY IVEIPCIFLA SYMLNPTGGN
LGGKCGYVWG ATGLFCFVVA FFCLPEMKGR SYREIDLLFK RHTPARKFAT TEIGVEDDE