Gene CNN00150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN00150 
Symbol 
ID3255310 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp54618 
End bp57143 
Gene Length2526 bp 
Protein Length550 aa 
Translation table 
GC content50% 
IMG OID638254430 
Productalpha-glucoside:hydrogen symporter, putative 
Protein accessionXP_568523 
Protein GI58262226 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCGACGGATC AAACAGCACA CCATGAACCT CAACGAAGAG AAGAACGTAG AAGCATTCGC 
CTTGGACAGT GAAGATAACA AGGATGGCAA GGACTTTGAC AAATCGATCG TGCACCTGGA
GAATGCCGGT GCAGCCAACA GTCAACAGTG GGATCAGCTT CGACTCGATG CGATGGCTGC
CGAGGAGCTC GAACACTCCA TGGGTTTAAA GCAGGCCTTG CGTGTTTATC CCAAGGCAGC
TTTCTGGTCT TTCTCTATCT CACTCTGCAT CGTCATGGAG GGGTATGACA TCGGCTGTAA
GCCTTGACAT CGTCCCTCGT ACTAACTATA CCCAGTGCCT AACTATACCC AGTGCTTGGT
AGCATCATCG GACTGCCCAC CTATCGTGAA AAGTATGGAC ACTATTCCGA AAGTGCCGGC
AGCTACCAGT TATCTCCCTC TTGGCAGACT GCCGTTGGTC AAGCCTCAAC TATCGGCTGC
TTCATGTACG TGGATTCCTT CTGTGGATAT CAACAGGCTG ACGCGTCACA GTGGTATTTT
TGCATGCTCC TGGGCCCAAG ACAGATGGGG ATATCGCCGA ACAATCCAGG CGGCACTCAT
CGCCCTTACT GGCTTCATTT TTATCGTCTT CTTTGCGCCT AATATCGAAG TTCTCTTTGT
CGGACAAGTC AGTCTCTGCA TATTCATCAT CAAGTAATGG CTGACTCGTC GCCAGATGCT
CTGCGGTCTA CCCTGGGGAG CTTTTGCTAG TGTGAGTTCC ACGACTCCCA CCAGGGCCTC
GACCAATGAT CAGCATAGTC TGCGGTATCT TACGCAAGTG ACGTGACGCC AGTCCCATTA
CGTGGCTACC TCACAACGTG AGTACATTGA TGCGCCGAAC GCTAATCAAC TCTAGCTATG
TCAATTTGTG CTGGGTCATT GGACAATTCA TCGCTGCTGG GGTATTGCAA AGCACTTCGA
CCCGGACTGA TCAATGGGGA TACAGAGTGA CTCCGACACG TACACCCCGC AGGATTAAAG
CTGACGCCGA TGTCAGATCC CGTTTGCGAT TCAGTGGCTC TGGCCTCTTC CCCTCTTCAT
CCTCGTAACG TTGGCGCCAG AGAGTCCATG GTACCTGGTC CGTAAAGGAA AACTGGAAGA
GGCGAAACAG GCTGTGGCGA GACTTTCCAG GAGAGGCGAT GTCACCGATC CCGCCCAGAC
GGTAGCCATG ATGGTAAGCC TATTGTATTT CTCTTGCTTT TGAGATTGAC GCTTTTTAGA
TCCGTACCAA CCAAATCGAG CTCGCGAATC AAACTGGCTC CAGCTATTTC GTAAGCGGAC
TGCACCTGCA TATTGACGAA GCTTACCCTC GGTAGGATTG TTTCAAAGGC AGCGATCTTC
GCCGTACGGA AATTGCGTGT CTGGGATGGG CTGCCCAGAT CCTTGCCGGT AGTACTTTTG
CGAATACACC TACTTACTTT TTCCAGCAAG GTTCGTGATC CACTTTGCTC GAAAGGCAAA
TCCTGACATT ATCATTCTCA GCTGGACTCA GTACCGCAAA TTCTTTCAAG CTGGGCCTTG
GGACTACAGC ACTAGCCTTT GTTGGTACTT GCGGTTCTTG GATCACTCTT ACGGTGAGTT
TGTTCCTCCA ATCGTAATTA GCAAGCTCAT TCGTTGGTTC AGTACTTTGG GCGGCGAACT
ATTTACTTGT CCGGTCTGGT GGTCCTCACT GTTCTGTAAG GCTCCCGTTT TCCCCCTTAC
TAAGCTCATA CACAACAGGC TTTTCGTCGT GGGTGGGGTC TCCTTCCCAG CAGAGACCAA
CAGCAACGCT GCGTGAGTAC CCAAAGAAAT ACGTGGATAA TCCCTCATAA CTGGCTTAGC
TGGGGTCAAG CCGCAGCAAT TTTAATCTGG GTTCTTGTAT ATGACTTCAC TGTCGGCGTA
AGTTATTTTT GCCCCGCAGT ACTGTGCTTG GATTGACTCC GTATAGCCGC TAGCCTATTG
TATCGTAGGC GAAGTATCGT CAACCCGTCT GAGGTCAAAG ACAGTTGGTC TATCTAGGAA
CCTCTACAAC ATTCTCAGTG TCGTCTCTGG AATCCTCAAC ACCTACCAGA TAAACCCGGA
CGCGTGGAAC TGGAAAGGAA AAGCTGGTTT CTTCTGGGTT CGTTTGTTTC AGACCATGGT
GGGAAAAGTG TGCTGATTGT GTAGGGAGCA TCCGCCGGGC TCATCGCCAC TTGGGCGTAC
TTCCGATTGC CCGAATGCAA GGGTAGAACG TATCGAGAGC TCGACATCAT GTTTGAGCGT
GGCATTCCCG CTAGACGATT CAAGGACACG GTCGTCGACA AAGAGGCTGA GGAGTAATGA
GAGAAGGGGG GGAAGGGGTC AAAGGACCTA TCCGGTGGCG AGAGGTGTTG GTAAGGTTAG
CAAAGACCGA GGGAGGCATA GCTGCAAATA TGTGGTAATT AGTGCCGTTT CAGATATATG
AAGGATTATG ATAGTGATTC AAACGCCTAG AAGCAATATA GCTTGAATGC GTTCTTCATG
CTATTA
 
Protein sequence
MNLNEEKNVE AFALDSEDNK DGKDFDKSIV HLENAGAANS QQWDQLRLDA MAAEELEHSM 
GLKQALRVYP KAAFWSFSIS LCIVMEGYDI GLLGSIIGLP TYREKYGHYS ESAGSYQLSP
SWQTAVGQAS TIGCFIGIFA CSWAQDRWGY RRTIQAALIA LTGFIFIVFF APNIEVLFVG
QMLCGLPWGA FASSAVSYAS DVTPVPLRGY LTTYVNLCWV IGQFIAAGVL QSTSTRTDQW
GYRIPFAIQW LWPLPLFILV TLAPESPWYL VRKGKLEEAK QAVARLSRRG DVTDPAQTVA
MMIRTNQIEL ANQTGSSYFD CFKGSDLRRT EIACLGWAAQ ILAGSTFANT PTYFFQQAGL
STANSFKLGL GTTALAFVGT CGSWITLTYF GRRTIYLSGL VVLTVLLFVV GGVSFPAETN
SNAAWGQAAA ILIWVLVYDF TVGPLAYCIV GEVSSTRLRS KTVGLSRNLY NILSVVSGIL
NTYQINPDAW NWKGKAGFFW GASAGLIATW AYFRLPECKG RTYRELDIMF ERGIPARRFK
DTVVDKEAEE