Gene CNI03850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNI03850 
Symbol 
ID3259788 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006694 
Strand
Start bp1035296 
End bp1038816 
Gene Length3521 bp 
Protein Length869 aa 
Translation table 
GC content50% 
IMG OID638258880 
Productexpressed protein 
Protein accessionXP_572957 
Protein GI58271602 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GATCCATATA GGCTTCCCAG AATACTTCTC AACGTCTCAA CACCTCATCT TCGTAACCAT 
AATCCGCGCA CACCGCCAAA TATGACGGCC ACTCCTCCGG GCAGAATAGC CCTTCCGCCC
AAACCAACAA TTACTGACAT CCTCGGCGGG TCTTCGGCTC TCGCGTCCGA CACACCATCT
CCTCAAGCAC ATCAAAGCTT CAACTCTCCA CACTCTTCCG CCGTCAATGG GCATTCAAGA
GGGACTATTC CGCGCTCCAA AGCTCGTACA AAGTCGGCTC GACTTAGTTT TTCTAGTTAT
GGTAGAGGCA GAGCAGCCAG TTCTCGAGGT CCAGCGGGCG GAATCGTCGA CGATCTGCTA
GATGATGAGG ATTTCGGACC AAAGTTGCCC ACTTTGGTGC CCGGGATCCG AGCGGCTTAT
GCCACTCCTC TCCCTGCACT TCCTATGATT GTAATGTGTA TAGTGAGTGG CAACGGATTC
ATAGATCTGC AATGACATGT AGCTAATCTG CTGCTGCGCA GGCAATGCTG TCAGAGCTAT
TAGCGGCGAA CTTGTGCACT CCATTCATCC TCAAGCAGGT CGAAGGTGTG TCTACAGGTG
TTTATATGTC TAGTAATGGC TTGCTGACTT AATCCCGCAT GGAAGGTTTC TTCCTTAACT
CCGGTCGAGA AAAGAATAGT GAGACGGAAG CCGCAGTTGG TCTTTGGACA GGCAATCTTG
TCTCTGTCTT CTTCATCACA CAATTTTTCA CCTCTCTCCT TTGGAGTAGC ATAGCAGATC
GTCATGGTCG TCGTGCTGTG TTAGTAGCCA GTCTTGCCGG AAGCGCTATT GGTGAGTCAG
TAATTGGGAT TCCGCTTGTG TTCTGTGACT TACGCCATGT AGCTCTTGTA ATTTTTGGTA
CTTCTGAATC TGTGAGACTT GGGTCAATGT GTAAATCCAG CATGTAAGCT GACGCTATAA
GTTAGCTTCC CGAGGCTATC TGTGTCAGAC TGATTCAAGG CATTTTCGGA GGTGCTGTCG
GAGTAGTGAG TTTCCAGCTA ACTATTAGGA ATAGCTCTAA CAGCAGGTTG TCCAGTTCCG
AGGTTCGATC CGAGACTTGA CTGACGACAC TAACGCCGCT CGTGCTTATG CTATGCTCGG
TTTCTCATGG GGTTTCGGTG GTGTCATTGG TCCTATCATT GGCGGTGTCT TTGAAAGCCC
AAAGGAGAAC TTCCCCGGTA CTGCTTTGGC TCAAATTCGT AAGATGACTG GTATCCTTAG
TATGTCAAGA TGATGATCGC TAATTTTCAA TGTCTGCCAG CCCTATTCCA AAATTTCCCA
TATATCCTTC CGACTATCAT TGGCGGTGGC GTTCTTGCTG TCGGCGCCAT CCTCTCTTGT
CTTCTTTCTT GGGATGGAGG TGTTCGAGGC GGCAGGCGTA TCACCCTCGC AGTCGAAAAG
AACGAACCCC TTGCCGGTAT TTCTCCTGCT GCTGGTCGTC ACGCTTCCCC TGCGCCTTCC
AGCCGAACCG CAATCAGGGT CCCATCAGTC AGTTTGAGGC GAGTCTTGTC ACCCAGCCAA
GAAGAAGAGG ATGCCGCTCA CGGTGCTGGA TATCCGGAAC TCTCTCGTAG TGCGGGCGGC
CGAAGGGATA GTCGAGCGAG CTTAGGCACG GCTTATGGAT ACGGAGGTAT TCGTTCCAAG
CACCCTACTT TGGCAGCTAG AGCAGCATTG GAGGCTGCTA GGAGGGCTTC AGCTGCAGTA
GGCAGACCTG ACGAGAGTGA TGAAGAGGAC GGCGAGATGG GTAACAGGGC TCTGAAGGTT
GCACAGAGGC TGTTGCTCGC GAATGAGGAG AACACATTTA ACATCAATGA CCTTTGGGTG
TCTGCCGCGG TCGCCCAAGA CACTGCTGTC TTTGATGATG AGGAGGAGAC AGAGGATGAC
CAAGAAGAGG AGGTTAACGA AGCCGTACCC GACACTTCTT TTGCCTCCCC TTCATTACAT
GCACTTTCAC CGTCAACCTA CGATGGCCAT CAAGGTGACC TCTCCTTCCG ACGGTCTAGC
AGAGCACGCC TCACAAGTGT GGGCAATATT TCTCTTCATC GTAACCTTCC CGGCCACCGA
TTATCCGTAT CACATGGCGG CAGGCGATTC AGCACCACTT CAGGACACAT GCCTGCCATC
TTCTCCAACA CTGGTGTCAA GACTCCGCCA GCTGTAGCAG CCGCATATGA AGCAGAATCG
CCCAGACACG AAGCTGACTC ATTCTTCCGC GCAGCATCAC CTAGTCCCGA CCATGGCAGG
GGATCCACCG GTGGCTTAAG CGCTATCGCT GAAGGGCCTG GAAATGCTGT GGACTCTGCG
ACTGCCGGCG TTGCCGCTCA GATCTCTGAG AAAGAGGCCT CTTCGTTCTC ACTTTTGCCT
GTTCAAGTAA TCATCCAGGT GTGTTCAAAT AATGACTTTG TTCGTCACAT AGAACTTATC
AGTCCATTCC GATAGTATGG TCTCCTTGCT TTGCACAACA CTATCCATGA TCAAATATTC
TTGTCATTCT TGGTAACGTA CGAGGCTTAC CTTTTGAAGT TTTGAGTAAT TGCTGATATG
ATCATAGTCC CTACCGCTCT GGTGGTCTTG GGCTTAACCC GGCCCATTTC TCCCTGGTAG
TCGCTCTCAT GTGTCTCTGC CAGCTCGTGT ATCAATTCTA TATTTACCCT CGACTTGGAC
CCCCTCTCGG TCGTTTCACA CATCTTCAAA TGTTCAGGAT TGGATGTGCC CTCTACCTTC
CTGCCTACTT CTCTCTGCCA ATCTTACACA AAATTGCTTC TCCTGACTCT GAAGGCAGTT
TCTTTTTGAT GTTCTGTGGG TACTCCAAGT CACGACGAAG ACATTGGATC TGATTGATTC
ATTGTAGGCT TGGTTACTAT CACTGCTGTG AGGTACTGTG CAGGTACATT CTCGTATACC
TCCGTCATGG TACTTATTGT GAGTCGAGAC CCCCTACATC AAGTACGCCT GTTAACCCAT
CCCTACAGAA TGCCATGTCT CCTCCCCACG TTGTTGGCCT TGCCAATGGA CTCGCCCAAA
GCACCGTATC ATTCTCGCGT TTCTTTGGCC CTGTAATCGG TGGTGCTGTG AGTAATACCT
TCCAGAGCAT TTCGTGGCAA CATACTGATG CCTTGTTTGT AGGTTTGGAG CGCCAGCATC
AATGGTAATC CAAGCGGTTA TCCTTATGGC TTTTACTTCT GCACTGTAGC ATGCTTTATT
CAGTGGTCAT TGTCATTTTT TATCCGTTAA TCAGCCATAC CCGGTGCTTG AGGAAAAGGT
AGTTTATATT ATGGGCGCCA AGCGCGTAGA TCGTATAGTC GGGGGACCTT AATTGGTGAA
CGACAAGAAT TTGTGAATCA TGGCAGACAT CATTAATCAC TTTTATGACC TTTTTTTTTG
ACAGTAACGA ATATATGTGG ACTCTGCTCG CTAGAATGCG ATGGCGAGCA TATGTAGCAC
ACTGATTTTT CTGATGATTT GAATCCGACG ACCAAAATAT A
 
Protein sequence
MTATPPGRIA LPPKPTITDI LGGSSALASD TPSPQAHQSF NSPHSSAVNG HSRGTIPRSK 
ARTKSARLSF SSYGRGRAAS SRGPAGGIVD DLLDDEDFGP KLPTLVPGIR AAYATPLPAL
PMIVMCIAML SELLAANLCT PFILKQVEGF FLNSGREKNS ETEAAVGLWT GNLVSVFFIT
QFFTSLLWSS IADRHGRRAV LVASLAGSAI ALVIFGTSES LPEAICVRLI QGIFGGAVGV
FRGSIRDLTD DTNAARAYAM LGFSWGFGGV IGPIIGGVFE SPKENFPGTA LAQIRKMTGI
LTLFQNFPYI LPTIIGGGVL AVGAILSCLL SWDGGVRGGR RITLAVEKNE PLAGISPAAG
RHASPAPSSR TAIRVPSVSL RRVLSPSQEE EDAAHGAGYP ELSRSAGGRR DSRASLGTAY
GYGGIRSKHP TLAARAALEA ARRASAAVGR PDESDEEDGE MGNRALKVAQ RLLLANEENT
FNINDLWVSA AVAQDTAVFD DEEETEDDQE EEVNEAVPDT SFASPSLHAL SPSTYDGHQG
DLSFRRSSRA RLTSVGNISL HRNLPGHRLS VSHGGRRFST TSGHMPAIFS NTGVKTPPAV
AAAYEAESPR HEADSFFRAA SPSPDHGRGS TGGLSAIAEG PGNAVDSATA GVAAQISEKE
ASSFSLLPVQ VIIQYGLLAL HNTIHDQIFL SFLVTPYRSG GLGLNPAHFS LVVALMCLCQ
LVYQFYIYPR LGPPLGRFTH LQMFRIGCAL YLPAYFSLPI LHKIASPDSE GSFFLMFCLV
TITAVRYCAG TFSYTSVMVL INAMSPPHVV GLANGLAQST VSFSRFFGPV IGGAVWSASI
NGNPSGYPYG FYFCTVACFI QWSLSFFIR