Gene CNG00040 details


Gene Information

Locus tag: CNG00040
Symbol:
ID: 3258589
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006692
Strand:
Start bp: 7894
End bp: 9848
Gene Length: 1955 bp
Protein Length: 502 aa
Translation table:
GC content: 48%
IMG OID: 638257617
Product: metabolite transporter, putative
Protein accession: XP_571721
Protein GI: 58269130
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
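
The length fields in this table can be cross-checked against the coordinates with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive (the record does not state this, but the reported 1955 bp is consistent with it) and that the coding region ends with a single stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


length = gene_length(7894, 9848)   # Start bp and End bp from the record
cds = coding_length(502)           # Protein Length from the record
non_coding = length - cds          # bases in the gene span not used by codons
print(length, cds, non_coding)     # prints: 1955 1509 446
```

Under these assumptions, 446 bp of the gene span are not accounted for by codons, which is consistent with the record marking the gene as spliced.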


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.160358
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTACTC TTTCCTATGA CGACAAAATG GGATACCTCC CCGAGCAGGA CATCGATACT 
ACCATTCACG CAGCACCACT CGCTGAGCTC GAGACGCCAA AGAAACGAAA ATGGATGCAG
ACCTTGTCTG TACTCATTGC AGGTGTAGCC CTTTTCTCGG ATGGTTATAA CATCCAGATT
ACCGGTACCG TATACTTTTT ACACGTAATT TCACGTCAAA GCTGATGAAG GACTAGGTTA
TACGAACACC GTCATGGCAA AGCTCTATCC CACCGCCCTT AACTCAACTA TGAAGACTCG
TCTAAGCAAT TCCATCCTTA TTGGGGATAT CTTTGGAGTA AGTTCCAATT TTCTGTTTAA
TCTTCTCCGT CTGACAATCC TTACAGATGA TCCTCTTCGG TCTCTGCGCC GACCGTCTTG
GTCGTCGATG GGGTATTATC GGTTGCACCT TCTTCCTTGT TCTCGGCGTA ACCCTTGCAA
CTGCAGCGCA TGGGAAGAGT GAAGTTGGAA TGCTATGGAT GATCGTGATT GGGCGCGGCG
TTGCTGGCCT AGGTGCTGGC GGTGAGTACG CAGTATGTAC GACGAGCGCT GTGGAAGCTG
CAGATGAGAC CCATACGTAT GTTACACATC TCTAGTACAT CGTCTCTTGT ATGTGTCGCG
CTTATTATCA ATAGGTTGAG GAAGAAGAGA GGACTCCTCG TAGCATCAAG CACCAATACA
GCCATCATCG CCGGCTTCGT TGCCTCTGCT ATCGTGTTCC TCATCGTCCT CGCCGCGTAC
GGAGGTGAAC CTCACGTCGG AGTCTGGCGT ATATGCTTCG GTATCGGTAT CATTGTAAGT
TCGCTTATTC ATGGAGCGTG ACTAACCGTA AAGATGCCAT TGTCGATCTT TCTCTTCCGT
GTGCGTATGG CAGACTCAAC TCTCTACTCA AAACATTCCA TTAAAAGTCC AAAATTTCCT
TACTGGCTTG CGTTCAAACG TTATTGGAAA CCTTTGCTTG GGTAAGCCCA CCCTGTGCTA
AGTTCTCTGA CTAGTTTAGC TGATCACAAG AATAGATGTT CTCTTGTCTG GTTTCTCTAT
GACGCTGTTG TATATCCATT CAACCTCCTC GCACCTACCA TCACTGCCGG ATTCAGCAGT
AATCAATCAC TGTTAGCTTC AAACGGATGG GGAGCGCTCG TCAATGCGTT TGCCCTTCCT
GGCGCGTTCT TTGGAGGTGA GTCTGCTTTT TCATACTGGT AAACATTCCA TATGTACCAA
ACTAACGCGG TTACACGCGT AGGTTTCATC ATTGACCGTC TCGGACCTAG GCAGACATAC
GCTTTTGGAC TAGTAATGGT CGCAATTTTT GGTTTCGTTA TTGGCGGTAT GTCCTGAGAG
CGTCTCAAAT CAAGTTTTAA CAAGCGCACT TACGCCGGTC TCTAGGTGCT ATGGAACCCC
TTCGTAACAA CGGCACAGGC GCTTTCGCCG CTTTCGTCAT CCTCTTCGGC CTTTTCCAAT
GTTTTCTCTC CGTCGGACCG GGCAACTGCA ACTTCATAGT CTCTTCCGAA TCATTCCCCA
CCCCTGTTAG GTGCGTATGA CGTCTCTTTT AATTCTCTAT CGGATATCAG ATAGTGAAAT
CCTAACATGA TACCTGCAGG GGTCATGCTC TTGGTTTAGC AGCAGCTATC GGTAAAGCAG
GCGCCGCTGT GGGTACACAA GTTTTCCCCC TTATCGAAGC CCGTTTTGCC ACGACTCTCA
AAGGGCAACA AGCCATTTTT CTCATCGGTT CAGGCATTTG TGTTGTCGCA GCGATAGTAG
TCATGACAAT TGTACCGAAC AGAAGGGCGC AGTTGGAGGA TGAGGATGTA GAGTTCAGGA
GGTACTTGGA GGAAAACGGT TGGGATACGT CAGATATGGG CTCTGTGGGA AGAATCGTCT
CTGACGAGGT AGTTGAACGT GGTGAGAAGG TCTGA
 
Protein sequence
MSTLSYDDKM GYLPEQDIDT TIHAAPLAEL ETPKKRKWMQ TLSVLIAGVA LFSDGYNIQI 
TGYTNTVMAK LYPTALNSTM KTRLSNSILI GDIFGMILFG LCADRLGRRW GIIGCTFFLV
LGVTLATAAH GKSEVGMLWM IVIGRGVAGL GAGGEYAVCT TSAVEAADET HTLRKKRGLL
VASSTNTAII AGFVASAIVF LIVLAAYGGE PHVGVWRICF GIGIIMPLSI FLFRVRMADS
TLYSKHSIKS PKFPYWLAFK RYWKPLLGCS LVWFLYDAVV YPFNLLAPTI TAGFSSNQSL
LASNGWGALV NAFALPGAFF GGFIIDRLGP RQTYAFGLVM VAIFGFVIGG AMEPLRNNGT
GAFAAFVILF GLFQCFLSVG PGNCNFIVSS ESFPTPVRGH ALGLAAAIGK AGAAVGTQVF
PLIEARFATT LKGQQAIFLI GSGICVVAAI VVMTIVPNRR AQLEDEDVEF RRYLEENGWD
TSDMGSVGRI VSDEVVERGE KV
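
Both sequences are listed in space-separated blocks of ten characters, so they need to be joined before lengths or base composition can be checked. The following is a minimal Python sketch, not part of the original record. The helper names are hypothetical, and GC content is assumed to mean the percentage of G and C bases over the whole gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Join a blocked sequence listing into one uppercase string without whitespace."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string; 0.0 for an empty string."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence listing, copied from the record.
first_line = "ATGTCTACTC TTTCCTATGA CGACAAAATG GGATACCTCC CCGAGCAGGA CATCGATACT"
dna = clean_sequence(first_line)
print(len(dna), round(gc_percent(dna), 1))
```

Applied to the full gene sequence listing, the same two functions give a length and GC percentage that can be compared with the 1955 bp and 48% reported in the Gene Information table.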