Gene CNC04840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC04840 
Symbol 
ID3256575 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1466442 
End bp1468405 
Gene Length1964 bp 
Protein Length497 aa 
Translation table 
GC content48% 
IMG OID638255703 
Producttetrahydrofolylpolyglutamate synthase, putative 
Protein accessionXP_569768 
Protein GI58265224 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAAATCTTCA TCACACAAAC GGCCGTCCCA CTTCCCATCT CCACCATTTA TGCTCCCGAC 
CTTAAGACTG TTCAGGATGG CCAGCACAAG AACATATGCT GTAAGTTGAT TCCAATATCA
TTACTAAGTC AGGTTGTTCT GATTAGTTAT GTAGGAGGCA GTTTCTCTCC TCAATACCTG
CCAATCCAAT GCTGCGACGT AAGTCGCAGA GCAGTTGAGT GTGTGATCTC TCCAGAACCT
TGATTGATGT CGGAATAGCA TCGAGGCATT CAGAAAGTCA GGCGGACGCT TAACTGAGTA
TGCTGTCGCC GAGATGCATG ATTACCTCCG ACGTATCGGT TACAAGGTGA GGAGGCATTT
ACTTCCCTGG CACGGTGTTC ATGCAGAGTG AAAGGGCCTG ACTAGTAATA GCCAGAAGAT
CTAAATGCGC TTAACGTCGT GCATATCACT GGCACCAAGG GCAAAGGCTC CACCTCTGCC
TTCACTGAAC GAATCCTTCG TGCCCATATG CCAGGCAAGA AGGTCGGGCT CTACACTTCC
CCTCATCTAT GTGCGGTGAG GGAAAGAATC AGAATCAACG GGGAACCTAT CTCGGAGACC
GAATTTGCAA AGTACTTCTT TGAAGTATGG GACCGGCTCG AGGCGGATTC AAAAGTGAGT
GTTTGTGTTT CGACAGCTAT CAGGATTTGA CGAATAGTCT GATTGTCTCT CAGCCATTGA
CACCTCAAAC GCCTAAATTC CCCGTCTACT TCCGCTTGCT CACACTTCTT GCATTCCATG
CCTTCCTTTC TCAAGGAGTA TCGGCTACAG TATTAGAGGT AGGTATTGGC GGTCTTTACG
ATTCGACGAA TATCGTCCCC AAGCCTGTCG TGACTGGAAT CACGTCTTTG GGTCTTGATC
ACACCGCTGT CTTGGGCAAC ACGATCGAAG AAATTGCTCG GAACAAGGCC GGTATTTATA
AGAAAGGTGT ACCTGCTTTG AGTGTCGTAC AAGAGAAAGG GGGAGATGTT TTGAAGGAAG
TGGCTGAAAT GAACGAGGTG TGTCCAAATC ATTCACAAAG CAGCTCTAAA TCTTAGCTTA
TATAAATATA GGCTCCTTTT GAGATTGTTC CAACCATTCC TCCGACTCCC TTGGGTCTGC
CAGGAAGTCA CCAGCTCATT AACGCCTCTC TCGCCGTTTC ATTGTCTTCT CATTTCCTCT
CTTCACAAGG GTATAAATTT TCTCTTGCCA CACCCCCAGC CGTCGTCCCT CCATCATTCG
TCCAGCCTCT CGCCTCTGCT CGCTGGCCTG GACGATGTCA GCTTGTCAAA AAAGGTGAAA
TTACTTGGTT GCTTGATGGT GCCCATACCG TCGAATCATT GAGATCTTGT GGCGAGTGGG
CGTGGGACGC GGAAAAAGAG GATAGAATGC CCCAGGTGTT GATTTTCAAC TGCAGTGGTG
GCAGAGCTGC TGAGAGCTTG TTGGGAGAAC TTCTGGAGTC GGGTGCTAGA ACAAGGAAGA
CTAGCCGGGA TGAAATTGCG AGCAAATTTG ATTCTGTGAT TTTCTGCACA AATGTTACTT
ATATTGATGG TCACTTCAAA TCCGGTGAGT TTTTTCGGAC CATCCGGTGA CAAATTTGAG
CTGACTATGC CTCAACTTAG ATCTTGACGC CAAGGCTATC GATCCTAATG ATCTTTCCCA
GCTTGCCACT CAGAACGCTC TGCGTGATGC TTGGCTTCGC CTCAATCCGT CATTTGCTGC
CGACAGAGTC CACGCTGTCG CGTCTATTCA GCACGCTATT AGAATTGTTG AAAATCTTGG
AGAGAAGACG GTTCTGGTTG CTGGTAGTTT GCATCTGGTT GGAGGTGTTA TGGAAGTTGC
CGGGCTGCAG AATGCGCTAA GTATGGAATA ATATGAAGTT GGAGAGACTA TTGTACATTT
CAACAATGTA ACAGACCGAC CTTTTTGGGA TTATAGATGT ACTA
 
Protein sequence
MLPTLRLFRM ASTRTYAEAV SLLNTCQSNA ATIEAFRKSG GRLTEYAVAE MHDYLRRIGY 
KPEDLNALNV VHITGTKGKG STSAFTERIL RAHMPGKKVG LYTSPHLCAV RERIRINGEP
ISETEFAKYF FEVWDRLEAD SKPLTPQTPK FPVYFRLLTL LAFHAFLSQG VSATVLEVGI
GGLYDSTNIV PKPVVTGITS LGLDHTAVLG NTIEEIARNK AGIYKKGVPA LSVVQEKGGD
VLKEVAEMNE APFEIVPTIP PTPLGLPGSH QLINASLAVS LSSHFLSSQG YKFSLATPPA
VVPPSFVQPL ASARWPGRCQ LVKKGEITWL LDGAHTVESL RSCGEWAWDA EKEDRMPQVL
IFNCSGGRAA ESLLGELLES GARTRKTSRD EIASKFDSVI FCTNVTYIDG HFKSDLDAKA
IDPNDLSQLA TQNALRDAWL RLNPSFAADR VHAVASIQHA IRIVENLGEK TVLVAGSLHL
VGGVMEVAGL QNALSME