Gene CNC04040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC04040 
Symbol 
ID3256788 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1249911 
End bp1252367 
Gene Length2457 bp 
Protein Length735 aa 
Translation table 
GC content50% 
IMG OID638255625 
Productfolic acid and derivative biosynthesis-related protein, putative 
Protein accessionXP_570005 
Protein GI58265698 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0294] Dihydropteroate synthase and related enzymes
[COG0801] 7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase 
TIGRFAM ID[TIGR00525] dihydroneopterin aldolase
[TIGR01496] dihydropteroate synthase
[TIGR01498] 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATAACTCTT CAAGCATGCC CCCGGATACG ATAACCATCT CATCTCTCAC ACTTCACCTG 
CCACACGGCC TAGGTCCGTC CGCCTTTCAC CTCACCCCGT CCCCTCCCTG TCCAGCCCTT
CTCTCCCTCA CAATTCACCT CGTCCCCAAC TCTGTCTCTG CCACCGCCGC AGGGGACTCC
ATGGCAGGCC TTGGAGTAAA CTACTCTTCA GTATCCAAAG CGGTATATGC CCTTGCGAAT
GATGCCGAAA AGGTATGGAG CGGACCATGG GAGCTGATGC GCGCTGTCAG TGCCATCCCG
CTAGGGTTTG ATGATGTCCA GAGCGTAGAT ATCCGTCTAG GCCTTCCAAA GGCACTGCTA
CATGCTCTAG AAGCTGTGTA CAAGGCGTCC TATGCCAAAA ATGGCGAAGA AACCGGCAGG
AGCTGTACAA TCCGGGACTT GAAGGTAGTG TGTATTGTGG GACTACATGA GCATGAGAGA
AAAGAGAAGC AGAGGCTGGA GTTGGATGTC AAGGTCAGAG GAGGCGACTG GAATGTTTGG
GGACACAAGG GATTTGCAGA TGAAATATAC GACGTAAGTT GTGCTAGCAG CACTTGTGCT
CAATTATCTT CCATCACTAA CAAAACGTGG TGACAGTTTG TGAGCAACTC GGCTTACGGA
ACTATCGAAT CGCTGAACCA CGAACTGGGA CAACATCTTC TGAAGAGTCG CTATCTGGGG
GATCCCACTC AATCTCATCT AGAAATTACA ATCAGGAAAC CATCGGCAAT ACCTTTCGCT
ACTCCAAGCA TCACGATTCA TCGCTCACAG GCCGACTATG CGCCAAGTCT ACCATCTGGC
ACCGCTTGCA ATGCGCCACG CGAAGCCCTT GAGAGGGTAT TCGTGGCTGT CGGCTCCAAT
ATCGGTGACA GAGTAGCCAA CATCACAAGA GCCGTAAGCC TTTTAGAGGA GGCTGGATGT
AAATTGCTAG GTACTAGTAG ACTGTACGAG AGTGAACCGA TGTATGTGGA AGATCAGGAC
CGGTTCATCA ATGGTGTCAT TGAGGTAAGT TACTGCGGAC GTATCTGTGA AGATATCTGT
TTAACTTCCC CGCAGCTAGC TACACCACTT GAGCCCTTGG AAGTTCTTCG ATTGCTCAAA
CACATAGAGA AAGCTGTCGG GCGTACAAAG ACATTCACGA ATGGTCCTCG AGTCATCGAT
CTAGATTTGG TCTTTTATGG TCACCGAGTG GTTAAAATTG GCAACGAAAC TGATGAGGAA
GATGAACATG GGATCAAATG GTTAGAATGC CCTCACAAAA GACTACGTGA GCGAGAGTTT
GTTTTGAGGC CGCTGGCCGA GTACGTTTAA AACTGCTTTT TTGAAGGTGG AAGTTCCAGG
GACTGACCAT CTCCAAAGCA TTGATCCAGA GTTCACGCAC CCTGCGCTTC GCCGATCCGT
CGGCCAGCTT CTCGCCAGTC TTCCAACCAC GTTTCCGCCA TCTCTTTCAC CTATAATCCC
CTTACACTCT CACTCTTCTC CCCTTAGGCT TTCCATCCCT GCCTCCCCAT ACATCATGGC
CATCTTCAAT ACAACACCAG ATTCATTTTC TGATGGCGAT CCCGCCAGGA CGAATGTCGA
ATATGCGCTC GCCGCTTGCG AAAAGCTGTT AAAAGGACCT GAGCCTCCGG CTATTCTCGA
CATTGGGGGA ATGTCCACGC GCCCAGGTTC TGAACCCTGT TCAGAGGAAG ACGAGCTCAG
TCGCGTTATC CCACTTGTCA AAGCCATCCG ATCATCAAGT AATCCCATCG TTGCATCGAT
TCCGATATCA GTTGACACTT ATCGACCGTC AGTTGCCAAG GCTGCTGTTG AAGCTGGTGC
ATCCATCATT AACGACGTCC GTGGCGGACA AGAGCCAGGA ATGTTGCGGG TCATGGCAGA
AGCAGACGTT CCTGTTGTTC TGATGCATTC TAGGGGTGAT TCAAAAACCA TGATTGCAGC
GGATGTCCAG GATTATGATA GCCACGGAGG TGTCATCAAG GGTGTGATAC AGGAGATGAG
GGTCTTGGTA GAGCAGGCCT TGAAGTCTGG CGTCAAACGA TGGAATATCA TCCTCGATCC
TGGCTTGGGC TTTGCCAAGT CTTCCTCTCA AAGCTTAACA TTGCTCAAAC ACCTGCCTGA
TCTTGTCCTT CCAGGGACCG GACTGGAGAG ACTTCCGATG TTAGTAGGTG CCAGCAGAAA
GGGATTTGTA GGTCAGACCA TAAAACGGGC TGTACCAAAA GAGAGGAGTT TTGGAGATGC
GGCTGTCAGC GGTTGGTGCG CTTCCAGTGG AGTTGTGGAC ATCTTAAGAG TCCATGAGCC
TAGAGAGATG GCTGAAGTTG TAAAAATGGC ATGTGCTATT AGGGATGCAG AGTAGGAGGT
TCTGCTAAAG ATATATGGGC AATAATGCAT GGGAATGACA GGAATGACTC GGATGAG
 
Protein sequence
MPPDTITISS LTLHLPHGLG PSAFHLTPSP PCPALLSLTI HLVPNSVSAT AAGDSMAGLG 
VNYSSVSKAV YALANDAEKV WSGPWELMRA VSAIPLGFDD VQSVDIRLGL PKALLHALEA
VYKASYAKNG EETGRSCTIR DLKVVCIVGL HEHERKEKQR LELDVKVRGG DWNVWGHKGF
ADEIYDFVSN SAYGTIESLN HELGQHLLKS RYLGDPTQSH LEITIRKPSA IPFATPSITI
HRSQADYAPS LPSGTACNAP REALERVFVA VGSNIGDRVA NITRAVSLLE EAGCKLLGTS
RLYESEPMYV EDQDRFINGV IELATPLEPL EVLRLLKHIE KAVGRTKTFT NGPRVIDLDL
VFYGHRVVKI GNETDEEDEH GIKWLECPHK RLREREFVLR PLADIDPEFT HPALRRSVGQ
LLASLPTTFP PSLSPIIPLH SHSSPLRLSI PASPYIMAIF NTTPDSFSDG DPARTNVEYA
LAACEKLLKG PEPPAILDIG GMSTRPGSEP CSEEDELSRV IPLVKAIRSS SNPIVASIPI
SVDTYRPSVA KAAVEAGASI INDVRGGQEP GMLRVMAEAD VPVVLMHSRG DSKTMIAADV
QDYDSHGGVI KGVIQEMRVL VEQALKSGVK RWNIILDPGL GFAKSSSQSL TLLKHLPDLV
LPGTGLERLP MLVGASRKGF VGQTIKRAVP KERSFGDAAV SGWCASSGVV DILRVHEPRE
MAEVVKMACA IRDAE