Gene CNC04150 details

Gene Information

Locus tag: CNC04150
Symbol:
ID: 3256563
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 1269226
End bp: 1271650
Gene Length: 2425 bp
Protein Length: 554 aa
Translation table:
GC content: 48%
IMG OID: 638255636
Product: family II 2-keto-3-deoxy-D-arabino-heptulosonate aldolase, putative
Protein accession: XP_569659
Protein GI: 58265006
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID:
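
The Gene Length field agrees with the Start bp and End bp fields if both coordinates are counted as part of the gene. A minimal sketch of that arithmetic, assuming 1-based inclusive positions (the function name `span_length` is illustrative and not part of any database tool):

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(1269226, 1271650)
print(gene_length)  # 2425
```

The `+ 1` is what distinguishes an inclusive span from a half-open one; without it the result would be one base short of the listed length.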


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ACTGACGTTC TCCAATGGCA CCAGCACCTG CACCTTGGCA CCCATCTTCC TGGAGGGAAA 
AACCCATCGC TCAGGTTAGT GCCCAGTTAC AACGACTAAC CTGTGGGCCC TATGTACACT
GAACATATTC TTCCTCCTTC AGGATGTCGT CTATGAGGAC AAGGACCAGC TGGAAACCGT
TCTCAACAAG CTTCGTCGCT TACCACCTCT AGTTTCTCCT GTGGAGGTTA GTTGGTGTTT
GGGGAAACTT CATCCCTCTT ATGCTGACTT GCGAGACGTC GGGGTGACTT TAGATCGATA
GGCTCCGTAC TCAACTGGCC GATGTGGCGG CTGGCAAGGC ATTCTTGCTT CAAGGCGGAG
ACTGTGCCGA GCTTTTTGAC GACTGTTCTC AAGTGAGAAT AATTTTTTAT TTTTTTGCTT
GGATAAACTG ACACAGTTTC GCTATTTCAG GACCCCATCG AGCACAAGCT GTCCCTCATC
CTGCTAATGT CTCTCATCAT CCTTCACGGA TCACGTTTAC CTGTCGTAAG GATTGCACGC
ATTGCAGGCC AGTACGCCAA GCCGAGGAGT AAACCTACAG AGATTGTCGA GTTTCCCACC
AAAGACGGAA AGACGGAGAA GAAAGAGGTG CTGAGTTTCA GAGGAGACAA TGTCAATGGT
TATGATCCGA CCGATAGAGC CCCCGATCCT CAAAGGCTTT TAGGGTAAGT CGCCGCGCAC
TTACGTACCG CCCTACTGAC CATGGTGCTG TTGGCAGATC TTACTTTCAT TCCACCGCCA
CGCTCAACTA CATTCGTACA CTTCTTTCAT CGGGTTTCGC CAATTTACAT AATCCTGTCG
ACTGGTCGTT TTCCCATGTC CGATCGCCAG AGCTTCAGCA AGCTTTCTCC TCGGTCATTG
AGAGTTTGCA AGATAGTTTA GAGTTTATGA AGGTGGCCAC TGGCGCTGTT GGAGGTGGTG
AAAGGGGTGG TATGGAGACT GTTGATTTCT ACACCAGGTA AATCGCAGCG CGCTCGCTCC
TTCTTTTCAG TGTACACATG TTGACTAAGA AAAACCATAG CCACGAAGCT TTGCTGTTGG
AGTATGAAGA GGCTTTCACT CGCTCTTGGG ACTCTACGAC CTTGTCTCCC CCAGCGACGG
GTGACTCTAC CCCGATCCTC TCTCGATCAG CATCTCGTAT CCGTGAAGCA TCATCATCAT
CTTCTTACCC ACACTCTCCC GCTCGTCGTC CCAAATCGCC CAAGAATTTG AGCGACTCGA
TCAACAGTCT GTCCATGTCT GTTGGTGATT TAGGGAAAGG GGAGGCGAAG AAGTGGTACA
ATACCTCTGC TCATTTCATC TGGATCGGGG ATAGGACGAG ACAGTTGGAT GGGGCGCATG
TGGAGTACTT TAGGGGTATC GCAAACCCCA TGTAAGTGAT TTTGGTTCCA TCTTTGTTGT
TTTTGCGCCC GTTGCTCATT ATGGAAAAAG AGGTATCAAG ATTGGCCCTT CCATGGAGCC
CGAGGAAATC GTTCGCGTCC TTGACAGCAA GTCCCAGTTT ATGGAAGCGT TTAACCTTTT
TGCTAATCAA CTGGTGTCAA TTTTAAAATA GTTGTTAACC CCGACAAGAT CCCGGGCAAG
GTGACGCTTA TCGGGCGATA CGGTGCCGCC AAAGTCGATC AGTTCTTGCC AAAGCACATT
GATGCCGTGT TGAGGACCGA TCATCCAGTG GTCTGGCAGT GTGATGCCAT GCATGGCAAG
TGAGTAAAAG GACCTTTTTT TCGAGAAACG CAAATCTTGG CAACTAAACT TTTTTTTCAT
AGCACCAAAT CCTCTGTCCA TGACCCTACC CTCAAAACTC GACATTTTGT AGACGTCATT
ACCGAAATTA CTCGAAGCAT GGAAATCCAT AAAGAAAAGA ATACTATTCT TGGTGGAGTG
CATCTTGAAT TGACAGGAGA GGTCAATGAC GATGGGTATT CTGTTGTATG TCCCCCCATC
CATTTTTTTT AATGCGGGAC ACCAGGCTAA ATAGTCGTGT GAAAAATAGA CCGAGTGTAT
CGGTGGTTCG ATGGAGTTGG AGGATAAAGA CCTCTCGTTC AATTACAGAA CGCATTGCGA
CCCGCGTTTG AATTACGAGC AGTCGCTAGG TCCGGTTTTT TTTTTGCGTG TCAAGCTTAT
CGGCTGACCA TTTGCGACCA CAGACGTCGC GTTTTTGCTC GCCGACTACC TCAAGTCAAA
GAGAAGAGGC GAAAGACCGC ATGATATTTT GCTTGCAAGC TTGCGTGGTC GGAAAAATGA
CGTTGAAAAA TAAGTGGGCG TTGATGCCAT CATGCCGATT CGCCTTTTGC GCTTCTCATG
TCAGAACTCG GTTCAGCAGC AGCAGCTTTG GATAAGTGCA TGTCACATCT CATGGTCAGT
ATAGTCTATG CATATCATGG TCGGC
 
Protein sequence
MAPAPAPWHP SSWREKPIAQ DVVYEDKDQL ETVLNKLRRL PPLVSPVEID RLRTQLADVA 
AGKAFLLQGG DCAELFDDCS QDPIEHKLSL ILLMSLIILH GSRLPVVRIA RIAGQYAKPR
SKPTEIVEFP TKDGKTEKKE VLSFRGDNVN GYDPTDRAPD PQRLLGSYFH STATLNYIRT
LLSSGFANLH NPVDWSFSHV RSPELQQAFS SVIESLQDSL EFMKVATGAV GGGERGGMET
VDFYTSHEAL LLEYEEAFTR SWDSTTLSPP ATGDSTPILS RSASRIREAS SSSSYPHSPA
RRPKSPKNLS DSINSLSMSV GDLGKGEAKK WYNTSAHFIW IGDRTRQLDG AHVEYFRGIA
NPIGIKIGPS MEPEEIVRVL DIVNPDKIPG KVTLIGRYGA AKVDQFLPKH IDAVLRTDHP
VVWQCDAMHG NTKSSVHDPT LKTRHFVDVI TEITRSMEIH KEKNTILGGV HLELTGEVND
DGYSVTECIG GSMELEDKDL SFNYRTHCDP RLNYEQSLDV AFLLADYLKS KRRGERPHDI
LLASLRGRKN DVEK
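
Both sequences are printed in blocks of 10 characters, up to 60 per line. The sketch below joins such blocks back into one string so that its length and GC percentage can be compared with the Gene Length, Protein Length and GC content fields. It assumes that whitespace is layout only, and the helper names `join_blocks` and `gc_percent` are hypothetical:

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Final line of each sequence in the record, used here as short inputs.
gene_tail = join_blocks("ATAGTCTATG CATATCATGG TCGGC")
protein_tail = join_blocks("LLASLRGRKN DVEK")
print(len(gene_tail), len(protein_tail))  # 25 14
```

Passing the complete gene and protein text through `join_blocks` should give strings of 2425 and 554 characters if the record is internally consistent. `gc_percent` on the full gene sequence can be compared with the listed 48%, allowing for rounding.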