Gene CNH02650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH02650 
Symbol 
ID3259105 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp373317 
End bp374905 
Gene Length1589 bp 
Protein Length421 aa 
Translation table 
GC content51% 
IMG OID638258220 
Productchorismate synthase, putative 
Protein accessionXP_572438 
Protein GI58270564 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATCCAGCGAA TTCACTCCTC ATTTTTTCGT TCACCGTCTA TATTCTCACT CAAAAGACAA 
TGTCTAGCTT CGGAACTCTT TACCGAGTAC ACACTTACGG CGAAAGCCAT TGCAAGAGTG
TCGGCTGTAT CGTTGACGGC GTCCCTCCTG TAAGCTCCTC AGTAAATTGT CAATCGATCG
GCTACTGATT TATTATTCCA GGGACTTCAA TTGACAGAAG CCGACATCCA AACGCAATTG
TCTAGGAGAA GACCTGGACA GAGCGATATT ACTACTGCTG TTTGTTTCAT CCTCCTTGAA
AATATTTCTG AAAGCGTACA CTAACCGCGA ATTGCAGCGA TCCGAGTTCG ACACTGTTCA
CGTTCAGTCT GGTACCGAGC ACGGCGTTAC TCTTGGTACT CCTATCGGCC TTCTTGTCCA
CAACAAGGAC CAGCGACCTC ATGACTACGC CGAAACTGAC CTTTACCCCC GGCCTTCCCA
CGCCGACTAC ACCTACCTTG CCAAGTACGG TGTCAAGGCC TCTTCTGGCG GTGGTCGTGC
GTCCGCGCGT GAGACTATCG GACGAGTTGC GGCTGGTGCG ATTGCGGAGA AGTACTTGAA
GGAGGCTTTT GGCGTCGAGA TTGTTGCCTT TGTCGCCAGT GTTGGCAAGA CCGCTTTGCC
TTTCGCCGAC GAAGAGGATG AGGTTTTGGG CAAAGCGTAC ATGGACCTTG TGCAGACTGT
TACTAGAGAG GAAGTTGACA AGGAGATCAC CCGATGCCCT CACAAAGAGA CTAGCAAAAA
GATGGAGGAG ACTATTCGTG CTGCCAAGGC CAAGGATGAT TCCTTGGGTG GTTCCATCAC
CTGTGTTATC CGTCGATGCC CCCTCGGTCT GGGTGAGCCC TGTTTCGACA AGCTGGAGGC
TGTCCTTGCC CACGCCATGC TTTCTATCCC CTCTACCAAG TCTTTCGAAA TCGGCTCGGG
TCTTCGTGGA TCCACTTTCC CTGGTAGTCT TCACAACGAT CCTTTCGTGG AGGGCGTCGA
TGAAAAGACC GGGGGAAGAA GGTTGAGGAC TGTGACCAAC TGGAGTGGTG GTGTTCAAGG
AGGTATTTCG AATGGCGAGG ATATCTACTT CAGGCAAGTT CAAAGCTCCG ACCTCAGTGT
GACATGGCCT AATGCATATG TTTAGAATCG GTTTCAAGCC TCCTGCGACC ATCGCTCAAG
AACAGTCTAC TGCCAGGTAC GATGGTTCCG AAGGCGTGTT GGCTGCCAAG GGCCGACACG
ACCCCTGTGT CGTTCCTCGT GCTGTTCCTA TCGTGGAGAC CATGGCTGCT CTGGTCATCA
TGGAGTAAGT ATCTATTTCA TCTTTCTTAT CATGGATCGG TATTGATTCT GTCTGTGTTC
TAGCATGGTC CTCCAACAAA ATGCCCGATT GACTGCCGCC TCTCTCCTTC CCGACCTCAC
CCACTTGCCC CCTACAATGG TGCTTCCAGG CAAGAGTACC GTTCAGAAGA TCAGGGAGGG
CCAGAGCGTT GGAGAAGTGC AAAGTCAGAA GGTTGGTGAG GAGTAGAGAA AATACTTTGA
TTTCATAGAG ATGAATTGAG ATTAATTGT
 
Protein sequence
MSSFGTLYRV HTYGESHCKS VGCIVDGVPP GLQLTEADIQ TQLSRRRPGQ SDITTARSEF 
DTVHVQSGTE HGVTLGTPIG LLVHNKDQRP HDYAETDLYP RPSHADYTYL AKYGVKASSG
GGRASARETI GRVAAGAIAE KYLKEAFGVE IVAFVASVGK TALPFADEED EVLGKAYMDL
VQTVTREEVD KEITRCPHKE TSKKMEETIR AAKAKDDSLG GSITCVIRRC PLGLGEPCFD
KLEAVLAHAM LSIPSTKSFE IGSGLRGSTF PGSLHNDPFV EGVDEKTGGR RLRTVTNWSG
GVQGGISNGE DIYFRIGFKP PATIAQEQST ARYDGSEGVL AAKGRHDPCV VPRAVPIVET
MAALVIMDMV LQQNARLTAA SLLPDLTHLP PTMVLPGKST VQKIREGQSV GEVQSQKVGE
E