Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH02650 |
Symbol | |
ID | 3259105 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 373317 |
End bp | 374905 |
Gene Length | 1589 bp |
Protein Length | 421 aa |
Translation table | |
GC content | 51% |
IMG OID | 638258220 |
Product | chorismate synthase, putative |
Protein accession | XP_572438 |
Protein GI | 58270564 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATCCAGCGAA TTCACTCCTC ATTTTTTCGT TCACCGTCTA TATTCTCACT CAAAAGACAA TGTCTAGCTT CGGAACTCTT TACCGAGTAC ACACTTACGG CGAAAGCCAT TGCAAGAGTG TCGGCTGTAT CGTTGACGGC GTCCCTCCTG TAAGCTCCTC AGTAAATTGT CAATCGATCG GCTACTGATT TATTATTCCA GGGACTTCAA TTGACAGAAG CCGACATCCA AACGCAATTG TCTAGGAGAA GACCTGGACA GAGCGATATT ACTACTGCTG TTTGTTTCAT CCTCCTTGAA AATATTTCTG AAAGCGTACA CTAACCGCGA ATTGCAGCGA TCCGAGTTCG ACACTGTTCA CGTTCAGTCT GGTACCGAGC ACGGCGTTAC TCTTGGTACT CCTATCGGCC TTCTTGTCCA CAACAAGGAC CAGCGACCTC ATGACTACGC CGAAACTGAC CTTTACCCCC GGCCTTCCCA CGCCGACTAC ACCTACCTTG CCAAGTACGG TGTCAAGGCC TCTTCTGGCG GTGGTCGTGC GTCCGCGCGT GAGACTATCG GACGAGTTGC GGCTGGTGCG ATTGCGGAGA AGTACTTGAA GGAGGCTTTT GGCGTCGAGA TTGTTGCCTT TGTCGCCAGT GTTGGCAAGA CCGCTTTGCC TTTCGCCGAC GAAGAGGATG AGGTTTTGGG CAAAGCGTAC ATGGACCTTG TGCAGACTGT TACTAGAGAG GAAGTTGACA AGGAGATCAC CCGATGCCCT CACAAAGAGA CTAGCAAAAA GATGGAGGAG ACTATTCGTG CTGCCAAGGC CAAGGATGAT TCCTTGGGTG GTTCCATCAC CTGTGTTATC CGTCGATGCC CCCTCGGTCT GGGTGAGCCC TGTTTCGACA AGCTGGAGGC TGTCCTTGCC CACGCCATGC TTTCTATCCC CTCTACCAAG TCTTTCGAAA TCGGCTCGGG TCTTCGTGGA TCCACTTTCC CTGGTAGTCT TCACAACGAT CCTTTCGTGG AGGGCGTCGA TGAAAAGACC GGGGGAAGAA GGTTGAGGAC TGTGACCAAC TGGAGTGGTG GTGTTCAAGG AGGTATTTCG AATGGCGAGG ATATCTACTT CAGGCAAGTT CAAAGCTCCG ACCTCAGTGT GACATGGCCT AATGCATATG TTTAGAATCG GTTTCAAGCC TCCTGCGACC ATCGCTCAAG AACAGTCTAC TGCCAGGTAC GATGGTTCCG AAGGCGTGTT GGCTGCCAAG GGCCGACACG ACCCCTGTGT CGTTCCTCGT GCTGTTCCTA TCGTGGAGAC CATGGCTGCT CTGGTCATCA TGGAGTAAGT ATCTATTTCA TCTTTCTTAT CATGGATCGG TATTGATTCT GTCTGTGTTC TAGCATGGTC CTCCAACAAA ATGCCCGATT GACTGCCGCC TCTCTCCTTC CCGACCTCAC CCACTTGCCC CCTACAATGG TGCTTCCAGG CAAGAGTACC GTTCAGAAGA TCAGGGAGGG CCAGAGCGTT GGAGAAGTGC AAAGTCAGAA GGTTGGTGAG GAGTAGAGAA AATACTTTGA TTTCATAGAG ATGAATTGAG ATTAATTGT
|
Protein sequence | MSSFGTLYRV HTYGESHCKS VGCIVDGVPP GLQLTEADIQ TQLSRRRPGQ SDITTARSEF DTVHVQSGTE HGVTLGTPIG LLVHNKDQRP HDYAETDLYP RPSHADYTYL AKYGVKASSG GGRASARETI GRVAAGAIAE KYLKEAFGVE IVAFVASVGK TALPFADEED EVLGKAYMDL VQTVTREEVD KEITRCPHKE TSKKMEETIR AAKAKDDSLG GSITCVIRRC PLGLGEPCFD KLEAVLAHAM LSIPSTKSFE IGSGLRGSTF PGSLHNDPFV EGVDEKTGGR RLRTVTNWSG GVQGGISNGE DIYFRIGFKP PATIAQEQST ARYDGSEGVL AAKGRHDPCV VPRAVPIVET MAALVIMDMV LQQNARLTAA SLLPDLTHLP PTMVLPGKST VQKIREGQSV GEVQSQKVGE E
|
| |