Gene Information |
Locus tag | CNI03710 |
Symbol | |
ID | 3259752 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | - |
Start bp | 999543 |
End bp | 1001955 |
Gene Length | 2413 bp |
Protein Length | 555 aa |
Translation table | |
GC content | 49% |
IMG OID | 638258866 |
Product | thiamine biosynthetic bifunctional enzyme, putative |
Protein accession | XP_572595 |
Protein GI | 58270878 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase; [COG2145] Hydroxyethylthiazole kinase, sugar kinase family |
TIGRFAM ID | [TIGR00693] thiamine-phosphate pyrophosphorylase; [TIGR00694] hydroxyethylthiazole kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.036727 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACTGTCTCT CATCCATCTC CTAGATTCAC ACCCGACAGC ATGCCCAAGC CCACTCTCGA CTACTCTCTT TACCTTGTTA CCGGAAGGGA GCTTTTGCCT CCTGGAAAGG TAAGCCTACG ACCGCAGATC CTCGCAATAT TTGCAAAATT ATTGACGTGT TACTCGATCC ATAGGACTAC TACGAAAGTC TTGAAGAAGT AGGTGACCAA CTCGGTTCTC TTGAGCCTTT AGCTATCGAT TACATTACCT AACTATTGAT CAGTCCCTTC AAGGCGGCGT TACTCTTGTG CAAGTCCGTG AGAAATACGC CGACACTGGA GAAGTAAGTC AGATTATCGG TATTTGGATG CTGCGTGGAA GCTCATACGC TTTGAAAAGT TTATTGAAGT AGCTCGCCGC ACGAAAGCCA TCTGTGACAA GGCAAGCTAA AACCTCATGC CCTCTGTATA TATATCATTT AACAAAGTGC TTTCAGTACA ATGTCCCAGT CCTTATCAAT GACCGTATCG ACGTCCACCT CGCTGTCGGT ACGTCCTTAA AAAATGTGAA GCGTGGTATA GCTGCTCATC ACTGTCCCAT ACAGGCACTG CAGGCATTCA CGTCGGTCAA ACGGACTGCC CTATCGGTTT AGCTCGTAGC CTTGTCGGAC CGGATGCCAT TATTGGTCTG TCTGTCAGCA ACGTCAACGA AGCCAAACGT GCCATCCAGC AAGGTGCAGA TTATGTCGGG ATCGGTGCTG TGTGGCCTAC CAACAGCAAG GACGTGGCCA ATAAGAAGAT GTTGGGCCCC GATGGAGTGG GCGAAATTCT TGATTTGCTC CATGGAACTG GCGTACAGAG CGTTGCTATC GGTAAGCCAA CTTTTACATA TAGCAGAGTA CGGACGATCA TGGACTAATG CGTCATACGC GAGGCGGCAT CCATCTTCCC AACGTCGCTC AGCTCCTTCA TGCTTCCATT GCACCGCAAT CACGCAATGC TCTTGATGGC ATTGCCATCA TCTCGGACAT TGTCGCCTCC CTCACTCCTC GCGAGGCTGC CACGAATCTA CGAGAAGTTG TTCAGTCTTT CAAGCGCGCG AGGAGTCAAC TTTCCAACCT TGAAGCTGTA TACGGCACCA ACTTGTTCAG CGGTCCAAGG GGTGTAGATG GTTTTATTAA GGAGGCCGTC CATTTGATGG ACGTGATTAA GAGGGAGACT CCATTGATCA ATCAGGTAAG TCCATGCCGT ACGATCGTAT GCATTACTCC AACTGACGCA CATGAAAGAT GACTAACAAC GTCGTCATCA ACGATTCTGC CAATGTCACC TTGGCCATTG GCGCCTCTCC TATCATGGCG ACCCATCCTC GTGACGTCCA TGATCTCAGT CCCGCCATCG GAGCTCTCTT GATCAACTTT GGGTATGATT CGTATCAGTG TTCATTTAAG AGCTTGATGT TAATGCTTGA ATCTAGCACT ATTACGGACA AAGCAGGCAT GCTTGTGGCC GGCCGACAGG CCAACATCAA CAGGAAACCC ATCATTTTTG ACCCTGTAGC CATCGGCGCA ACTCCATACA GGCAAGAAAC GTCTGTCGGT AAGTTGCCCT CTGGCGATTG CCATACTCTA TGAAGAAAGG CAATCCTGAC GCTCTCACAA GAGCTCCTTT CTCACTGGCA GCCGACAATT ATCAAGGGTA ACGCTGGTGA AATCGGGTTC ATGGCGAGAT CGACAGAAGT TGCCAGTCGA GGTGTTGATT CCGTCGGTTC TGGTTTCTCT CGCCCAGGTG CTGTCGTCAA AGCACTTGCG CGAAAGCAGG 
GTACGTTAAT CATAGTCCCG TATAAAGCAT TTATCGCTGA TTCTTTCCTT CCTTCAATCT CTTATTGCCA ATTTTCAGCC GCAATTATCG TGCTCACTGG CGAACATGAC TATATCTCAG ATGGTTCAAC CACTCTCAAA ATCTCCAACG GCCACCACTA CCTAGAACGT ATCACTGGTT CTGGATGTCA GCTTGGGTCG GTTATTGCAT CATTTGCAGC TACCGCAAGA CTGGAGCATC TGGCGAAGCA CGGTGAATGG GAGAATGCGT CGCAGCTCGT TCAAGGTGAT ATGTTGGCTG CGGCTGTCAC CGGGTAAGTC GTCTTTTTTC AAACAAGTCC AAGCCCCGCA TCCATTGTGG TTGGGACGAT AAAGATTGAA AGCTGATTTC GCGGGAATCA GTGTGTTGGT GTATACCATT GCGGCTGAAG TAGCGGCCGC TCGTGAGGAT GTAAAAGGGC CTGGAACGTT CAGAGCCGCC TTAATTGATG AATTGTACAA CCTCACCCCC GAGGTTTTGC AGCAACGAGC CAAGGTTGAG ATCTTGTAGA TTGTGTTACA ACCCCCAACA AGATTGATGA ATGGATCGTT CTCATTAACC GTGTTGTGTA CAT
|
Protein sequence | MPKPTLDYSL YLVTGRELLP PGKDYYESLE ESLQGGVTLV QVREKYADTG EFIEVARRTK AICDKYNVPV LINDRIDVHL AVGTAGIHVG QTDCPIGLAR SLVGPDAIIG LSVSNVNEAK RAIQQGADYV GIGAVWPTNS KDVANKKMLG PDGVGEILDL LHGTGVQSVA IGGIHLPNVA QLLHASIAPQ SRNALDGIAI ISDIVASLTP REAATNLREV VQSFKRARSQ LSNLEAVYGT NLFSGPRGVD GFIKEAVHLM DVIKRETPLI NQMTNNVVIN DSANVTLAIG ASPIMATHPR DVHDLSPAIG ALLINFGYDS YQCSFKSLML MLESSTITDK AGMLVAGRQA NINRKPIIFD PVAIGATPYR QETSVELLSH WQPTIIKGNA GEIGFMARST EVASRGVDSV GSGFSRPGAV VKALARKQAA IIVLTGEHDY ISDGSTTLKI SNGHHYLERI TGSGCQLGSV IASFAATARL EHLAKHGEWE NASQLVQGDM LAAAVTGVLV YTIAAEVAAA REDVKGPGTF RAALIDELYN LTPEVLQQRA KVEIL