Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI04250 |
Symbol | |
ID | 3259473 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | + |
Start bp | 1127591 |
End bp | 1129143 |
Gene Length | 1553 bp |
Protein Length | 412 aa |
Translation table | |
GC content | 51% |
IMG OID | 638258920 |
Product | expressed protein |
Protein accession | XP_572902 |
Protein GI | 58271492 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.144806 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAATCGCCA TGCCTGCAAA GAAACTGCCT CCCTATCACC CTTCTCAGCC TGAGCCGGAC ATTGCTCCAT TCGACATTTC GACCGCTGTA GATGCAGAGC ACGACAAGCC CGCTGGTGAC GATGAACTTC TGCCCAAAGT GAGCCGCATG GTGTCTCACG GAATGACGGG GGCTAAAGAT CTTCGTATAG ATACCCAAGG ATGCCAACGA CAAGACTGTA GTATTTCCGC CTCTTATCTT GGGATGCTCC ACATTTGGCT ACGGGATATA TGCCGACGAC GATAATGTCC GGTCGTCTAT GCCATTACGG GTTGTGCGAT TGGCTTTGCG CAGCGGTATG AACGCCTTTG ACACTGGTGA GTGTTCTATG GTTCGTCTAT CCGTTGCGAG GAGCTGAATT TAGCTTAGCA CCATGGTACC ATCCATCAGA AATCATCCTC GGCAACGCGC TTGCCGCGTT AGATTATCCT CGCGGATCAT ACCATATCAT CACCAAAGTT GGTAAATACG GACCCAACTC GTCTGATCAT ATGTATAGTC CCGAAGTTGT CCAAGCTAGT GTGGAAAGAA GTTTGCGAAG GCTGAGGACT GATTATCTCG ATGCTGTTTG TGAGTTTTCC CTTCTCTGAA GCAGTTGAGT ACCGGAAGAT AGATAAGCTA ATAATTATAG ACCTGCACGA TGTAGAATAT GCTTTACCCG GCCCATCATA CGAAGGTGAC CCTGTATCTC TTCTTTCCAC CATCTTATCC CAGCCTCCGG TACCTACCGC CGAAGAACTC AAGATCTTAG ACGGCATTGG CGCCCTTCGC AAGCTCCAAA CCACGGGCCA TATCATACTC GTTGGTATCG CCGGTTACCC TCTGCCTATC CTCCTTCGTC TCGCTCTTCT CGTACTCCAT AGTACCCGAA AACCGCTCGA TGTCGTCCAG ACGTACGCTC ATCATACCCT GCAGAATGAT GCGCTCCAAC AAGGCTATCT ACAAGCTCTG GCGGAGAAAG CGGGTGTGAG GCAGATAGTG AGCGCTTCAC CCCTTGCTAT GGGTCTCCTC ACCACTTCAG GTGGACCTGG TTGGCATCCA GCGAAGGACT ATCCAGAGTT GTTCAATGCT ACCCGGGCAG CGGTGGAGTT GTGTAAAGAA AAAGGGACGA AGCTAGAAGA CGTAGCGCTT TCGTTTGGAT ATCGTCCACT GAGCCAGCCA AACGGTAGAC GGGTGCCGAT CGTGGTGGGA TGTAAAGATT TGCAAGAGAT GAAGGAGACG GTGAGAAGAT GGAAAGAGGT AAATCCAGCC CAAGGGGGCG AAGGGGGGCT GGAGAAGAAG GAACTGGAGG AGGAGGTGAA GAAGTTGTTT ACGGAGAAGG GGGTACAGGG GTGGAGCTGG GCTTGCCCGA GTGAAGCACA AAGGGCTGGA TAGGGTCCTG TCTGGTGTCT GCAAACGAGA ATTAGTAATG AAAAATGGGT TTTGCATATC TCAATGCATA ACAGATTGAT GGGATGCGTC GCAAAGGTAT CTTTTCATCG AGCAGTTGAA ATGAAGATGG TTGTAAACGC GCT
|
Protein sequence | MPAKKLPPYH PSQPEPDIAP FDISTAVDAE HDKPAGDDEL LPKIPKDAND KTVVFPPLIL GCSTFGYGIY ADDDNVRSSM PLRVVRLALR SGMNAFDTAP WYHPSEIILG NALAALDYPR GSYHIITKVG KYGPNSSDHM YSPEVVQASV ERSLRRLRTD YLDAVYLHDV EYALPGPSYE GDPVSLLSTI LSQPPVPTAE ELKILDGIGA LRKLQTTGHI ILVGIAGYPL PILLRLALLV LHSTRKPLDV VQTYAHHTLQ NDALQQGYLQ ALAEKAGVRQ IVSASPLAMG LLTTSGGPGW HPAKDYPELF NATRAAVELC KEKGTKLEDV ALSFGYRPLS QPNGRRVPIV VGCKDLQEMK ETVRRWKEVN PAQGGEGGLE KKELEEEVKK LFTEKGVQGW SWACPSEAQR AG
|
| |