Gene Information |
Locus tag | CNM00450 |
Symbol | |
ID | 3255256 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006682 |
Strand | - |
Start bp | 108534 |
End bp | 111417 |
Gene Length | 2884 bp |
Protein Length | 436 aa |
Translation table | |
GC content | 48% |
IMG OID | 638254204 |
Product | expressed protein |
Protein accession | XP_568456 |
Protein GI | 58262092 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family |
TIGRFAM ID | |
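The Gene Length value can be checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive, which matches the reported 2884 bp. The helper names `gene_length` and `cds_length_bp` are illustrative and not part of any database API. Comparing the span with the coding length implied by the 436 aa protein (assuming a full-length protein plus one stop codon) suggests how much of the span lies outside the coding sequence, which fits the "Is gene spliced: Yes" field.

```python
# Assumption: Start bp / End bp are 1-based, inclusive coordinates on the replicon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def cds_length_bp(protein_length_aa: int, include_stop: bool = True) -> int:
    """Coding length implied by a protein length (3 bp per residue, optional stop codon)."""
    return protein_length_aa * 3 + (3 if include_stop else 0)


span = gene_length(108534, 111417)   # values from the record above
coding = cds_length_bp(436)          # Protein Length from the record above

print(span)           # 2884
print(coding)         # 1311
print(span - coding)  # 1573 bp outside the coding sequence (introns and/or UTRs)
```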
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.98204 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTTGGGTTA TTTACCAGTT CCTTGTCTCC AGGGCTTCTA TAGATGGGTA TCGGATGCCT TCTACACTTC ACAACTTGAC AACCCAACGC AAAAACTCCT TCCTCTTCTT TTTGTTGAAA TTCGAAACAC TGCCACTCAC GTATCCACAT TCCCAACGGC GGTCGTAACA AACAGCGATT CCGCCCTCCG GCTGGGCAGC TTCCGGTTTA TTTCCGACAA ACATTTCGGA CATGTCCTGT CCGTCACTCG TTTGGCAGAA GTTCGCAGGT TTTTAATAAC TTCCTTCCCC CGAAAGGTGA TCTTCGTAAA CGGTTCTAAA GGCCGATATT CGCCTTGTTT CGGCTTTGCA ACTCAAATTC CCCCATGTGC TGGTCACTGC TCTCGGTCAA ATTTCACCAC CTCTTTTGAC TCCCCCCTTC CAAGCATTAT CTTTTGACGA TTCGTTAGCC GCGCTCCACC AAAAGTTTTG CTTTGGAAGC CTCAATACGG CTCGAAGGAG GTCATCAACC GGCGCAAAGG AAGAGGATCA GCGATCAGTC GCTTGTCGAC AACATTATTA AACGTTACGA TTGTCATATC AAGTCCCGGA CTCACATATA GATTGTCTCT TTGAGTCCGA ACGACAGACT TCCTCACCGT CGCTGGTGGC CAATTCTTTT CGCGCGGTAA CAAAGAAAGT ACTGTGAACG TCCGCTATCT AGCGACCTTG CGACGACGAC CAAAGTCAAT CTTTCGAAAA CCACAACGCC AAAAGATTCC AGGAGTGATC ATCATTAATT CCGTACGTCT GGCTCTCGTC ATCTTCTTCC CATTTTATCA TCAGCTGATT CCAGCTTTAA AGCAGTTATT ACTGGTGATT CAAGTGGTAT TCTAAATTTT TTATAATTAT CTGCCCTCCT ATTCGTGGCA GAAGGACGAG TGAAAAGAGT GGCAGCCAAC TCGCCGACCT TTCCCCCACT TCCCCCGCCG GCTCGGGACT TTTGTTTACT TCATCAGATA ACACCTCGGT TCAGCGCTAT TTAACCAACG ACGTATATAT TGTTGGTCCT ATCCACTGTG CGGCTAGTTT CTCTTGTTCA TTACAGCTAT AAAGGACTTA AGTCGTCGAC CCTGTGTATA GCCGGAAAAT GAGCCCCTAC ATCCATCCAT CCGAAGCAAG GGCCCCCGGG GAGTCTGTCA GATGTACCAC ACGGCTACCC CCCATTCTGG CGTCCATCGG TTAGTTATGA GATCAAAACA TCTCCTTCCT TAGTTTCACT CGACTGATCA GTCCTCCTAG GAACCGTCCT GATTGATGCA TTCGACAGTT TGCCTCGACC TATTGTGGAC GATAGCCAGG CTTCTTCTCG GCCAGCGTCC CCAGTCATCG ACGAGCGGGC GCCTCAACCA CAGGGGCATC TTCAACACTT AGTTATTCCC GCCCGTCGCC TTCGGCTCTC CCCTCCCCTG TCGTCCCCGG CAACTTCCGG AACCTCGGGC CCTTCGACTC CCCAGATGAC CCTCGATGAT ATCCCTGTTC CGAATGCGGA AGAAGTTTAT GAAATGCTAG GCGGTGGTGC TCTGTATGCA ATCGTCGGGG CGAGATTCTG GCTTCCTCCT TGCCAATTAC GAACTCTTGT CGATCGAGCC CCGGCTGAAA ATGATGACTG CCCAAAAGAT GTGGAGCAAA AGCTGGCGAA ATTGGGTAAT GAAATATGGG TTTGGAATAG AGGTGAGGGA ACGAGAATGA CAAGAGCAAG AATACGGTAT GAGGGCGATG TCCGATAGTA CGTGACAGTC ATGAATCTGA TAGTTATAGC 
TAATGGATAT CATCAGTTTC CAACCCGTTG TGAAGGCTCC ATATCGGACA ATACAGGAGC TTTCTACTTC ACCTCTTCTC TATGCTGAAT ATCTTCACAT TTCCCCTCCA TACTCTCCAG AAAATGTGGC TGTTATCGTC TCTGATCTGA AAGCTTTGCC GAAAGATAGC TGGCGACCTA AGATTGTCTT TGAGCCTACC CCCCCTTCAT GCCATCCTGG CCAGAAGGAC TGGCTCGAAC ACATTCTTCC CGATATCGAA GTACTCTCGT AAGTCGTTGC TTGATGAAGG AAACCTATTA TGCTGATTGG CCCTTAGCCC CAATCACGAA GAGCTCTTTT CTTTCTACTC TATCCCTACC ATGGCGACCT CTTCTATCTC GCTGCGTCCA ACAGTTGAAC GCCTGGTGAC CCATATTCTG CACGATGTCG GCATTGGCGC GAATGGACAA GGTATAGTGG TCGTCAGGTG TGGTCGGCTC GGAGCATGTG TAGGCACCAA GAAAGGCGGA TTAAAATGGT GTCCGGCTTA TTGGGAAGGT GATGATGTGA AGAATGTAAA AGATGTGACT GGAGGTGGGT TGGATTGGAA ACCATGTATC GTCAATATTG ATGTTTTCAT TGTCGTAGCT GGCAACTCTT TCCTGGGAGG TTATGTAGCA GGCCTTTCCC TAACTAATGA CCCTTATGAA GGTAAGATAC TTTCCAACAC TATCAGGAGT CCTACTGATT CCGTCCATAG CTCTTTTATA CGCCACCATT TCATCCTCCT TCGTTGTAGA GCAGTTCGGA CTGCCACGTC TAATGGATTG CACCGATCCT CTGACGGGCG AAGAAATTTG GAATGCCGAC ACACCCTCTC GTCGATTGAA GGAACTGAAA CGACGCTTGG GTCTACTATA ATGTTTTCCC TACATATACA CATCATGCGA GATCGGTTCA GGACATTTCT CTGCCACTTT TTTAGGATAC CGTTCATTAG GTGTATACCG TGATAAGCCT CGTGGTATCA TTATAGTTTG ATTAATAGAA TATGTCATAG CAATACAACT GTAACATCCA CAAAATATGC ATGTCAACGT ATGT
Protein sequence | MSPYIHPSEA RAPGESVRCT TRLPPILASI GTVLIDAFDS LPRPIVDDSQ ASSRPASPVI DERAPQPQGH LQHLVIPARR LRLSPPLSSP ATSGTSGPST PQMTLDDIPV PNAEEVYEML GGGALYAIVG ARFWLPPCQL RTLVDRAPAE NDDCPKDVEQ KLAKLGNEIW VWNRGEGTRM TRARIRYEGD VRYFQPVVKA PYRTIQELST SPLLYAEYLH ISPPYSPENV AVIVSDLKAL PKDSWRPKIV FEPTPPSCHP GQKDWLEHIL PDIEVLSPNH EELFSFYSIP TMATSSISLR PTVERLVTHI LHDVGIGANG QGIVVVRCGR LGACVGTKKG GLKWCPAYWE GDDVKNVKDV TGAGNSFLGG YVAGLSLTND PYEALLYATI SSSFVVEQFG LPRLMDCTDP LTGEEIWNAD TPSRRLKELK RRLGLL
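The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a display could be turned back into a plain string and checked against the Gene Length and GC content fields follows. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the short input is just the first two blocks of the gene sequence, used for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string; 0.0 for an empty string."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence shown above.
raw_fragment = "GGTTGGGTTA TTTACCAGTT"
fragment = clean_sequence(raw_fragment)

print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.4
```

Applied to the full gene sequence, the cleaned length and GC fraction should be comparable to the 2884 bp and 48% reported in the record, provided the sequence text was captured without loss.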