Gene Information |
Locus tag | CNH00030 |
Symbol | |
ID | 3259220 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | - |
Start bp | 1185634 |
End bp | 1188578 |
Gene Length | 2945 bp |
Protein Length | 543 aa |
Translation table | |
GC content | 50% |
IMG OID | 638258482 |
Product | Alpha-L-arabinofuranosidase, putative |
Protein accession | XP_572197 |
Protein GI | 58270082 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
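
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the annotated protein is followed by a single stop codon; the variable names are illustrative and are not part of the record. The remainder it computes is only an estimate of the sequence within the gene span that does not code for the protein (the record marks the gene as spliced, so introns would fall here, along with any untranslated sequence).

```python
# Values copied from the record above.
start_bp = 1185634
end_bp = 1188578
protein_length_aa = 543

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Whatever is left in the gene span is not accounted for by the protein
# (introns and/or untranslated sequence).
noncoding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp)    # 2945, matching the "Gene Length" field
print(coding_length_bp)  # 1632
print(noncoding_bp)      # 1313
```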
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGATA CCAAAGAACA TAGCAAGAAG CTCAAAAGTA GGCATATACG TGATTCATTG AGGGCTCTGG GAGGAGACAA GACGAGAATA TGGGAAAGCA GGGACACCGC TCAAGAGGGA ACGGAAACTC AGGCACACTA CAGCTATAGG TGAGCTTCGT GTGAAGTCGC GAGCCAAATC GCCTTTCATC TGATGATCAT CATTCATAAT AAACCGCTCA TCAATTGTGC TACCAGATAA GTACCGGCCC GCCAGCGTTA GTGTACGGAT TGTGGTCGGG CCGTCGGGTG CCTGTTAGGG ATACGTACCG ATACACGGCG CCCCTGCTTT GCCAGTGGAA GAAGGCGGTT GCAATTTGGA CTCTTATCGG CTCAATATTG TCACGCCGCC GCCTGAGCCT ATCTGGCCGA GACAACGCAT GATTGCGCTT AGCTTATGAT CCGATAACCG TATGAATGCC GCCGACCTCG AGGTTTAGTG GTTGCAGAAT ACAGCTGTAT GAGGGCATGT AGGGGCCTTG CGACTTACAG CTTATAGCTT ATACGTCGAC ACTTCTATTC TCACGCTTCA TCCTCCAGCG ATCACCATGG CCGTCTACAG CCAACACGTT ACAGTGAATC AGTCTCTTCT CCAGCAACAG GCTCCACGCG ACTTTCCCAA CACTCCCTCC CATCCCAAAA CACTGGCATC AACTCCTATC CTCCTCGATG TCACCCGTCT CCTCTATCCC TCCGGACCTC CCGTCACAGA CAAGCTCTTC AGCGGCTTCC TCGAACATCT CGGGAGGTGT ATCTACGGAG GTATTGTGGA TGACCCGAAG GATCCAAGTC CGAAATGGCT TTTGGAGGTG CAGGATGAAG GAAAGACGTT TCAGAAAGAT AGATTGGGGT GGAGGAAAGA TGTGATGGGG TGCTTGGCGA AGGATGGGGA GCTCGAGATT CCAATGCTGA GGTGGCCTGG TGGTGAGTAT CTGCTATTAT CTTCGTCCAT TTGCAAATCG ACTAACCTTT GTGATAGGTA ACTTCGTCTC CAATTACCAC TGGCAAGACG GTATCGGCCC CATCTCTGAG CGTCCAAAGC GTGTCGAGCT CGCTTGGCTC AGCGACGAGT CCAACAGGTT TGGCACCGAC GAGTTTATCG ACTACTGCCG AGCGACGGGT GCTGAGCCTT ACATCTGTTT GAATAGTGCG TGTCTTTCTT GACCTTGTTG CGCAGTATTG ACGTCTGCCG GTAGTGGGCA CAGGAAGTCT TGAGGAAGCT TTGGCATGGG TTGAGTATTG CAACGGGACT GGCGATACCC AGTGAGCTTT CTCGCCTTCC TATCTATCTT ACATCGCGCT TATCGTCGAT CAAGCTGGGC AAACCTTCGA CGCAAAAATA CTGGAAGGGA CGAGCCTCAT GCTGTCAAAT ACTGGGGTCT CGGTAATGAG AGTAAGTCTG CTTTTCTTCC CAAGATGAGA TTTGAGTCTG AATTTCTGCT GCAGTGTGGG GTCCATGGCA AGTTGGCAAC CTTCTCCCTT CCGAGTACGT GAGGAAAGCA AGGCAATGGG CCCATGCTCT CAAGCTCGTT GACCCTAGCA TCGTATTGGT ATCTTGTGGC GAGACTGTGA GTATATTCAA TTTGGCAGCA GGGTTGAAAC TGACAATAGG TGTAGGGTGC GTCTGATTGG GATAGAGAGG TTATCCAGGG TCTTCTTCCA TGGGCTGATA TGCACTCCAT TCACTTCTAG TGAGCAAAAT CCCTACTCCA TCGCAATCCC CAACTTACAA AGATCTCAGT ACTATGCTCG GCCACGACAA 
GATGTCCAGC GTATCTGGTT ATGACTACGA GAAGAACGTC TTCGGGCCAG CCGTAAGTAT CACCTCTTCC TGCGCTGAAA GCTGCCTCGT TAATACTGAT ACGATGATAT ATAGGCTGCC GAGAAGCACA TCGACATCTG CAAGTCACTC ATCGACCTCG CTAACATCGG CCGAACCTGG GAGCGCATGC CAGCGAAGGA TATGAAGATC TGCTTTGATG AGGTGCGCTT CCTATCCTTC CATCGTTATC CGTAAGTGTA AGAGTTGACG ATGTTGGTAG TGGAACGTCT GGGACGACGT CAAAGCGCCC GGGTCCAACG GGTTGGAGCA ATGGTACGAC TACACCGACA TGCTTGGCTT TTGTGCATGG TTGAATGTGC TTGTGAGGAA GCATAAGGAT ATAGGCATTG CATGTCTTGC TCAGTCTGTC AACGTCGTAC GTCCCATCTC ATGAGCTTCT ACCTTATTCT ATTGAATCAA AGCTGACAGA TGGAGCTTAG ATCTCACCTC TCATGACCAA GCCAGATGGC ATTGTTCGCC AGACTTTGTA TTACCCCCTT CAGCTTTTCA GCAAGTACAT GAAGAACGGA CACCTTCTCC AACTTCCTGT CTTCCCTGAC GTCTAGTATG TATCCGTCCG TACCCATGTC TCACTCGTTA CTGACAAGCA TGGCAGCACC GGTCCCACGT TCCCTGTCTA CATCCAACAA GGCAACTACA AACCCTCCTA CGTCGACTCT GTCGCTATAC TCGTAGAAAC GGAGAAGGGA GCAAGCATCA GGGTTAGCAT TCTGAATAGG CATCCGGAGG TGGATTGGAG CTCGAAGATA GGGTTCTCGG GTTTCGGTAA GTACATTCTG GAGACGTTAT ATCTGAGGCG TATTGGAACT GACGATATGG GATAGAGGTC GAGAGTGTCG AGGTTCACGA GATCTACTCG GACGACCTTG CTGCTGCGGT CAGTTCCCTT TAACATGTAT TTGTACACTG CATCAACTGA AACCTCCTTA CTACAGAATA CGTTCGAAAA CCCTGACACT ATTATTCCTA AGATTACAAA GAAGAACGCG AAAGAGTGGG ATGGAGATGT TTTGGTTAAG AAGCATTCGT GGTCCTTCAT CATTTTCAAT GGTCGTTTTC ATTGA
|
Protein sequence | MAVYSQHVTV NQSLLQQQAP RDFPNTPSHP KTLASTPILL DVTRLLYPSG PPVTDKLFSG FLEHLGRCIY GGIVDDPKDP SPKWLLEVQD EGKTFQKDRL GWRKDVMGCL AKDGELEIPM LRWPGGNFVS NYHWQDGIGP ISERPKRVEL AWLSDESNRF GTDEFIDYCR ATGAEPYICL NMGTGSLEEA LAWVEYCNGT GDTHWANLRR KNTGRDEPHA VKYWGLGNEM WGPWQVGNLL PSEYVRKARQ WAHALKLVDP SIVLVSCGET GASDWDREVI QGLLPWADMH SIHFYTMLGH DKMSSVSGYD YEKNVFGPAA AEKHIDICKS LIDLANIGRT WERMPAKDMK ICFDEWNVWD DVKAPGSNGL EQWYDYTDML GFCAWLNVLV RKHKDIGIAC LAQSVNVISP LMTKPDGIVR QTLYYPLQLF SKYMKNGHLL QLPVFPDVYT GPTFPVYIQQ GNYKPSYVDS VAILVETEKG ASIRVESVEV HEIYSDDLAA ANTFENPDTI IPKITKKNAK EWDGDVLVKK HSWSFIIFNG RFH
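
Both sequences above are displayed in space-separated blocks of 10 characters, which most tools will not accept as-is. The following is a minimal sketch of how the blocks might be joined and a GC fraction computed; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first few blocks of each sequence are used here for brevity, so the GC value printed is for that short prefix and not for the whole gene.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C characters in a nucleotide string (0.0 if empty)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two display blocks of the gene sequence (truncated for brevity).
gene_head = join_blocks("ATGTACGATA CCAAAGAACA")

# First three display blocks of the protein sequence (truncated for brevity).
protein_head = join_blocks("MAVYSQHVTV NQSLLQQQAP RDFPNTPSHP")

print(gene_head, len(gene_head), gc_fraction(gene_head))
print(protein_head, len(protein_head))
```

Applied to the full protein sequence, `len(join_blocks(...))` can be compared with the "Protein Length" field as a quick integrity check.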