Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNN00140 |
Symbol | |
ID | 3255537 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006683 |
Strand | + |
Start bp | 48855 |
End bp | 51385 |
Gene Length | 2531 bp |
Protein Length | 602 aa |
Translation table | |
GC content | 52% |
IMG OID | 638254429 |
Product | hydrolase, putative |
Protein accession | XP_568522 |
Protein GI | 58262224 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.207981 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAGGAAAGC GCCCCATCAC CTGCATTTAC CTCCTCTCTC ATCCTTTCAT ACGCTATCGC CCCCAGATGA TTCCCCCGCT CGACGACTCG TCCCTGGTGA AGGCCGCGGA TAAAGCCTGG TGGAAGTCAG CGACCGTCTA TCAAGGTTGG TTCCATCCGA TTCCGGGCTG TTGTGACAGG ACTGATGTTT CCGCTTTCGT CAGTCTATCC TGTAAGCCTT GCTTGATATT AGCCGTATGA TATAGTACCA ATCTTCTCAT TTATCTCTCT TTTTTCTTCC CAGGCATCAT TCTGTGACCA TGCCGATGCA GGCCACGGCA CCCTCCTCGG CATCCTCACC AAGGTGGACT ACCTGCAATC ACTGGGAGTC GACATCGTTT GGCTTTCTCC TATATACGAG TCCCCCCAGG CAGATATGGG GTTAGTTTAC CGCTCTATTC TTTTCTCTGG GTCATGGCTG ATTCGTCATC GCAGCTATGA CATTTCTAAC TACCGTCAGA TCGATAAGCG GTACGGCTCG CTTGAGGATT GGGATAGACT ACTGGCTGCG CTTCACCAAC GAGGCATGAA ACTTGTTATG GACTTGGTTG TGAACCACAC CTCAGATCAG GTAATGTTGG GGAGGCTTGC CTATCTATAA ATGTTGATGT CCCTTACTTT GTAGCACCCA TGGTTCAAAG AGTCGCGCAG TTCCCGAGAC AATCCCAAGA GGGATTGGTA CATTTGGCGG CCACCTCGAT ACAATGAGAA GAACGAGAGG ATCCCCCCGA ATAACTGGAA AGGCACTTTC GGCCAGTGAG TCACTATTAC CACCGTTCTT GCGGTTTCAT CTCAGCACGA AGACGTTGAC GTCGAATTAG GGGATCAGCG TGGGAATTCG ACGAGACCAC CAACGAGTAC TACCTCCATC TCTTCCTCAA GGAACAGCCC GACCTTAACT GGGAGAACCC TCAGGTCAGG GCCGAGGTTT ATGATCTTAT GCACTGGTGG CTCAAGAGGG GTGCCGATGG GTTCCGTATG GACGTAGTAA GTTTGGTTCT ACGGCTTGGA ACTGGATTTG ACTAGGATCA GATCAACTTT ATCGCCAAGG CACCCGGTCT GCCAGATGCG CCGGTCATCG ACCCGGGACG GACGTATCAG TCGTTTGGGA TGATGTCGAT AAACCGTCCA GAGGTGCATG GATGGCTGAA AGAAATGAAT CGCGCGGTAC TCTCACACTA TGATTGCTTC GCGTGAGTTC CTCCAACCTG TTTGATATCC ACAGCGCTGA TCTCTGTAAA GCGTGGGAGA ATGTCCAGGT GACGAGGCCG TAGTGTCATA TGCCCCTTAT TCTGTGCCCC ACAATAAGGA GCTACAGATG GTCTTTCATT TTCACCAGTA TGTTGCCTCC TTACTTCTTG CCATCGCTTT CCCTGAGTAA GGTTCGGTAT CTAGTCAGAG CTTCGATAGG GCGGCTGGCG GGCTGGGACG AGTTCACAAT CCTGATTGGA AGCTGTCCGA GTTGAAAAGG GTCTTCAACA CCTGGCAGAT CGAGATGGCA CGTGAAGGTG GTTGGAATAG CAACTATTGA GTCTTTTCTC ACTACCCTAA CAAGCGAATC TGACCACGGT CGCAGTCTTG AGAATCACGA CCAGCCTCGG ATCATCTCCC GGATGGCATC TGACCATCCT TCGGACAGGG CACGGTGTGC GAAGCTATTG GCGATGTTCC ATTGCTCCCT TGGCGGTACA ATATACGTTT ATCAAGGGCA GGAGCTCGGG ATGATCAATG TTCCGCGTGG CTGGGGACTG GTGGAGTACA AAGATGTCGA GACTATTCAA AACTCTGAGG CCGAGGTGCA GCATCGACAA GTGATATGCG GCCATGCGAA TCCGGACATA TCCGACTTGC TTGAGAGCAA CCGCATTACA GCTAGGGACA ACGGTCGCAC TCCCATGCAG GTGGGTGTCG GAGATGAGAC GATGTTATTG GAGCGACTAA TGGAGAAGTT CGGTAGTGGG ACTCAAGTCT GAATGCCGGG TTCTCAAAGG GCGAGCCGTG GATGCGTATA CACGATGACT ACCGCGAGGG CTGGAACGCT GCCGCTCAGG TTAATGATCC GGACTCGGCA TGGTCTTTCT GGAAGCAGAT GCTCCGCCTG AGAAAGAAGT ACGACGCCAT GATATACGGT AAGCACAAGC CATCCCCCTT TGATTCGGAA GGTACGATTA ACCACCCTGC AACTCAAGGC GACTTCATCG CGCTGGACGA GTCGAACGAG GAAACTTACG CATATATCCG CGAGCACCCT CCGAGCGGGC AGAAGCTCCT CGTCGTCCTC AATCTTTCCC GCGGAAATGA CGGTCGTGGT GCACCCTCGA CTTTTGTGCT CCCATGCGGG CTCGATACAA GCGGCAGCAA GCTATTGATC TCTAATGGCG AGGTACAAGA GGGCCAGCGA ATCGAGGGGA ATATACTTTT GGGCCCATGG GAAGGCAGGA TCTACCTCTT ATGACGAACC TATCATTATT CAATAGAGCT GCATCATGAA G
|
Protein sequence | MIPPLDDSSL VKAADKAWWK SATVYQVYPA SFCDHADAGH GTLLGILTKV DYLQSLGVDI VWLSPIYESP QADMGYDISN YRQIDKRYGS LEDWDRLLAA LHQRGMKLVM DLVVNHTSDQ HPWFKESRSS RDNPKRDWYI WRPPRYNEKN ERIPPNNWKG TFGQGSAWEF DETTNEYYLH LFLKEQPDLN WENPQVRAEV YDLMHWWLKR GADGFRMDVI NFIAKAPGLP DAPVIDPGRT YQSFGMMSIN RPEVHGWLKE MNRAVLSHYD CFAVGECPGD EAVVSYAPYS VPHNKELQMV FHFHHQSFDR AAGGLGRVHN PDWKLSELKR VFNTWQIEMA REGGWNSNYL ENHDQPRIIS RMASDHPSDR ARCAKLLAMF HCSLGGTIYV YQGQELGMIN VPRGWGLVEY KDVETIQNSE AEVQHRQVIC GHANPDISDL LESNRITARD NGRTPMQWDS SLNAGFSKGE PWMRIHDDYR EGWNAAAQVN DPDSAWSFWK QMLRLRKKYD AMIYGDFIAL DESNEETYAY IREHPPSGQK LLVVLNLSRG NDGRGAPSTF VLPCGLDTSG SKLLISNGEV QEGQRIEGNI LLGPWEGRIY LL
|
| |