Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG01530 |
Symbol | |
ID | 3258842 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 439223 |
End bp | 441240 |
Gene Length | 2018 bp |
Protein Length | 359 aa |
Translation table | |
GC content | 48% |
IMG OID | 638257770 |
Product | endo alpha-1,4 polygalactosaminidase precusor, putative |
Protein accession | XP_571855 |
Protein GI | 58269398 |
COG category | [S] Function unknown |
COG ID | [COG3868] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTCCCTGCA AATGGCCGCC CAAAACACCT TTCGTTTCAC CATCCCTATT TCAATCCCAT GTATCATTTC GGCACGTTCT CGAAAGCAAT GATTTGACTG CCCCTTTGAC ACTCGCCCAT ATACACAATC AAGGCCCCTC TCGGTGTACA GGTGGAGGGC CCTTCGGGAG TCCGATAGAG CAATCCATAT TCGTTTCACG ATGTTGTGAA TCTCGACGCG TTGACAGGCT CAAATACAAA GGCAGTCCTT CACTTGAAAG CAACCAAGGT AATCAATTCA AGGTATATAA CTGCCACCAT GTATCTCTCC AACATCCACC CAAGCTCTCA AGGTCAGTAT CTTTCCCAAC ACACGTTCTT TACAAGTCGC CGTTCTTTCT TCACCAACCA CTATACAACA TCTCCATTCA CAAACAGACA TGATCTCAAT TTCTCCTTCA AGCTGGCTTC TTGTCATCTC CTCCCTGGTG GCAGTGACTG TGAGCGCCAT TAGAATGTAC TATTGATATG TAGCTGATAT ATATTACACC ACATCAGTCC GCCAAGCCAT TTCCCCGAAA CGTTCCTCAT GGTCATTGCG GCTCCTTCAC ATCTTCCCTT ACCATCGTTG AGACGTCCAA AGATGCTACG AGCGCCGAAG AAACTGTTGT AACCTCTTTG GCCATTGAAG CTGCCTTGGC TTCTGCCACA AGCTCTAGCA GTAAGCACAG AGCAATTCGT CGAATTCCAT TACCATTAAC TCACATTCCA TTTCAGGTGC TTCTGCTGCT ACTACTAGCT CGATCTCTAG TGTCGCCTCC TCTGTCAGCC TCAGTTCCAC CTTTGTTTAC GAGCTTGATG CTCAGTCGGT GACCTCACCT GAGATCACCA CCGACACTGG CCTTATTGTC AATGCGGATG TGTATATCGT AGACGGTGAG GGTACTAGCG AAGACACTAT GTACGTCTTC GGAAGATCAT AACCTAGCTT GATTACTTGC TGAACACTTT ATAGTGCCCA GTACCACGCA GACGGCAAGA GCGTTATCTG TTACTTCTCT GCCGGTACCT GGGAGCCTGG CAGGTAAATT ATCATCTGCC CTACTTTGTG AGCCTGGTAT TGAGGGATCG TATTGACTCT TTCGTACAGG TCTGACGCCG ATGACTTTGA CCCTGCCTGT GTTTGCGGCA CTGGAGGCTC CTTTGCAGAC AATGCTTGTT CATCTGATGA CAATAAACTC TCTGACTGGG ACGAGTGGTG GCTTGACATC CACAGTGCCT CGTGTATTTC CAAAGTCGAG AGCATCATGT CTGCACGTAT CTCGTCATTT GTGGCCAAGG GCTGTGACGG AGTAGACCCT GACAACGTTG ACAGTGTAAG TAGCATAGAA GCCCGCCCCA GGCCTCCTCA TGATAACTTT ACGCTGATTA TCTCTACTGT TTTGTCACAG TTTGCCAACT CGGACCAGCT CCACGGTAAC ACTGCCGACG ATCAGGTCAA CTACCTCCTC TGGCTGTCCT CCATTGCTCG TGGCCAAGGT CTCATGATCG ATCTCAAGAA CGCCGGCTCG CTTCTGGTGG ACGATAACGG TGATGCTACC TCTTACCAGT CTGAGATCGT TGAGGCCTTT GATTTTGTGG TCATTGAGTC CTGTGTAAGT TGATTGTGCT TCTATTTCTC CTTATCAAAG CTCGACAACA CAGCCCATTT TGCTGACTTC AATTGTTACT CTTCCTTGTA GCATGAGTAT GAAGAGTGTG ATATTTACGA CTCCTTCCTG GCTGCCGGCA AGCCCGAGAT CCAGATCGAA TACGGAGACA TTACCACCTG TCCGTCCCTG CAAGACGGCC AACACCTCTT AGTTTACAGC CAAAACGATC TCAGCTCAGC GTTGATCACT CTTGAATGTG ACTGATCAAA GTCGATCCAT ATTTTGAACC CTCAAATGTA TAAAATTATT CAAGAAGTGA AAGCACAAGA AAGGAGCGTT CAAGAAAAAG ATGATCAGCA GACAACTAGC TGATTGATTG TTTCCGTT
|
Protein sequence | MISISPSSWL LVISSLVAVT SAKPFPRNVP HGHCGSFTSS LTIVETSKDA TSAEETVVTS LAIEAALASA TSSSSASAAT TSSISSVASS VSLSSTFVYE LDAQSVTSPE ITTDTGLIVN ADVYIVDGEG TSEDTIAQYH ADGKSVICYF SAGTWEPGRS DADDFDPACV CGTGGSFADN ACSSDDNKLS DWDEWWLDIH SASCISKVES IMSARISSFV AKGCDGVDPD NVDSFANSDQ LHGNTADDQV NYLLWLSSIA RGQGLMIDLK NAGSLLVDDN GDATSYQSEI VEAFDFVVIE SCHEYEECDI YDSFLAAGKP EIQIEYGDIT TCPSLQDGQH LLVYSQNDLS SALITLECD
|
| |