Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH00800 |
Symbol | |
ID | 3259090 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 958768 |
End bp | 961768 |
Gene Length | 3001 bp |
Protein Length | 521 aa |
Translation table | |
GC content | 47% |
IMG OID | 638258402 |
Product | GPI anchor biosynthesis-related protein, putative |
Protein accession | XP_572273 |
Protein GI | 58270234 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.512246 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACTCCATGGA CCAGCAAAGC ATCAGAGTAT TCTGGCCTAT CACTGGGGTC GATATGTCAG AGGGGAAAGT AGTCGGTTGG AGATTGAGAG ATACTCTGTG TGTTGTGGGG ATCGTTCAAG ATCGGGTATG CCTTTCCATT CTTATTAATA CTGTAAGTTA AATATTGAGT GCAGCTATGG AACAAGGTCC TTGCCCAAAT TGGTGAGGAA GAAGATTTGG TGGGGCTGGA GAGCATCGGT CGGGCAGTAT TGGATGCGAC CAAGAACACC GATGTCAATG AGAAGCAGTA TATTTTTTGG GTCAATAAGG AGCGTATACC GCTTTCATGT TCGTACGTAT GCTTCGAGAA GGGAAAATTA CCAGCTTGAT ACTGACCTTA TGACCATAGG GTTCCCACAA TTCTTATACT GTATAAACCA TTGGATTCCT CTCGCCTGCA GTACCTAACC CCATCATCCT CGTCTCCTGA TCTGCACGTA TCTGGGCAAG ATAGGCATGA TCCTAGGTAC CAGTTAACTA TTGGAGATGA TCAACTATCT GCCATCGTCG ACCTTGTAAG TCGAGTCAAG CCCTCCATCA TACGGTCTTA GGAAAAGGAC ACCTAATGCA AGAATAGATT AATAAGACAA GGCATGTTCA ACAGGTTCTT CGTTCGTTAC AAATGGAGAG CGCTGCAGAG GGAAAGAAAA AGAGAAGAAA GCAAGCATCC TTGCCATCTG CTCCTCGTTT CCTTTTCGCT CTCGACACTT GCGCACAGAT CACCATCTTT CTCTTCTCCA TTTCTATCCC CTGTTCAAGC TCCTTCCGCG CTATTTCCAC ATGTAAGTTC AACGGACTTC TCTACCTACC TCACGATTAA AGAAGCTCAT CATAGGTGAC CTCTTCTATT TGTCAGCGGC AGATCAATTG TGCACAAGGA TGGAGCAGTC AATCAGAGGA CCTATTCGGT ATTTGACGAC TCGAAACGAT GGGGGAATCA ATGACAGGGC AGCGCGATAT AATGTGTGAG TCCCATCAAA CATCAAGTTT ATCCCCCAGC ATTGCTTGGT TTTGCGAAGG GCTACTAGGC ATCGTAAAAA GGTCCTAACA AGGCATGCTG AGAATGTGCT GACTTTTGGG ACAAGGTTTT GGAATACGGT TTGGCTCGTT GTAGTGCGTT CCTGCCTCCA ACTGCCGTAC TCAGAGGGTT CAAGAACTGA TGAAACCTTC TCATAGAACG ACTTGGTCTT GGGATATGTA GCCCACAACC TCATTCGTCG ACATTCCGAA TGGATTTCCA CCACCACGAG CACCTTCTTC TCGGTAAGCA TTAAATGAAT CCTATGGCGA AAGTTGTTAA AATACCCGCT GCAAAGTCCT ACGTAATAGA TATGCCCATA CATGCTTTGA AGTGGTTGAA CGACTGGCCT GTGGGGCTGA AACTCAATAC TCCACTCAGC CAATTTTTTT GTTCGACGTT TACGTTCCTC ATCCAACGTT GGGGTGGTCC GTTCTTCCTC TCGTTCTTCC CCAGACTGTT TAGCTTACCA AGATGTTTCA TCTACATAAT TAAACTCCAG ATTGCGTCAC ACCCTCGCTC CATTCACTTT TACCCCAGCT CATGTACTTG TTATCCATTC TCTCCCTAAC AGGTTTTACC ACCCTCCTCG CAGCGTCACA CGACATACTC AACCTATTAA CGCTTCATCT TCTCTTCGGG TATAATGTCA TGAGAGCCGT GTGTGTCTGG CAAATTGACA GTTTGGGGGG CCTATGGAAT CTTTTCCGTG GTATGGTAAC TCTCCCTTCC TCCCTTCTCT CCTTTCGCCA TATCCCTGTC ACTGTTTTGT TTTGCTAATC GGAGGTATAG GTAAACGATG GAATGTCTTA CGGCGGCGGA CGGATTCATA CGAATACGAT ATCGATCAGC TCTTTCTCGG TACGCTCCTT TTCACAGTAT CAGCATTCCT TTTCCCTACC GTCCTCAGCT ACACAGCTCT CTTCTGCCTC GTGAGTCTTA TCGAGACAAG CATTTCCTTT GCAGTTTTGT TAGCATAAGT AAGCTGACAG GAAGATGGAA GACGAGAGGA ATGATATTCA TAATATGTCG CGTTTCGGAA GTGACCCGAC AGGCGATGAA CAGGTTTCCC ATATTTGAAC TTATATTATG GATAAAGGAA CCTTCAAGAG TGCCGGGTAA GTGGACATCT TTAATCAAAT TGGGCGATAG CAATGCTTAT CATCTGAAAG GGGGCTTGAA CATTACTGTG CAAACGGTAC CCTTGGGTGA CGAGGGGGAG AAAACTGAAG GTAGATTCAT CATGAGGAGA GCATTGGTTT TGAAGGTAAG TATACTTCGT ATCCGTGAAG GACAGTATTG AAAAGAGGAT GATAGAGTAC ACCCAAAGCA CTCTCGGACA TTTTATTCCA CCAGTGAGAT TTTGATTGCG GACGACAGGC CGGTACCATC GCCACGTATA TGCATCAACC GTTTACCAAA GCCATCTCTC TCCGAAATGT ACAAATTTCC ATCTCAAACT CAAAACAGTC AGGCAACGAA AAACACCTCT ATCCGTCGCA AAGAGGATTG CCAAGGGGGC GGCAGGTCGG TGACTTAGCA AGCGCAAGAA GTGCAATTGC AGCTATAAAG TGCGTCATTA AAACGACCTT CGCAAGAAAG AAGGACAAGG GAGAACCGTG AATGGACGTA CGTAGGAGAG CCGCAAGTGC TGCAATTGCC ATTTGCACCG CTTGAGTTAC ACTCGTTCAC GAGGGTCCTG CCGTTAGAGG CGCAAGACCC GCAGGAAGAA CCGTGAGAAC AGTCGGAAGA GTTGCCGCTA TCATTGAAGG TCGAAAGGAA GCCGGGGGCA CTAATGTTTA GTATTGGGGG GGGAATGGGG AGAAGAAGCG TACTTGGAGC AAGAACAGCT GCTGGAGCCG CAAGCGGAGT TGAGAGCCTA TGTGTCAAGG TTAGGCGATG AGCGAAGATG GGGACGTACT CAAAGTAGAT AAGATACGTA C
|
Protein sequence | MDQQSIRVFW PITGVDMSEG KVVGWRLRDT LCVVGIVQDR LWNKVLAQIG EEEDLVGLES IGRAVLDATK NTDVNEKQYI FWVNKERIPL SCSVPTILIL YKPLDSSRLQ YLTPSSSSPD LHVSGQDRHD PRYQLTIGDD QLSAIVDLIN KTRHVQQVLR SLQMESAAEG KKKRRKQASL PSAPRFLFAL DTCAQITIFL FSISIPCSSS FRAISTSADQ LCTRMEQSIR GPIRYLTTRN DGGINDRAAR YNVFWNTVWL VNDLVLGYVA HNLIRRHSEW ISTTTSTFFS WLNDWPVGLK LNTPLSQFFC STFTFLIQRW GDCVTPSLHS LLPQLMYLLS ILSLTGFTTL LAASHDILNL LTLHLLFGYN VMRAVCVWQI DSLGGLWNLF RGKRWNVLRR RTDSYEYDID QLFLGTLLFT VSAFLFPTVL SYTALFCLTR GMIFIICRVS EVTRQAMNRF PIFELILWIK EPSRVPGGLN ITVQTVPLGD EGEKTEGRFI MRRALVLKST PKALSDILFH Q
|
| |