Gene Information |
Locus tag | CNG00780 |
Symbol | |
ID | 3258966 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | - |
Start bp | 222727 |
End bp | 225517 |
Gene Length | 2791 bp |
Protein Length | 655 aa |
Translation table | |
GC content | 48% |
IMG OID | 638257694 |
Product | cytoplasm protein, putative |
Protein accession | XP_571803 |
Protein GI | 58269294 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.698042 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGCATACT GCACTGTCAT GGCCTGTTTT CTCCCCTCCA GCAGGGCCCC CGGTCCTATC CGCCTTGAAG ACTCTTTGTC TGATATTATC CTCGAGAAAT CGACCAATCT TGAGCTCCCG ACCTATGACA CTCTCGATGG TAGCGAAGAG CTCGAACAGC GCCTGAAAGC ACTCAAAGAC GAAATTCAGG ATGCAAAGGT CGATTGGTAG TATGTATAAG CCTTCTTGTG CAATATGGAA CGATATCTAA AAGGGGGATG CTAATGGCTT TCCTACACAA CAGCATAGTA CCGAGTGAAG ATGAACACCA AGTCAGTCAT CTCCCTTCAT CTGTACATTG GCCGATATTA ACGACTTGAC CACCTAGTCT GAAGAAGTAG GAGATTCGGA GAAGCGCCGT CAGTACATAT CAGGCTTCAC AGGTTCAGCC GGTACCGCTC TCATCCCCTC CTCGACATCT CAATCAGCGC TCTTGTTCGT CGACTCGCGA TACTGGATAC AAGCAGAACA ACAAGTACCC AAAGGGTGGA AAGTGGTCAG AGTCGGGTCA AGTAGCGGGG GAGGAAGTGG GAGAGCGGAT GCACAGAGTG GCTGGATGGA CTGGGTCGTG AACAAGCTGG AAGACGGGTC AAGAGTTGGG ATCGATCCGA AACTCATATC TCTGGGTACT GTTTTTCTTT CCTCGATGAA TTTTGATCAG TGCTTACTTC CTTGCAACGT AGACCTTGTT CACTTGATCC AATCGCGCCT GTCATCAATA GACTCTTCCA TCACCCTCGT TCCACTGTCG ACCAACCTCA TCGACAAAAT TCGCAATGTC CCTGCCCGTT CCCTTGGCCC TATCAGCCCC TACCCTCTCG CTTTGTCAGG GGAAGATACG CCTTCCAAAC TCTCTCGTGT TCGAAAGGCC ATTTCGCAGG CTGTGGGAGG GAATAGGAAG AGCAAAGTAA AGGAATGGGT GTATATTCTA CCTACATTAC CTGCTATCGC CTGGCTGCTC AACTATCGTT GTCCTTCGGA TATACCCTTC TGTCCGGTAG CTTATGCCTA TCTCGTCCTC ACACCGTCCC AGTGTGCTGT ATTTGTGGAT AAGCGTAAAG TTGAGAATGA GCTAGATGAG AGATGGAAAG GGGAAGATGT TGAAGTGAGA GACTATGGAG TTGAAGAGGT AGGGAAATTT GTGAAGGCAT TCGTAAATGA GAATTCAGAG GAGAGAAACG TAAGGGTATT TAGCCCTGCA GAGTGCAGCT GGGCTCTAGC CGAGGCATGT TCACCTGTAA GCTGCTAGAA CCTTCTCTTC AACACCAGAC CGAAGCTAAT CGCCATTCTC TCTTTATCAA GTCCAAAATA GCCACCATCA CTTGTCCCGT AGACGTCCTT AAAGCAGTGA AAAACCCTGT CGAACAGCAA AACTTTCGTA ATGCATACCT CAGGGACGGA CGGGCCATGG TCAGATGGTT GGCATGGCTG GAGAAGATGT TGCTCAAGAA CGGGAAAAAG GTTGGAGAAT GGGCTGCTGC CCAAGGCTTG ACAAGAGAAA GGAGAAAGGA AGACTACTTT GCGTGAGTTT TGATAACAAA TAAACAGGGT TGACGAATTT GGGCTGATTG AGATATTAAG CGGCCTTGCG TATGAGGACA TCTCTGCTTC TGGTCCTAAC TCTGGTAAAC TTTCATTTTG CATAAGTGGA CTTTACTGAC TCTTGATGAC TAGCTTTGCC ACATTATGCT CCTCAGCGAG GAAAGGACAG GTTGATTGAC CCGGATACCA CTTATCTGAT GTGAGTCGAT TTATCTAACC 
CCTGGGGCCA GATGCTCATT CATTGGCTGG ATCCAATCAA CAGCGACTCT GGAGCACAAT ATCAGGGTCA GTCCGCTCCT ACATCCATCA TATATCCAGC ACCGTAATTA GACCTTGATC TAACTCATCC CGTACAGACG CGACCATTGA TACCACTCGC ACTTTTTACT TTGGCTCTAC CCCTTCCCCC GAGCTCAAAC GCGCATATAC CCGGGTGCTT CAAGGACATA TCGCAGTCAG TATGGCCAAG TTCCCTAGAG GCATGCCGGG GGATAGGTTG GGCATGCTTG CGAGGAAAGC GCTTTATGAG TGAGTCTGGG GTTTGGAATG CTATCGTGAA GGAAACGCTG AAGTGCTCAT TCTGATCAGC GATGGATTAG ATTTTGGGCA GTAAGTATCG GAAATATGGA TCCTCCTTTT TATCACCTTG TGAACTTGCT AAGAAATAAT CATAGTGGAG TAGGTCATGG GATAGGTTCG TATCTCGGCG TACATGAAAG TGAGTGCCTC TTCCTTCACT CTTAAACGTC AGACAGTTTG GTTTGGAGCT GATAATCTAA CTCCAGACCC GATGTACTCG CACGATATCG CCTTTAAACC GGGTCATATT ACCACTGTCG AGCCTGGATA TTACAAAGAG GGAAAGTGGG GTATCCGAAT TGAATCGGTC TTGTTATGTA AACAAGTCGA GGTAAAGTCA TCAATTCTTG TGTCACAAAT GTAAATGGCT TATCAGGTGG TATAGACTCC AGAGGACGGG GAAGCATCTC AGTTCCTTGA ATGGGAACGA ATCACTCAGG TGCCAATCCA AACTTCGCTC GTTGATTGGT CGCTTATGGC AAAGTACGAG ATGCGCTGGC TGAATGAACA CAATAAAACA GTTCAAGAAG CTTTGGAGCC GCTTTTGCAG GGTGATGAGG ATGCAGAAGC TAGGGAATGG TTGAAAAAGG CTTGCAAACC CCACAAGGTT TGGCCTTGGG ATGGAGTGTA G
|
Protein sequence | MACFLPSSRA PGPIRLEDSL SDIILEKSTN LELPTYDTLD GSEELEQRLK ALKDEIQDAK VDWYIVPSED EHQSEEVGDS EKRRQYISGF TGSAGTALIP SSTSQSALLF VDSRYWIQAE QQVPKGWKVV RVGSSSGGGS GRADAQSGWM DWVVNKLEDG SRVGIDPKLI SLDLVHLIQS RLSSIDSSIT LVPLSTNLID KIRNVPARSL GPISPYPLAL SGEDTPSKLS RVRKAISQAV GGNRKSKVKE WVYILPTLPA IAWLLNYRCP SDIPFCPVAY AYLVLTPSQC AVFVDKRKVE NELDERWKGE DVEVRDYGVE EVGKFVKAFV NENSEERNVR VFSPAECSWA LAEACSPQNF RNAYLRDGRA MVRWLAWLEK MLLKNGKKVG EWAAAQGLTR ERRKEDYFAG LAYEDISASG PNSALPHYAP QRGKDRLIDP DTTYLIDSGA QYQDATIDTT RTFYFGSTPS PELKRAYTRV LQGHIAVSMA KFPRGMPGDR LGMLARKALY DDGLDFGHGV GHGIGSYLGV HENPMYSHDI AFKPGHITTV EPGYYKEGKW GIRIESVLLC KQVETPEDGE ASQFLEWERI TQVPIQTSLV DWSLMAKYEM RWLNEHNKTV QEALEPLLQG DEDAEAREWL KKACKPHKVW PWDGV
|
| |
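The derived fields in this record (Gene Length, GC content) can be cross-checked from the coordinates and the sequence text. The following is a minimal sketch, not part of the database itself: the function names `gene_length` and `gc_fraction` are hypothetical, and it assumes 1-based inclusive coordinates, which is consistent with the Start bp, End bp, and Gene Length values listed above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G or C bases, ignoring the whitespace that groups bases in blocks of ten."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(222727, 225517))  # 2791

# First two ten-base blocks of the gene sequence, used only as a short demo input.
print(gc_fraction("CTTGCATACT GCACTGTCAT"))
```

Passing the full Gene sequence field to `gc_fraction` would give the value to compare against the reported GC content; the short input here is only for illustration.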