Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE00660 |
Symbol | |
ID | 3257838 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | + |
Start bp | 176797 |
End bp | 179610 |
Gene Length | 2814 bp |
Protein Length | 666 aa |
Translation table | |
GC content | 48% |
IMG OID | 638256652 |
Product | KEX1 protein precursor, putative |
Protein accession | XP_570787 |
Protein GI | 58267262 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACGTGTTCCC GTTCAGCCGG TTCATTTCGA TCTTCCATCA TCACACTTTG GAACGGCAGC AATATCAACG GTGACTGTAT TATATCTGTC TCAACCTGTT CGCTCTTGTA TACTACCAGC AATCAGTTCA ACACCGTTCG ACCTCTGTGT TCCCTTCCAA CATGTCAGGA AACGTGGCTT GTGCGCGCGT GAGATCGCAG TCCTCGACAT TTTCACCTCT TGAGACTAGA GACGGTGATG AGCAAGAATC AAGCTCGTCG GATTCTCAGC ATGGAAGTAG CAGTATGTTT TGGACTTCCT GTATAAGATG AGCTTTGCTC CTGAACACGG AGCTGACGAA GCCTGCAGAA CGTGGACCCC AACGGCGAGC GACAGGTCAG TAACAAGCTT GATACAGGTA TTGCACGAAT AGCTCATCGA CTTTGTAGAC CTGCCGTCAG CCGCCGACCT TTATGTGCCT TCACTACCAG GTCTCCCTGA GATGGCCACG CACCCAACGC ATCCGCTGAA TATATATGCT GGCATGCTGC CTTCGTATCC TGGAGAGGGC AAGGTTGGAG GTGAAGGCCA GACTGGGAAA GATGCCAAAC TCTAGTAAGT CTACAATAAT GCTCCCAAGA TCACCGTGGA ACGCATGGGC GATTTCGTAC TCAATCCTTG CCTTAGCTTT TTGATGGCTA AAGCGCGGCG TAACGCTGGT AAAGAACGTG TCATCTTCTG GTTCAACGGA GGTCCTGGCT GTTCCTCTTT CGATGGCTCT CTCATGGAAG TCGGTCCATT CCGAACTGTT CCCGCCACCG AGACGACAAG CGGCATGGTT GAAGCCAAGC TTGTAGAAGG TGGATGGGAA GAGTTTGCGA CTGTTGTCTT TGTGGATCAA CCACCGGGCA CTGGATACTC TTATGCGGCA ACAGATGGGT ATCTGCATGA TTTCGATGAG GTTAATAGTG ACCTTCATGG ATAATAAGGT TGAAATGCTG ATATCGATCA ATCTTTAGCT CTCTGCGCAC TTCATCGAGT TCTTGCAGAA TTTCTATACC GTTTTCCCAG AGCTGAAAGG TGTCGACACC TATCTTGCAG GAGAGTCCTT TGCTGGGCAA TACATCCCCT TCTTTGCCGA CGCATTGATC AAGTCTATTG AGCTTCCAAA CTTCCCTCTC AAAGGTATCG CCATCGGTAA TGGGTGGATC GATCCTAAAG AGCAATATCC GGGATATGTT GAGTTTGCTT ATGAGAAGGG CTTGATAGTC TCCGGAACTC CGGTGAGTGA AACCTTGTTG GTCTATATAA GGAGCGAGCT GATCAAAGTT TAACTGAGTA GGAAGCAGAA GAGATGGAAT CTGCGCTGAA ACGTTGTCAG GAAGAGATGG ACAAGTACTC GGATCCATTT ACAACACCCG TAAATATCAA CAACTGTGGG CAAGTCATGG ACTCTGTCAC CAGGCCTTTT ACCCAAGAGT ACGTACATTG CCCTGCTTTC CAATGAATAT AAATTGACAA TGTGCCACAG ACTGAACGGG AAAAAGGTCT GTATGAATGT GTACGATGTC CGACTAGTTG ACGACTTCCC TGCCTGTGGT ATGAACTGGC CACCAGACTT GCCCGATGTC TATACTTTCC TTCGTGTGCG TTACTTCCCT ATCCTCCGTA TCTTAAACCC TCCAAGCTAA CCTTCCTCAT CACAGCAAGA TGATGTTATA TCCGCCCTCC ACGCCACATC CAAAGAAACC GCCTGGGTCG AATGCAACAA TAAGGTCTCT TACGAACTCA ACCTCAAAAA ATCACACATG TCAGCTGCCT TACTTCCTAG TATCCTAGAA GCAGGCGTGC CAATTTTGAT GTTTGCTGGT GCGGAAGATC TGATATGTAA CTATAAGGGG ATTGAAAGGA TCGTAAACGG TTTAGAATGG GATGGTGAGA AAGGTTTTGG GGTGAGTAGT CGGTCAAGGA GGTGATTGTG AAGTCATGCT AATAAAGATA TAGAATGCTA CAAGCCAGGA ATGGTATTTT AATGGTACCC AAGTCGGGAC ATGGCAAACA TCTCGAGGCC TCTCATATGC CAAGGTAAAT ATTTATTCTG GTGCAAGATT CCTCCTAATA GAATCTGTTA GATTTTTGAC TCGTCACATA TGGTCGGCTT TGACGTCCCT CACGTTTCCA ACGATATGAT TATGCGCTTC ATGGATGTTG ATGTCTCCCT TCTGCCTGGT ATGACTGCTC AATGGCCCTC ACGTATAGGC GACGATGAGC GCACCATGAT CCATGTTGGT GACGGCGAAT CGGGCGGAGT CCCCTTGATC GAGGGTGGCA ATACTGACTG GGAGGGTGAG CATAATTTGT TCAAGCAGGA ACTCAAGCAA GCTGACGAGA ATACACTTTG TAGCCTGGTA CAATGCCATT TTCGCCTTCC TCGTCCTTGG TATTCTCGTG TCCATCGCCG GCCTCTATTT CTACTTCCGC CGCAAGCCCG TCTCATACCG TTCCCGTATT TCTCTCAAGC AAAGAAGCAG ACGTCACCGG GGCCATGATA TGGATGAAGA TGAGGCTGCT GAGCGAATGC CTCTAGGCTC GGAGAGGTTG GAACTGGATG ATATTGAGCG GGCGGAGGGG TATGAGTTTC ATGATGGGGA TGGTGAGAGG TATAGTAGGG AAGGTAAAGG AAAGGGAAAA GAACGGGCAA AGGATAGAGA AGAAGTCGTG TTTGCGCTTG GGGATGACGA TGAGGATGAC CATCATTAAA AAGTGCAGCC CATGCCTCAT CGCGAGGGTT GTTGTTGTTG GATATTAGAT TGACCAGATG TAAATTTATA TGCC
|
Protein sequence | MSGNVACARV RSQSSTFSPL ETRDGDEQES SSSDSQHGSS KRGPQRRATD LPSAADLYVP SLPGLPEMAT HPTHPLNIYA GMLPSYPGEG KVGGEGQTGK DAKLYFLMAK ARRNAGKERV IFWFNGGPGC SSFDGSLMEV GPFRTVPATE TTSGMVEAKL VEGGWEEFAT VVFVDQPPGT GYSYAATDGY LHDFDELSAH FIEFLQNFYT VFPELKGVDT YLAGESFAGQ YIPFFADALI KSIELPNFPL KGIAIGNGWI DPKEQYPGYV EFAYEKGLIV SGTPEAEEME SALKRCQEEM DKYSDPFTTP VNINNCGQVM DSVTRPFTQE LNGKKVCMNV YDVRLVDDFP ACGMNWPPDL PDVYTFLRQD DVISALHATS KETAWVECNN KVSYELNLKK SHMSAALLPS ILEAGVPILM FAGAEDLICN YKGIERIVNG LEWDGEKGFG NATSQEWYFN GTQVGTWQTS RGLSYAKIFD SSHMVGFDVP HVSNDMIMRF MDVDVSLLPG MTAQWPSRIG DDERTMIHVG DGESGGVPLI EGGNTDWEAW YNAIFAFLVL GILVSIAGLY FYFRRKPVSY RSRISLKQRS RRHRGHDMDE DEAAERMPLG SERLELDDIE RAEGYEFHDG DGERYSREGK GKGKERAKDR EEVVFALGDD DEDDHH
|
| |