Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE02200 |
Symbol | |
ID | 3257817 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | - |
Start bp | 595578 |
End bp | 597585 |
Gene Length | 2008 bp |
Protein Length | 546 aa |
Translation table | |
GC content | 49% |
IMG OID | 638256811 |
Product | prolidase, putative |
Protein accession | XP_570893 |
Protein GI | 58267474 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.555786 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTTCATATA TATCCATATA CTCTCCCCGT AAAAGCGCCA TGGCACAGAA GTACCCCTCT AGGGCACATG CTCTCAAGCT GATCCACGAG CTTGCCAAAC TGATCCCCGA CCACGAACGA GGCAAGGTTA GTTTCCCTTT TGTTGTCGAC AATCTTCGCT TGTCGTTCAG GCTTTATACA TGGCTGCTGG CTTACCATAC ACATCCATAG CTACATGGGA TCTTTTTGCA GGGGTCTCCT ACCCTATTCA GAGATGATAC CGACCATGAA CATCCCTTTC GTACGTGACT GTCTTTTGCT CGTCTCGGTG CTATAGCACT TACGCTTGCC GGCAGACCAA GAAGCCAACT TCAACTATCT CTCTGGAATC ATTCACCCCA ATTGCTCTCT TGCCGTGTTC TTCTCCCTTC CTGCCACGCC CTCTTCCTCT TCCGTGATCG AGCACCATCT TTTCATTCCG GCAGCAGACC CTGCCGAGAC TATGTGGTCG GTCGCTCCTC CAACAATTGA AGTGGCCAAA CAGGTATATG ACAGCGATAA TATCACATTT ACTAGCTCTA TCCCCGGAGT GTTGGGATCG GCTGTCAAAA GTGGCAATGG CGAGCTGGTT CTCCATGTAC TTCCTAGGAC TATGGAATAC CCGGCTTTGC CAGAAGTGAT CGACCAAACA TCGGGCCTTC GACTCGAATC GTCGTATCTG TTCAAAGCGC TTCACATTGC TCGTCTCACA AAGGACGAAC ATGAGATAGA TTTGATTCGC CAGGCCAATC GAATTTCCAG TGCCGCTCAC GAGGTTGTAA TGCGCGAATT AGGAAGGTTC GCATCAGCGA GAGAAAAGGG CGGAAGAGAT TTGAAAGAGA GGACCGGCAA GGAAGGAGTG AAGGAATGGG AAATTGAGAG CGAGAGGGAT GCCGAGGCTG TCTTTGTGGC CACGTGCAAA CGTATGGGGT GAGTTCAAGT TCTTTTACTA TTATATTAGA ATCTCACGCT TCATAAAGGG CAACAGACCA AGCTTACTTG CCTATCGTGG CAAGTGGCAC TCGGTCCAGT ACACTTCATT ATGTGTAAGT CAACATATCT CCACGGAATG GAAACGCTGA CAAGCATTTA GCTGCAATGA CCGTCTTTTC CCATCTATTC CCCGTAAACG TGGAGACGTC ACCTTCACCC ATGAAGTATC TCGTGGTTGC TGTGGCGACG ACCATAATCA ACCCATTTCT TCCGTTTTGC ACAATGATGC TTTCCTTCCG CAATTACTCT TGATCGATGC GGGATGTGAA TGGAAGGGGT ATGCCTCCGA TATCACGAGG ACAATGCCCA TCGGAAACGG CGGCAAATTC ACAAAGGAGG GTGGTGAGAT TTACGAATTG GTTTTGAGGA TGCAAAAAGT GAGTGAAGGC CGGTAGATGC TAAGTGACAT CATTAATCCG GCAATAAATA GGAATGTGAA GAGCTGGTTA AACCAGGAGT TCACTGGGAC ACTATTCACC TCCATGCTCA CAAAGTGTTA ATCGATGGAT TGCTCTCCCT TGGCATCCTT ACCGGATCTC CAGAAGACAT CCTTCAGAGC GGTGTCACTG CCGCCTTCTT CCCTCATGGG CTTGGTCACT CTCTCGGCCT CGACACCCAC GATTCTCTGC AGTATCTCCG TCTTGTTCAC GAAGATCTCC CACCAACAAC AACTTCCACC CCTTCGAAGC TCTACAAATT CCTTCGTATC CGCTTGCCCT TGACGCTCAA CATGGTTCTC ACAGTTGAAC CAGGCTGTTA TTTTGCCCCT CAGCTAATGG AAGAGCATGG CGTCTGGACA AGCAGATTCG TGGTCCAGGA CAAACTGAAG GAATATGTGG GTATTGGAGG GGTAAGGATT GAGGATGTCA TTGTCGTAAG GGAACGGGGA GTAGAAAACT TGACCACGGT TGGAAAAGAG AGAGACTGGG TCGAAGCGGT CTGCTCTGGA ACCCTTTGAT CAGTGGTTTC AAAGGGGATT ATTTTGTAGC AGATATGC
|
Protein sequence | MAQKYPSRAH ALKLIHELAK LIPDHERGKL HGIFLQGSPT LFRDDTDHEH PFHQEANFNY LSGIIHPNCS LAVFFSLPAT PSSSSVIEHH LFIPAADPAE TMWSVAPPTI EVAKQVYDSD NITFTSSIPG VLGSAVKSGN GELVLHVLPR TMEYPALPEV IDQTSGLRLE SSYLFKALHI ARLTKDEHEI DLIRQANRIS SAAHEVVMRE LGRFASAREK GGRDLKERTG KEGVKEWEIE SERDAEAVFV ATCKRMGATD QAYLPIVASG TRSSTLHYVC NDRLFPSIPR KRGDVTFTHE VSRGCCGDDH NQPISSVLHN DAFLPQLLLI DAGCEWKGYA SDITRTMPIG NGGKFTKEGG EIYELVLRMQ KECEELVKPG VHWDTIHLHA HKVLIDGLLS LGILTGSPED ILQSGVTAAF FPHGLGHSLG LDTHDSLQYL RLVHEDLPPT TTSTPSKLYK FLRIRLPLTL NMVLTVEPGC YFAPQLMEEH GVWTSRFVVQ DKLKEYVGIG GVRIEDVIVV RERGVENLTT VGKERDWVEA VCSGTL
|
| |