Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK02680 |
Symbol | |
ID | 3254574 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 778557 |
End bp | 781408 |
Gene Length | 2852 bp |
Protein Length | 647 aa |
Translation table | |
GC content | 47% |
IMG OID | 638253760 |
Product | cytoplasm protein, putative |
Protein accession | XP_567867 |
Protein GI | 58260914 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.980721 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TACCCTATAT TCATCGTTCA GAATATCGCC TATATTACCA AAAATCACCC TCAGACTCAC ACGGAAAGTC TGTTCCAAGA TGGCAATGTC GTCCAGCAAG CTTGCGGACT TGAGACAATT GATGAAGGAG CAAGGCGTTG ATGCTTAGTG CGTACAATGG ATCTATCACC TTAACGGCAC TGATAAGATG CAGTGTGGTA CCTTCGGAAG ACGCTCGTAA GGCATATACA CCTACATCTC TAGCTGGTTA TGGATTGACT GAGCCTTTCT GATTGTTATT TCTTGTCAGA CGCTTCCGAA TATCTGGCAC CATGTGACGC ACGACGGGCC TATATTACAG GATTCACCGG GTCTGCCGGC TGTGCAGTTA TAACCCATGA CAAAGCGCTT TGCTGGACTG ATGGCAGATA CTGGCTTCAA GCAGAGAAGC AACTCGGTGA AGGGTAAGTA GACCCTAAAT GCAGAGTGGT CACATACTGA TCTGTTTCAG ATGGGCATTG ATGAAGAGTG GGCTGCCTGA AGTTCCTACT TGGGCCCAAT GGCTTAGCAC AGTAGATATC AATCTTCAAT TATCTGAGAA TCCATCTAAC AACGTTATCA GGAAGTTTCA CCCAATTCCT TGATCGGTAT TGATCCCACC GTCATTCCCT ACTCTGAAGC ACTCTCACTC CTCTCTTCCC TTCCCTCATT GTCTCCTGCC CCAAGTGCAG CATCTCCTTC AAGACTCATC GCTACTCCAA ACTTGATCGA TTCCCTTTGG GTGCCTCCTT CCCGTCCCCT TCGACCTTCT CAACCCATAT TCCATCTGGC CGATAGGTAC ACTGGGGAAC CCGTCTCTTC CAAGTTGAGG CGACTGAGGG ACAAGCTTAT AAGGATAGGG AGTCCCGGTA CAGTTGTAGC ATCGCTTGAT GAGATCGCTT GGGTGTTCAA TTTGAGAGGA GCGGACATTC CTTATAACCC TGTAAGCCTC GGTCGTGACA AATGAAATTT CAGTTGCTGA AAGAGGGGCT TAGGTATTCT TTGCGTATAC CATCATCACT CCGGATGATT GTACCCTCTT TGTCTCGCCT TCCTCTCTCA CCATTGAGGT TCGATCCTAT CTCCACTCCA ATGGAATAGC CGTTCTTGAC TATTCTCATG TGTGGACTTC ACTTGAAGCT TGGAAGAAGA GGGTCAAGTT TGACCAAGAG AATAAAAGCA GGGAGCAAAG AGATGGTGTG AAGCGGGCAA GGCTCGAGGA GGAGGCAAAG AAAGAGGAAG AAGGAGAAAG GCTGAAGAAA ACAGACAAGA TCTTAATTGG AAACAAGACG AGTTGGGCTG TTGCCAAAGC GGTTGGAGAG GTAAGGCATG TGGATACATA TGCGTCAAAC AAATCTAATT ATTGAGCAGG ATAATGTGGA AGTACGACGA TCTCTAATTG AGGAGATGAA AGCCAAGAAA AACGCGGTAT GTCTTTGGCT CTTTCTTAGT ATTCGATATT TGCTAACATG CATGGATTCT TGCAGACTGA AATTGAAGGC TTTCGCCAAT GTCATATACG TGACGGGGCC GCCCTTGTGC GATATCTTGC TTGGCTGGAA GAAGCGCTTG AGAATGGAGA AAGCTGGACG GAGTATGATG CAGCGACCAA GCTTGAAGAT TTCCGCAAGT GAGTTTACCG TCCCTCATTT CAAAGTCTTG TAAGCTGACT GTTCAATAGG GAAAACAAAC TTTTCATGGG ACTTTCATTT GAAACCATCT CGTCTACTGG TGCAAATGCC GCCGTCATTC ATTACTCTCC GCCCGCAGAG GGGAGTAAGG TGATTGAAAA AAAGCAAATG TACTTGTGTG ACTCTGGCGG TCAGTACTGC TGGTATTCAA TGGAGACCAA CGAGCTGATT GACCTTTTAA ACATAGCCCA GTACTTGGAT GGGACCACAG ACGTAACTCG AACACTTGTA GGTCCAAGCG CGTCGACGTC AATCGACGTC TTTCAGGGAA GCTAATGCGT ATGTAGCACT TTGGCACACC CAACGAGGAC CAAAAGCGTG CATTCACCCG AGTGGTGAGT TGCATCAAAT GATAGGTGCC GAATTAGTAG CTGATCCATG TACATTAGTT ACAAGGACAC ATTTCCTTAG ATACTATCGT TTTCCCTCAG GGTACAACTG GTAAGTCAAT TTCTCTTGTT GTGATGATCT GAGCTGATTA CTTGCTTAGG CTATATTCTG TAAGGGATTA CAACACGATG ACCGGGAAGG TGCTGATGCC TGTTTAGAGA TGTACTCGCC CGTCGAGCTC TTTGGAGTGA AGGACTGGAC TACCGGTATG TGCCTTTTTA CAAAGCATCG TTATGCAACT TATGTATTTG AATCTTGACA TTAGCCATTC AACATCCCAC GGCATTGGTT CTTTCCTCAA TGTCCACGAA GGCCCTCAAG GTATAGGCCA ACGACCGGCG TACAATGAAG TGCCTTTACA AGAGGGTATG GTTATCTCGA ATGAACCCGG CTATTATAAA GATGGTGAAT GGGGGATTCG AATCGAAGGG GTGGACGTCA TCGAGAGAAG GGAGACGAGG GAGAATTTCG GTGGTAAAGG GTGGTTGGGA TTTGAAAGAA TCACCATGGT GAGTTATGCG AACTACTTGA TGTGCCAGCG CTCATTTTCC CATCTTTTCA AGTGTCCTAT CCAGACAAAA CTTGTGGATT CTTCGCTGCT CACCATCGAA GAGAAAGACT GGCTCAATGA ATATCACGCA GAAGTCCTCG CAAAACTAGC GCCGGTGTTG AAAGAGATGG GAGACGAAAG AGCAGGTAAA TGGCTGGAAA GAGAGTGCCA ACCTCTGTAA GAGGGGTTTT TTTTGGACGC GA
|
Protein sequence | MAMSSSKLAD LRQLMKEQGV DAYVVPSEDA HASEYLAPCD ARRAYITGFT GSAGCAVITH DKALCWTDGR YWLQAEKQLG EGWALMKSGL PEVPTWAQWL STEVSPNSLI GIDPTVIPYS EALSLLSSLP SLSPAPSAAS PSRLIATPNL IDSLWVPPSR PLRPSQPIFH LADRYTGEPV SSKLRRLRDK LIRIGSPGTV VASLDEIAWV FNLRGADIPY NPVFFAYTII TPDDCTLFVS PSSLTIEVRS YLHSNGIAVL DYSHVWTSLE AWKKRVKFDQ ENKSREQRDG VKRARLEEEA KKEEEGERLK KTDKILIGNK TSWAVAKAVG EDNVEVRRSL IEEMKAKKNA TEIEGFRQCH IRDGAALVRY LAWLEEALEN GESWTEYDAA TKLEDFRKEN KLFMGLSFET ISSTGANAAV IHYSPPAEGS KVIEKKQMYL CDSGAQYLDG TTDVTRTLHF GTPNEDQKRA FTRVLQGHIS LDTIVFPQGT TGYILDVLAR RALWSEGLDY RHSTSHGIGS FLNVHEGPQG IGQRPAYNEV PLQEGMVISN EPGYYKDGEW GIRIEGVDVI ERRETRENFG GKGWLGFERI TMCPIQTKLV DSSLLTIEEK DWLNEYHAEV LAKLAPVLKE MGDERAGKWL ERECQPL
|
| |