Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI00800 |
Symbol | |
ID | 3259442 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | + |
Start bp | 192827 |
End bp | 195791 |
Gene Length | 2965 bp |
Protein Length | 750 aa |
Translation table | |
GC content | 48% |
IMG OID | 638258565 |
Product | peptidase, putative |
Protein accession | XP_572743 |
Protein GI | 58271174 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCCGTAACA CCATGAGAGC CACCGTACTC ACCCTCCTTG GCCTATCAGC CTCGGCATGG GCTACTCCAG CGCCATTCAC CGTAGAAGAC ATGCTTGCTG CCCCTAGGCC TTTCCCGGCT ATTGCCAGTC CTGACAAGCA GCACGCCATC GCTGTCGTCG ACTACTGGGA ACCTAGGGAT GACTCGTACG TTTTCAGGAG TTTGCCCATG AAATCAAGCT AAATTACAAT GTAGTATGAG GAGAGAAGCA TACCTGGCGA CGCTCAACAG GCCCGAAGTG AAGCATCCCA TCTCCTTGTT CAATACCACT CCTTCGGCTG CAGCCGATTT CTTCTGGCTT GACGATGTTA CCATCGCATA CCTCGATGGA TCGAATCTTT TTTCTTACCC GGTTGAATAT GCATTCAGCC AATCCAATTT TAAGCCTAAA CACAACCCAC CTCGCTCCCC CAGGCATCAA AAGATCCTCT CATTCCCTCA CGGCGTCAAC CCTACTTCTC TTCAATATGA AGCAAGCACC AAAACCCTCG CGTTTACTGG CCAAGTGTGG TCGGATGGCT CATTCTACCA GACTCGACAT CACGACAAGC TCTACGAAAA GAAACGTGAC AGTGCCCAAG TATACGACGA CTTGATGGTC AGGCATTGGG ATACTTGGAG AGTCAGCGGA AAGGTCTGGA CGCTAGGTGT TGTCAAGCTG ATCAACATCA ACGATGAATG GGCAGAGCTT GATAATGATA TCAACAAGCA TCACAAACGT CGAGCTGAGT TTATTAACAT CTTGAACGGT ACCGATTTGG TATCCCAGAC CGACCCTATC GATGCGGGTT CTTACTCTAT CAGCTCTGAA CACATCGCTG TAGCCGTTAA GCCTCCTTAC CTCCAGACTG CCACACATAC GAGAGAAGAC ATATATCTTT TCCCTCTTCC TTCCTCTTAC GACTCTGCGT CCATCCTTCC CAAACACGTT ACTCCGCACG CCCATGGAGC TATCAGCGAA ATCAAGTTCT CACCTGATGG GAAGAAGCTT TCATGGCTTG AGATGAAGAA GGATGGCTAC GAGAGTGATA GGCGGGTGGT CGTTGTTTAT GACTTGATGA GTGGGAAGAG TGAAAGATGG ACCGAAGTTT GGGACAGGAG CCCTAAAAGC ATCTCCGTAC GATAAATTCC CCTCAACTTG ACATCAAAGA TATGCTAACA GAGCACTGGA TCATGTAGTG GGCGGTCGAC TCTCAATCCA TCTTCCTTTT GGCCGAGTTC CGAGGACGCA CCCTCCCATA CCACCTCACT CACCCCAACC ACCTCCCGAC TCCCCTTCTC TTCAACGGTA CAACCGTTTC CCTCACTCCA CTGAACGAGA CCGACATCCT CATCGCCCGT CAATCATTCC GAACTCCCAC TGTGGAATGG ATATTGACTT TGCCCGACCC TGCCGAGGAT GGAAATGCAG TTGGAGACGG AGACGGAGAC AAGATACCGG CTGTTGAGCC TTTGAGGCAA CTCACTCGAT GGAATGAACA TTTCATCCGT GGGAGGTTGG ATGTTCAGAC TGGTGAAGAG TTTTGGTTCA AGGGTGCTGA AGGCAAGGAT GTGATGGGAT GGGCTTTGAA GCCTCGTGGG TGGAAGCCTG ACCAGAAGGC CAAGTATCCT CTCGGTGCGT CCTTTTAACG TCGTTTTTTT TTTGCTTCTG GGAGAGGAAA TCTGATAAGA TATGTACAGC TTTCTTGATT CATGGTGGCC CTCAATCAGC TTGGGACGAT TCTTGGTCAA CTCGATGGAA CCCTGCCCTG TTTGCCGCCC AGGGTTACTT TGTCGTCGCC ATTAACCCTA CTGGTTCTAC CGGTTATGGA CAAGAATTTA CCGATGCTAT CCAGGGCGAT TGGGGAGGAA GTAAGTTTTT TTCGCCAAAA AGCCAAGCTT GCAAAAGATT TTTTTACTAA ATGCATATTA CAGAGCCTTT CAAGGACCTC CTCGCAGGTT ACCACTACGT CCTAGAAAAC TACCCTGAAG TAAATCCAAC CTTGCCCCTC CCTGATGCTC ACTTATTAAC ATTACGTAGA TCGACCCCGA ACGTACTGCC GGCCTTGGAG CTTCTTATGG TGGCTACATG GTCAACTGGA TCAACGGGCA TAACGACCAC TTTGGTTTCA AAGCTTTAGT ATGCCATGAC GGCGTGTTCG ACACGGTCAC TACCTTCTTC TCCACGGAAG AGATTTATTT CCCCACCCAG TGAGCAGCAC TTTCTGGGCG ATAAAATATG GCTAATATCG TATGCTGGTA GAGACTTTGC TGGTACACCT TGGACGAATA GGGCTACTTA TGAAAAGTGA GTGAGATCAG TGATGCAAAG AAGATGAAGA AATTACTAAT GTGATATTAT AAGATGGAAC CCTGTGAACC ACGTTATCGA GTGGAATACT CCGGAACTTG TTATCCAAGG CGGAAAGGGT ACGTGCAGAG AGGCGTTGAT TTGTCCTTGA CTAATATGGG CTATGTAGAC TACCGTCTGG AGAACTCTCA AGGTCTTGGT AAGTGAATAG CGCTATATAG TTCACCCTAG ATGCTGATGA ATGGAAATAG GCGCTTTCAC CGCTCTGCAG CTGTGCGTTT AGACTATGGA TTCAAGATGC AAGACTCTTG CACTGATATA CTGACGACCA TCTTAGCCAA GGAGTCCCTA GCCGATTCGT CTACTTCCCC GACGAGAATC ACTGGATTCT CAAACCTCAC AACTCTATCA AGTGGCACGT AAGTCTACCC ATACTCAAAA TGGTACTGGA TCAGTACTTA CCGGGTGATA TATTAGCACG AGGTGTTCCG ATGGCTGGAG GAATGGATTG GCAAGCCCAC GGATGAGAGT GAGGCTTTCG TCGTGCAGCG AGAGTGAAAG GAATAGATCA ACCGATTTTA TGCCTGAATA TTTAGCATAG TTATCATCAA TACAAATAAT GAAAAGGGAA ACACA
|
Protein sequence | MRATVLTLLG LSASAWATPA PFTVEDMLAA PRPFPAIASP DKQHAIAVVD YWEPRDDSMR REAYLATLNR PEVKHPISLF NTTPSAAADF FWLDDVTIAY LDGSNLFSYP VEYAFSQSNF KPKHNPPRSP RHQKILSFPH GVNPTSLQYE ASTKTLAFTG QVWSDGSFYQ TRHHDKLYEK KRDSAQVYDD LMVRHWDTWR VSGKVWTLGV VKLININDEW AELDNDINKH HKRRAEFINI LNGTDLVSQT DPIDAGSYSI SSEHIAVAVK PPYLQTATHT REDIYLFPLP SSYDSASILP KHVTPHAHGA ISEIKFSPDG KKLSWLEMKK DGYESDRRVV VVYDLMSGKS ERWTEVWDRS PKSISWAVDS QSIFLLAEFR GRTLPYHLTH PNHLPTPLLF NGTTVSLTPL NETDILIARQ SFRTPTVEWI LTLPDPAEDG NAVGDGDGDK IPAVEPLRQL TRWNEHFIRG RLDVQTGEEF WFKGAEGKDV MGWALKPRGW KPDQKAKYPL AFLIHGGPQS AWDDSWSTRW NPALFAAQGY FVVAINPTGS TGYGQEFTDA IQGDWGGKPF KDLLAGYHYV LENYPEIDPE RTAGLGASYG GYMVNWINGH NDHFGFKALV CHDGVFDTVT TFFSTEEIYF PTQDFAGTPW TNRATYEKWN PVNHVIEWNT PELVIQGGKD YRLENSQGLG AFTALQLQGV PSRFVYFPDE NHWILKPHNS IKWHHEVFRW LEEWIGKPTD ESEAFVVQRE
|
| |