Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC01970 |
Symbol | |
ID | 3256599 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 545380 |
End bp | 548579 |
Gene Length | 3200 bp |
Protein Length | 782 aa |
Translation table | |
GC content | 48% |
IMG OID | 638255417 |
Product | ATP-dependent peptidase, putative |
Protein accession | XP_569450 |
Protein GI | 58264588 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTCAACAGAA CAGCAAAGAG GTGTCAAAAG GGGCTCTCCA AATAATGCTT TCTAGGTCTG TGCAGGTGGC AGGGTTGGAC CTGTTTGTGT CTTCCAGGGC AGCCAGTTTA AGATGTCAAA AATACTGTGC GCGAAGTCCG TCAAATCTTA AGGTGAGCAA TCCGATGCAT ATCACGCAAC ATGGACGCTG ATGGAATAAC CGGCTTTGCA GGAAATGCGG ATCCCGAGGA AAACGTCGCA AATGGTCTCG CGACCTTGCA TTTCTTTTCG AGGCCTCCAC AGTTCTACCT CCTGGAATGG CATTTTCAGT TCTCGGTCTA CCACCGCGAA ACCATCAGAA ACCCAAACAG ATGTCGATCA ACCTTTAACT CCTTTCCAAG CTCGAGTTGC GGAACTGGAG ATCAAGGCTC ATGCAAATAA AGAGGATCCG GATGCCCAGC TTGAATTTTT ACGCCAGCTT TCCGAAGGAG GAGAATTCGC CGGTTTGGTG GCGTACTACG AAGGAATGGC TCTTGCCGAG GATACATCTG GAAGTCAGGC TCTTCTGAGG AATGATGAAG CATGGGCCAT ATTCATGGAT GCATTGGCGA GATCAGGCAG GTTGGGTGAT ATAGTGACTA AGGTCAGGAG AAGAGATCAG CTGCTGGCAT CCATAGGTGC TAATGGCGGG TCTTCCTCGT CTGCGCCTCT GGTATCTAAT CCTACACCAT CCTCTGTATC AGCAAACAAT TCGTCAACCT CAACGCCTTC TGTCAGTTCT CCAGGTCCTT CATTGACGTC GTCTTTGCTA AGCCGGGCGG TGTCACCGAC TTCCCTAGCA AATGCTTCGA ATGCTTCTAC CTCTCAGTCT CATCCTGGCG CAGGTTCTCC GCTAAATCCG ATATACGTAC AAATGGCCCC CCCTACTCCG CAGATGAATG CCTGGCGCGC TTTGCGTTGG GTGGCTGGAT TCCTGCTTTG GGGGTTTATT ATTCTCACGG TCATGTCGAT GGTGATAGAG AACACTGGGC TACTGAAGGC AGGCCCTGGT CCTGTCGAGT TTGAACCAGA AGAGGGCAAA ATAGTCAAAT TCAGTGATGT CCATGGGGTG GAAGAAGCTA AAGCAGTAAT TTTCAACCTC CCATGACTAG CTTATACAAT ATGGCTAATG TCGCTGGGTC CTTAGGAATT GGAGGAAATT GTCGAATTTC TCAAGAATCC GGAGAAGTTC TCGGCTCTTG GGGGCAAGCT TCCAAAAGGA GTCCTTCTGA CTGGCCCTCC TGGTACTGGT AAGACTATGC TTGCTCGTGC TGTAGCAGGT GAGGCGGAAG TTCCGTTTTT GTTCGCCTCT GGTTCAAGTT TTGACGAAAT GTTTGTTGGT GTCGGAGGTA AGTTTGGCCG CAACACTTCA TGAGGCAGCG GCTGCTGATC TGATTGACCG CAGCCAAACG TGTCAGGGAG CTGTTCGCTG CCGCTAGAAA GAAAGCTCCC GCCATCATTT TTATTGATGA GCTCGACGCT ATTGGCTCCA AACGAAGCGC CAAAGATCAA CACTACATGA AACAAACTTT GAATCAGCTA CTTGTGGAAC TCGACGGCTT TGAACAGGCG GAAGGTGTTA TCATCATCGC GGCTACCAAC TTCCCTGAAT CTCTCGACAA AGCTCTTACC CGTCCTGGTC GTTTTGATAG ACATGGTCAG TGTGGCGCCT CTCTAGTAAC TCTTAGCTTA CTTATATGCG ATCAGTTGTG GTCGGTCTTC CTGACGTCCG CGGGCGTATA GAAATTCTCA AGCATCATAT GTCCGAAGTG CAATACGATG TGGACGTTGA CCCTAGTGTC ATTGCACGAG GCTGCCCTGG TATGAGCGGT GCAGATTTAC AGAACCTAGT CAACCAGGCG GCTGTCAAGG CTTCCAGGGA TGGATCGAAC AGCGTTCAAT TGAAGCATTT CGAATGGGCT AAAGGTAAGG AAACTGGGTG ACTGACGAGT GTGGGGCTAA TATAAGTCAC AGACCGTATT TTGATGGGAG CTGAAAGGAA ATCTCATTAT GTGACAGAGG AGTCCAAGCG AGCAACTGCT TATCACGAAG GTGGTCACGC TCTTGTTGCT CTACATACTC CGGGGGCCAT GCCTCTACAT AAGGTGTAAG CGGTTTCTGG GTTCGTGACA AATGGTACGC TGCTAATACA TAATCAAGTA CTATTATGCC CAGAGGTCAA GCTCTTGGCA TTACTTTTCA GCTACCCGAA CAAGACAAGG GTGAGACTTT CGCCATTGCA ATGGGTGTCA TCGCTTATGA TCTAGCAGAT TCATATACCC GTCGCGAATT CAACGCTATG ATTGACGTTG CCCTTGGTGG CCGTGCTGCT GAGGAAATGA TATTCGGACA TGACAACGTG ACAAGTGGAT GCTCAAGCGA CCTTCAACGT GCAACAGATG TTGCTACTAG GATGATTCGG GTGCGTCATT TCTTTGATAT TCCAGCGTCC ATCAGATTGA CCCTATCTAG AATTACGGTT TCAGTGACAA AGTTGGATTA GTTGCTCATG GGGATGAAGA ATCTGTCTAT CTTTCAAGTA AGAAGAAAGA CGAGATCGAA AGTGAAATTC GGAGGTATGC AGCAATGGCT CCTATGATGA TCATTTTGAT GGCTAAGTTG GCTTTTAGTT TCCTTGATCA AAGTATGACC AGAACGGAGA ATCTTCTCAA GACGCACGAG AATGAGTTAC ATCGAGTGAG TACTGCTTGA TTGAATCAAT ATCGGAAAAT AGAATTAACG ATGATTGTCG CAGCTGGCTG ATGCACTCAT TGAGTACGAG ACTTTATCGT TGGATGAAGT GAAGCAGGTG CTAGAGGGGA AGCGATTAAG CAGACCAACA ACTGAAGGGG AAAGTTTAAA AGGTCAAGGT GAAAAGAGTG GGAAGGGTCC CATTGTTGAC GGCATTTAGC TTCTAAGAAG ATCACTATGC ATCGGTCACA GAATAGACGT CAAGGAGCGG ATGCTAACTC CTTGTTGTTT GGAGCCGAGT TATTGGTTCT GGATCTTCTT CTATGTCCTT ATATTCAATG TCTGAACGAT GATCGACATC CCCGTGCAGT AATAGGGGAT AGCAAAGAGT CGACGGCAAC TTAAGAAAGA CAAAAACTTT GCTGCCTATC AGATATCAGA TAATCTTGAT ATACGCACCT CGTAAGGCTC ATTAATCTTG TCTGTTGTTG
|
Protein sequence | MLSRSVQVAG LDLFVSSRAA SLRCQKYCAR SPSNLKEMRI PRKTSQMVSR PCISFRGLHS STSWNGIFSS RSTTAKPSET QTDVDQPLTP FQARVAELEI KAHANKEDPD AQLEFLRQLS EGGEFAGLVA YYEGMALAED TSGSQALLRN DEAWAIFMDA LARSGRLGDI VTKVRRRDQL LASIGANGGS SSSAPLVSNP TPSSVSANNS STSTPSVSSP GPSLTSSLLS RAVSPTSLAN ASNASTSQSH PGAGSPLNPI YVQMAPPTPQ MNAWRALRWV AGFLLWGFII LTVMSMVIEN TGLLKAGPGP VEFEPEEGKI VKFSDVHGVE EAKAELEEIV EFLKNPEKFS ALGGKLPKGV LLTGPPGTGK TMLARAVAGE AEVPFLFASG SSFDEMFVGV GAKRVRELFA AARKKAPAII FIDELDAIGS KRSAKDQHYM KQTLNQLLVE LDGFEQAEGV IIIAATNFPE SLDKALTRPG RFDRHVVVGL PDVRGRIEIL KHHMSEVQYD VDVDPSVIAR GCPGMSGADL QNLVNQAAVK ASRDGSNSVQ LKHFEWAKDR ILMGAERKSH YVTEESKRAT AYHEGGHALV ALHTPGAMPL HKVTIMPRGQ ALGITFQLPE QDKDSYTRRE FNAMIDVALG GRAAEEMIFG HDNVTSGCSS DLQRATDVAT RMIRNYGFSD KVGLVAHGDE ESVYLSSKKK DEIESEIRSF LDQSMTRTEN LLKTHENELH RLADALIEYE TLSLDEVKQV LEGKRLSRPT TEGESLKGQG EKSGKGPIVD GI
|
| |