Gene CNC01970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC01970 
Symbol 
ID3256599 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp545380 
End bp548579 
Gene Length3200 bp 
Protein Length782 aa 
Translation table 
GC content48% 
IMG OID638255417 
ProductATP-dependent peptidase, putative 
Protein accessionXP_569450 
Protein GI58264588 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTCAACAGAA CAGCAAAGAG GTGTCAAAAG GGGCTCTCCA AATAATGCTT TCTAGGTCTG 
TGCAGGTGGC AGGGTTGGAC CTGTTTGTGT CTTCCAGGGC AGCCAGTTTA AGATGTCAAA
AATACTGTGC GCGAAGTCCG TCAAATCTTA AGGTGAGCAA TCCGATGCAT ATCACGCAAC
ATGGACGCTG ATGGAATAAC CGGCTTTGCA GGAAATGCGG ATCCCGAGGA AAACGTCGCA
AATGGTCTCG CGACCTTGCA TTTCTTTTCG AGGCCTCCAC AGTTCTACCT CCTGGAATGG
CATTTTCAGT TCTCGGTCTA CCACCGCGAA ACCATCAGAA ACCCAAACAG ATGTCGATCA
ACCTTTAACT CCTTTCCAAG CTCGAGTTGC GGAACTGGAG ATCAAGGCTC ATGCAAATAA
AGAGGATCCG GATGCCCAGC TTGAATTTTT ACGCCAGCTT TCCGAAGGAG GAGAATTCGC
CGGTTTGGTG GCGTACTACG AAGGAATGGC TCTTGCCGAG GATACATCTG GAAGTCAGGC
TCTTCTGAGG AATGATGAAG CATGGGCCAT ATTCATGGAT GCATTGGCGA GATCAGGCAG
GTTGGGTGAT ATAGTGACTA AGGTCAGGAG AAGAGATCAG CTGCTGGCAT CCATAGGTGC
TAATGGCGGG TCTTCCTCGT CTGCGCCTCT GGTATCTAAT CCTACACCAT CCTCTGTATC
AGCAAACAAT TCGTCAACCT CAACGCCTTC TGTCAGTTCT CCAGGTCCTT CATTGACGTC
GTCTTTGCTA AGCCGGGCGG TGTCACCGAC TTCCCTAGCA AATGCTTCGA ATGCTTCTAC
CTCTCAGTCT CATCCTGGCG CAGGTTCTCC GCTAAATCCG ATATACGTAC AAATGGCCCC
CCCTACTCCG CAGATGAATG CCTGGCGCGC TTTGCGTTGG GTGGCTGGAT TCCTGCTTTG
GGGGTTTATT ATTCTCACGG TCATGTCGAT GGTGATAGAG AACACTGGGC TACTGAAGGC
AGGCCCTGGT CCTGTCGAGT TTGAACCAGA AGAGGGCAAA ATAGTCAAAT TCAGTGATGT
CCATGGGGTG GAAGAAGCTA AAGCAGTAAT TTTCAACCTC CCATGACTAG CTTATACAAT
ATGGCTAATG TCGCTGGGTC CTTAGGAATT GGAGGAAATT GTCGAATTTC TCAAGAATCC
GGAGAAGTTC TCGGCTCTTG GGGGCAAGCT TCCAAAAGGA GTCCTTCTGA CTGGCCCTCC
TGGTACTGGT AAGACTATGC TTGCTCGTGC TGTAGCAGGT GAGGCGGAAG TTCCGTTTTT
GTTCGCCTCT GGTTCAAGTT TTGACGAAAT GTTTGTTGGT GTCGGAGGTA AGTTTGGCCG
CAACACTTCA TGAGGCAGCG GCTGCTGATC TGATTGACCG CAGCCAAACG TGTCAGGGAG
CTGTTCGCTG CCGCTAGAAA GAAAGCTCCC GCCATCATTT TTATTGATGA GCTCGACGCT
ATTGGCTCCA AACGAAGCGC CAAAGATCAA CACTACATGA AACAAACTTT GAATCAGCTA
CTTGTGGAAC TCGACGGCTT TGAACAGGCG GAAGGTGTTA TCATCATCGC GGCTACCAAC
TTCCCTGAAT CTCTCGACAA AGCTCTTACC CGTCCTGGTC GTTTTGATAG ACATGGTCAG
TGTGGCGCCT CTCTAGTAAC TCTTAGCTTA CTTATATGCG ATCAGTTGTG GTCGGTCTTC
CTGACGTCCG CGGGCGTATA GAAATTCTCA AGCATCATAT GTCCGAAGTG CAATACGATG
TGGACGTTGA CCCTAGTGTC ATTGCACGAG GCTGCCCTGG TATGAGCGGT GCAGATTTAC
AGAACCTAGT CAACCAGGCG GCTGTCAAGG CTTCCAGGGA TGGATCGAAC AGCGTTCAAT
TGAAGCATTT CGAATGGGCT AAAGGTAAGG AAACTGGGTG ACTGACGAGT GTGGGGCTAA
TATAAGTCAC AGACCGTATT TTGATGGGAG CTGAAAGGAA ATCTCATTAT GTGACAGAGG
AGTCCAAGCG AGCAACTGCT TATCACGAAG GTGGTCACGC TCTTGTTGCT CTACATACTC
CGGGGGCCAT GCCTCTACAT AAGGTGTAAG CGGTTTCTGG GTTCGTGACA AATGGTACGC
TGCTAATACA TAATCAAGTA CTATTATGCC CAGAGGTCAA GCTCTTGGCA TTACTTTTCA
GCTACCCGAA CAAGACAAGG GTGAGACTTT CGCCATTGCA ATGGGTGTCA TCGCTTATGA
TCTAGCAGAT TCATATACCC GTCGCGAATT CAACGCTATG ATTGACGTTG CCCTTGGTGG
CCGTGCTGCT GAGGAAATGA TATTCGGACA TGACAACGTG ACAAGTGGAT GCTCAAGCGA
CCTTCAACGT GCAACAGATG TTGCTACTAG GATGATTCGG GTGCGTCATT TCTTTGATAT
TCCAGCGTCC ATCAGATTGA CCCTATCTAG AATTACGGTT TCAGTGACAA AGTTGGATTA
GTTGCTCATG GGGATGAAGA ATCTGTCTAT CTTTCAAGTA AGAAGAAAGA CGAGATCGAA
AGTGAAATTC GGAGGTATGC AGCAATGGCT CCTATGATGA TCATTTTGAT GGCTAAGTTG
GCTTTTAGTT TCCTTGATCA AAGTATGACC AGAACGGAGA ATCTTCTCAA GACGCACGAG
AATGAGTTAC ATCGAGTGAG TACTGCTTGA TTGAATCAAT ATCGGAAAAT AGAATTAACG
ATGATTGTCG CAGCTGGCTG ATGCACTCAT TGAGTACGAG ACTTTATCGT TGGATGAAGT
GAAGCAGGTG CTAGAGGGGA AGCGATTAAG CAGACCAACA ACTGAAGGGG AAAGTTTAAA
AGGTCAAGGT GAAAAGAGTG GGAAGGGTCC CATTGTTGAC GGCATTTAGC TTCTAAGAAG
ATCACTATGC ATCGGTCACA GAATAGACGT CAAGGAGCGG ATGCTAACTC CTTGTTGTTT
GGAGCCGAGT TATTGGTTCT GGATCTTCTT CTATGTCCTT ATATTCAATG TCTGAACGAT
GATCGACATC CCCGTGCAGT AATAGGGGAT AGCAAAGAGT CGACGGCAAC TTAAGAAAGA
CAAAAACTTT GCTGCCTATC AGATATCAGA TAATCTTGAT ATACGCACCT CGTAAGGCTC
ATTAATCTTG TCTGTTGTTG
 
Protein sequence
MLSRSVQVAG LDLFVSSRAA SLRCQKYCAR SPSNLKEMRI PRKTSQMVSR PCISFRGLHS 
STSWNGIFSS RSTTAKPSET QTDVDQPLTP FQARVAELEI KAHANKEDPD AQLEFLRQLS
EGGEFAGLVA YYEGMALAED TSGSQALLRN DEAWAIFMDA LARSGRLGDI VTKVRRRDQL
LASIGANGGS SSSAPLVSNP TPSSVSANNS STSTPSVSSP GPSLTSSLLS RAVSPTSLAN
ASNASTSQSH PGAGSPLNPI YVQMAPPTPQ MNAWRALRWV AGFLLWGFII LTVMSMVIEN
TGLLKAGPGP VEFEPEEGKI VKFSDVHGVE EAKAELEEIV EFLKNPEKFS ALGGKLPKGV
LLTGPPGTGK TMLARAVAGE AEVPFLFASG SSFDEMFVGV GAKRVRELFA AARKKAPAII
FIDELDAIGS KRSAKDQHYM KQTLNQLLVE LDGFEQAEGV IIIAATNFPE SLDKALTRPG
RFDRHVVVGL PDVRGRIEIL KHHMSEVQYD VDVDPSVIAR GCPGMSGADL QNLVNQAAVK
ASRDGSNSVQ LKHFEWAKDR ILMGAERKSH YVTEESKRAT AYHEGGHALV ALHTPGAMPL
HKVTIMPRGQ ALGITFQLPE QDKDSYTRRE FNAMIDVALG GRAAEEMIFG HDNVTSGCSS
DLQRATDVAT RMIRNYGFSD KVGLVAHGDE ESVYLSSKKK DEIESEIRSF LDQSMTRTEN
LLKTHENELH RLADALIEYE TLSLDEVKQV LEGKRLSRPT TEGESLKGQG EKSGKGPIVD
GI