Gene CNA01380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA01380 
Symbol 
ID3253591 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp377082 
End bp380656 
Gene Length3575 bp 
Protein Length950 aa 
Translation table 
GC content51% 
IMG OID638252470 
Productpeptidase, putative 
Protein accessionXP_566599 
Protein GI58258373 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGCAACTGCG TCCACTGTTC CCCGTTCCGC CTCGAACCCT TCCGTCCTCT TTCATCTTCT 
CATTTCCCTT TCATCCTGCT TTCCACAACC TTCTTGCACA CCGGTATTCC GCCGCCCACA
TTCTTTATTA GCCCTTCTTA TAATACATCG GGCGTAGGCG TTCTTTCCAT CCGTAATGGT
CGCCTTCCAC GCGTCCTCCG CGCTCCTCTC CTTTGCCCTT CTTGCGCCTG GCTTTGCCAA
CGCCTTCAAT CTTGATGATA TCAAAAGGGG TTCCGAGTAA GTTACATGGT CAATAGTAGA
CACGCACAGG GCTGATCCCT CTACAGCTCC GTCTTACCGG GGAGGTACAT TGTCGAATTC
GATAGCGATG CTCACCTCAC CTCTGCCGGT CTAAAGAGAG CCGCTACAGT GAGCTGACCC
TCGCGTACCA TCCGCGTAAC AAGCTGACTT TAATTTTTCT TCAAGCCCCA TGAATACTTC
TATAAAGAAC TTGATGCTCG CTCGTCCTCT TATACTGTCC ACCAGGAATA CGATTGTGAC
TTGTTCTACG GCGCGTCTCT CTCCATTTCC TCTGATGTTG TGAGTGACGC ATCCTGAGCC
CAGTTGATGT CCATCTAACC CAAATTTTAG GATCTTGAGA GCCTGCTCGA CATCGCTGGT
GTCATCGATC TCCGGGCTGT TCATTTACTC ACCTTGCCCG CCGAGCCCCT TTCCAAGGAA
AACACCCAAT GGTCCGCCAG TTCTCACTCT TCGTCATCCA GCTCTGCATC TGCCACTGCT
TCAGCTGCTT CCACTTCGGC ACCGTTCTCC AACCTCCCCC AGATCCAGGC TGACGTTGTC
CAAGCCTCTG GGAACAAGGG AAAAGGAATC AAAATCGGTA TCATTGACGG TGGTGTGGAC
TATACACGAG AACCTCTTGG TGGCTGTTTC GGGCCCGGCT GCAAGATTGC GGGAGGTTAT
GACTTTGTCG GTGATGATTA CGATGGAACT AATGATCCTG TGCCGGACAA CGATCCTTAC
GACAACTGTT ATTCCCACGG CACTTTTATC TCTGGCATCA TTGGTGCCAA CGAGAATGTC
TACGGAGCCG TAGGTGTCGC CCCCGAGTCG AGCTTGTATG TCTACAGGGT GTTTGGTTGC
AACGGCGCGG CTTCGGATGA CATTGTCCTT GCGGCGATGC AAAAAGCGTA TGACGACGAC
ATGGATGTCA TCAACCTCTC TCTTGGTAGG TTTACTCCTT TAAGTGAAAG ACAAAACGTT
TAGCTCATAC GATTTCTAGG TGAACCCTCT GGATGGACCG AGAGTACACT TAGCGTCTTT
GCCTCCCGAC TTGTTGCCAG AGGCACAATT CTCACCATCT CTGCTGGCAA TCAGGGGCAA
GTCGGTGGCT TTTACTCTTA CTCTCCTTCC GCGGGTAAGG GAGTCATCAA CGTTGGTTCC
AGTGACAGCT CTATCTATCC TGCTCATCTG GCCACTGTTT CCACCGGTTA CGGCCCCATT
GCCTACTACA ACTACAAAGC GTACAGCGAC AAGACGTTGC CCCTCTACAC TTTTGACTCG
GACATCTATG GATGTACCTT GCCAGACGAT GTTCCCGACT TGTCACCTTA CCTCGTCGTC
GTCCGTCGAG GTGGATGTTC CTTGTCTGAA AAGGCTCAAA ACGTCTACAA CGCCGGTGGA
ACTGCCATCT TCGTCGTCAA TGACGAGACT TCCCTTCCCA TCTATCAAAA TTTCCCTCTT
ATCGACTTTG CCCTCATTAG CGACGATGAC GGTAACTACC TCCTCAATCA GCTCAACACA
TCCGCCAACA CCACCGTCTC TTTTTCTTTC AACCCTATCG CCCTGCCCAA CATATGGACA
GCCAATACCA CATCTTACTT TTCAGAAATT GGTCCTACCA ACGATTTGTA CTTTGCTCCA
TCTGTGCTCG CCCCTGGTAC CAACGTTGTC GGTGTCACGC CTACGGCTTT TTACAACTGG
ACTATCGCGG ATGGCACTTC CTATTCTTCT GCCTATGCTG CTGGAGCTGC CGCTCTCTAC
CTCGCCGCCA AAGGTACAAA CAATGTCAGC CCGAGCGACG TCTTGTCTGC GTTCGAAGTC
ACTGCTCAGC AACTTCCCGT CTCCGTCTCT GATAGCTCTC TTGTGAGCGT CGCTGTCCAG
GGTGCAGGCA GAGTGCAGCT CAGCGACGCT ATCTATGCCG TCGCCGACAT TTCTCCTTCC
GAAATTACAT TGAACGACAC GGCCAACTTT GATAAATTGC ATGTGCTGAC AATCAAGAAC
CCTGGTAAGA AGTGGGTAAC GTACAAACTG TCTCACGAGC CTGCCGGTAC CGCTTTGGCT
TTCCAATCCG GGCTTAATCA GTCGAACGAC CAGCCTTTGC CCCAAGTATC CAACGCAGCA
TCCGTCAACA TCTTCCCTTC CTCACTCACC CTCTGGCCTG GTCAGTCTCT CCTCACGACC
GTCAAGTTCA CCGCTCCTAC TGGTCTTGAC GCTCAGACCT TTCCCATCTA TTCTGGATTC
ATCAAGGTCA CCGGCGGTGG CAGCACTGTC AAGGTCCCTT ATATGGGTGT CGCGGCCAAC
ATGAAAGACA TGCCCGTCCT CGACCCTACG GATTGGTATC TCGGCATGAA CTCGCCTGCT
ATCGTCGATG TGGATGGCAA CGTCCAACAA GGCCCGGCAA CATATACGTT TAGCAACGTG
AGCTACCCCT CGGTGCTTTA CCGTCTTGCC GGAGGAACAC CTTTGCTCGT GATTGACTTG
ATTGACGCAA ATGCCAACCT TACTTTTACT CCCGATTACA CCACTCGAAA GCGTTCCCCG
ACTTTTGAAC AGGAAACGGA GAACGACGAG CGCCGCCGCT CACTTTCCGC TCGACGTCTT
AAATCTACCG GCGCATCCTT TGCTGCTACC ACGAAGAGCA AGGGTCTCCA CTCTCTTTGG
TGTCACTTGA CACACTATAA AGCTTCCGGG TGCTCAAAGA CTGGCAGCAC GTTCCAGCAG
GTATCTATCA TTGGGAACTT GTATGTGGGC GAGTACTTGC CGAGGAGCAC GGATAACGTT
GATGGCCAAG GAGGGGATTA CTCGACCTTT GAATTGAGTT CAGCGACATT CTCGAATGGG
ACGACTATTC CCAATGGAGA TTACAAATGT GAGTGTTTTT TGTCGACATG TGTCTTGCCT
CAACGAAAAG GGAAAAAATT GAAAATGCTG ATGGTTTCTA GTCCTTATGA GGGCGTTGCA
CATCACGGGG GATAACACGA ACGAATCAGA TTATGAATCT TGTAAGCCTA CTCACCTCCA
GTCGTTTGTG AGAACCTTGT GCTGATGACG GCTTGGACAG GGGTTTCTCA ATCATTCACC
GTTGCCCAAT AAGCTCACCA TCTCTGCTTT TATGTGTGTG TTCCTAGTGT TTCAAGGACA
GTTATTATCC ATACCGAACG GACATTTCCA AACCAAAGGC TGCCATGTAC GTAGTTCAGT
AGTATAATAC GCTTTGCTTT TCTTTTTCTT TTTTTCCTTC CTGGAATCTT TGGACACTCG
CGATGAGAAG AAAAATAGCG TCATGAAACG ATGAA
 
Protein sequence
MVAFHASSAL LSFALLAPGF ANAFNLDDIK RGSDSVLPGR YIVEFDSDAH LTSAGLKRAA 
TPHEYFYKEL DARSSSYTVH QEYDCDLFYG ASLSISSDVD LESLLDIAGV IDLRAVHLLT
LPAEPLSKEN TQWSASSHSS SSSSASATAS AASTSAPFSN LPQIQADVVQ ASGNKGKGIK
IGIIDGGVDY TREPLGGCFG PGCKIAGGYD FVGDDYDGTN DPVPDNDPYD NCYSHGTFIS
GIIGANENVY GAVGVAPESS LYVYRVFGCN GAASDDIVLA AMQKAYDDDM DVINLSLGEP
SGWTESTLSV FASRLVARGT ILTISAGNQG QVGGFYSYSP SAGKGVINVG SSDSSIYPAH
LATVSTGYGP IAYYNYKAYS DKTLPLYTFD SDIYGCTLPD DVPDLSPYLV VVRRGGCSLS
EKAQNVYNAG GTAIFVVNDE TSLPIYQNFP LIDFALISDD DGNYLLNQLN TSANTTVSFS
FNPIALPNIW TANTTSYFSE IGPTNDLYFA PSVLAPGTNV VGVTPTAFYN WTIADGTSYS
SAYAAGAAAL YLAAKGTNNV SPSDVLSAFE VTAQQLPVSV SDSSLVSVAV QGAGRVQLSD
AIYAVADISP SEITLNDTAN FDKLHVLTIK NPGKKWVTYK LSHEPAGTAL AFQSGLNQSN
DQPLPQVSNA ASVNIFPSSL TLWPGQSLLT TVKFTAPTGL DAQTFPIYSG FIKVTGGGST
VKVPYMGVAA NMKDMPVLDP TDWYLGMNSP AIVDVDGNVQ QGPATYTFSN VSYPSVLYRL
AGGTPLLVID LIDANANLTF TPDYTTRKRS PTFEQETEND ERRRSLSARR LKSTGASFAA
TTKSKGLHSL WCHLTHYKAS GCSKTGSTFQ QVSIIGNLYV GEYLPRSTDN VDGQGGDYST
FELSSATFSN GTTIPNGDYK FLMRALHITG DNTNESDYES WVSQSFTVAQ