Gene CNF01000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF01000 
Symbol 
ID3258019 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp309430 
End bp311877 
Gene Length2448 bp 
Protein Length491 aa 
Translation table 
GC content50% 
IMG OID638257224 
Productendopeptidase, putative 
Protein accessionXP_571542 
Protein GI58268772 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATCATCACAA TGCACTACCT CGCCGCCGCC CTTCCACTTC TCACACTCGC CCTCGCCGCG 
CCCAGCAACC GCCCGAGCGT GCCCCTCACA CCCCTCAACT CTAGACAATA CTCGGATGAC
CTCGTCGAAC GCCAGTCATG GCTGCTCGAT CAAGCCAAAG GCCTCCGGAG CAAGTACGCT
CCTCATCTCG GTGAGAGAGG ACAAGAGCTT CGTAGGAGAG ATATAATAGA CGAGGGTATC
ATGAGACGGA AGCGGGCGAA CCAGAAAAGA GCTACGGGGA CCGTTTCGTA AGTTATAGTT
TTTGACCTTG TTTTTTTTGT GTTTGGCTGC GGCTGCGGAG GCGGCAATTC ATTTTCTGCT
TTACAAGCCG TGCTGGCTTA AAAATTGCCA CCGACCACTT AAGCATGGAT GGACAGATTA
CCTGTGCACC GAGAACACCT GGGTGCCAGG CGTAGAGGAG TGGAATCACC TAACGGAAGA
GCTGACAGCA GGACATAGCC TTACCGATGT GGGTCTCGAT GCTTCGTACG CTGGACAGGT
GTCGATCGGG TGCGTCTTCA TTCCTCATTC CGCCATGCAT TCACCGCTGA CGCTCTCCGC
AGCACACCGG CACAGGATTT TCTTGTGATC ATGGACAGCG GCTCCTCTGA TCTGTGCGTT
TTCCCCTTGT TTGCGCATTG AGACTCGCTC AGGGTCTCTT GCTTGGCATT TTGCATTGGC
TGACGATCCG GCCGCCGGTA TTTCTAGCTG GGTTGCTGGT TCTACATGTA CAGACAGCTT
TTGCAGTCAA ATAACCACCT TTGACACCAG TGCCTCGTCT TCGTTCACCA CCAGCAATGA
GGCGTTCAAC ATTACCTATG GATCGGGCGA CGCCGACGGG ACTCTTGGCA CGGACACTGT
CTCGATGGCC GGCTTCACCG TTTCTGACCA GACTTTCGGT ATGTTTTCTT GTCGTCTGCA
ATGCCGACAA GCACCGGCTG ACTTTTGAAT GCATAGGCGT CGTCACTTCT ACATCCGCTG
ATTTGATCAG TTACCCTCTT TCCGGTCTTA TGGGCTTAGC ATGGAAGTCG ATTGCTTCTT
CCGGCGCGAC TCCCTTCTGG CAAGCTCTTG CCGCCTCTGG TGACTGGGAT TCCCCTGAAA
TGGGTGTTTA CCTGAAGAGA TACAGGGGTG ACAGCACTGC GAGCCAGATC GAGACTGATG
GCGGAGAGAT CCTCTTTGGG TGAGTGCATC GTCGTATCAT GACGATCACG AGCAAAGGAC
AGAATTGATG AACGTTTATA GTGGGCTTAA TACGAGTTTG TATAATGGTA GTGTCAACTA
CATCTCCATC GACGAATCCG ACGAAGATTA TTGGAGAATT CCTCTCGAGG CTATGGTCAT
CCAGGGCAAC TCTGTCTCCA TTGCCTCCTC TTCCGGCGGC AGCAACCCTT CTTGTGCTAT
CGACACTGGA ACTACCCTCA TTGGTGTCCC CTCCCAGACC GCTTACAACA TCTATTCTCA
AATTGAGGGT GCCGAGGCGC TTTCTGAGTC TACTGGTTAT GAGGGTTACT ACCAGTACCC
ATGTGACACT GACGTGACCG TATCTCTCCA ATTCGGCGGC ATGTCTTATA GCATCTCCAA
TGCCGACATG AACCTCGGCT CTTTCACTAG GGATACTTCA ATGTGTACCG GTGCCTTCTT
TGCCATGGAC ATGTACGTTT TCATTCCCAT CTTTCTCTTC CAGTCTGGAA CTGTCATCTG
ACCTTGATCT TTTTACCAGG TCTTCCCGAT CTCCCGTCCA ATGGATCGTC GGTGCATCCT
TCATGAAGAA CGTCTACACC TCTTTCCGAT ACAACCCTGC CGCCATTGGC TTTGCCGAGC
TCGTTGGCGG TTCCTCCGTC TCAACCGGCA ATTCTTCTAG CTCTACCACT TCTGGCGGTA
CCTCTGGTTC TAACGGCGGT GGATCTTCTT CCAGTGGCAC TATGGAGAAG AGTGTGCAGT
TGGGATTGCT CGTCGGTGCT GCGGTTGTTG GCCTTGCCGC AATGATCTAG AGAAAAAGTT
GATAAGTATG GGAAGGCGGG GAAATCTAGG TTGGTCGTAC CTGTGTATTA AAAGCGAAGA
CAGTCATAAC TGTAGCCATT AAGGGAAGGG AGGGAGAAGT CAACCTGAGT GTGGATATAC
GAGGACACTC TTGGTTAATT TTTGCCCCTC AAAAAACTAG AAAACCCCTG CGAAAACCAT
CAAGTGTTGT ACATATCACC TCACCGTCTT TTATAACAAT AGCAACAGCA TTACTTTTAC
CATTTGGATT ATCGGTGGCC GGCCAATAAT CAGCCTCATT CATTATATCT TTACATATAT
TTATCTTGTT GCATTTAGCA TTTCCCCGAA CGTTGCATTA GGTAATTCAT TCGTATGATT
GTCCAGATTT TCGATAGCAA TTGATAGCAA ACGATAACAA TTAGTAGC
 
Protein sequence
MHYLAAALPL LTLALAAPSN RPSVPLTPLN SRQYSDDLVE RQSWLLDQAK GLRSKYAPHL 
GERGQELRRR DIIDEGIMRR KRANQKRATG TVSLTDVGLD ASYAGQVSIG TPAQDFLVIM
DSGSSDLWVA GSTCTDSFCS QITTFDTSAS SSFTTSNEAF NITYGSGDAD GTLGTDTVSM
AGFTVSDQTF GVVTSTSADL ISYPLSGLMG LAWKSIASSG ATPFWQALAA SGDWDSPEMG
VYLKRYRGDS TASQIETDGG EILFGGLNTS LYNGSVNYIS IDESDEDYWR IPLEAMVIQG
NSVSIASSSG GSNPSCAIDT GTTLIGVPSQ TAYNIYSQIE GAEALSESTG YEGYYQYPCD
TDVTVSLQFG GMSYSISNAD MNLGSFTRDT SMCTGAFFAM DMSSRSPVQW IVGASFMKNV
YTSFRYNPAA IGFAELVGGS SVSTGNSSSS TTSGGTSGSN GGGSSSSGTM EKSVQLGLLV
GAAVVGLAAM I