Gene CNN00920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN00920 
Symbol 
ID3255463 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp293083 
End bp296591 
Gene Length3509 bp 
Protein Length846 aa 
Translation table 
GC content51% 
IMG OID638254508 
Productconserved hypothetical protein 
Protein accessionXP_568648 
Protein GI58262476 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAAAAAAATC GGGGGCCAGC GCCGTGAGCC ACCGCCCACA TACACTCAAC CCCCTACGCA 
TGGTCGCCCC CGGCAGAATG GTCACCTCGT CCCGCCGAAT AGGGCGGGTG ACCAACAAGA
CCAAGCTTAT AATCTATCGC GGCTCAGACA AGGTCGACAC TTCAGCGGCG GAAACGGTCC
TGTGGGATCA GGAAGCCGGT GGAGCCGGGA AAGACAGCAA CAAACATCAG CACATCGGTG
CAACCGGTGT AGAGTCTGGT GAACTATTGG TACGTATCTG GATGTGCTCT TTCACCTTTC
ATTATGCATG CAGGGGGGTT GCATTATGCA GTTGTTGAGG AAATGGGCCG CCAATTGATG
GTGGTGTTAT TTTCCTTTTT TTCACTCGAC GAAACGAGGT CCGCGCGGGG AGAGGCGCGG
CGTCGTGGGA GGATGTTTTC CTCCATCCAT GCCTTTTTCG CGCTCTTCTT CCTGCGAGTG
GGCCAGGATC GAGATGGATC CACGCGTTCT GTGCATCGGA GCAGCCGTAT AGGTCCATCC
AGCGTGCATG CACCAGCGAG GACGCGTACA GTGGTCGTCT GACCGAAATT TCCGCTTTCG
GCAATTCGCG CCTTGTTTGT TTGTCATATA ACTGATCCGC GTCTTATACA GGAACACCAT
CTTCAAGCAG CGTTGTCGTC CGCGTCCCTC CTCCATTCCA GCAACAAACC CTCGTCGCCC
AAGTCGGTCA AGGAAGCACC AGCTGCTGCG CTCAACTATC ATATTCCTAC ACCAGACGCT
ACGGGCCTCG TCTCAGATAC AGTTTTCAGC CAACTCTATC AGCGTACCAA GTATGTTGAA
CCTTACAACT TCATCAGGTT CTCGGATACG GTTGAGGAAA GTAGTTGCGG ATGGGGAGGA
TTAGGGTATT GCATGGATGA CGCGGATGAG CGATGGCTCA ACGACTTCAA CTCGAAAGCT
GAGGGATCGA GTGGGGACGT AAAGTCTGAC AAGGAGCAGG GCCGAGGGAT GAGAGTAAAG
GGTAAAGACA GGGAGAAGGA AAAGGGTGAC GCTCCGGCTC CCTTGGTTAT ATCCGAGGAT
ATGTTTGAGT ATATCATGGG AGTGTTTGAG AAGTACACCG AGGAGAATGC GCCAATGCTT
CACACCGTGA GTCCGCATGA CCAGTTGGGT CCAATGACTG ACGTCCCATA GGACCTCTCT
CTTCTTCCCC CATTCTCTGC TGTCGAGAAT ATGTTCTCGA CGCCCATCTC ACCTGCATTC
TTGCCCAGCA ATGAAATACC CAAGGAACTG GGCGACCTCA AGGCTTGTGC CAGGATGGCA
AGGAATGTCT ACCCACATTG GAAATCTCGA AGGGAACAGC GACAAGGGAA GTCCATCCTC
CCCCAGCTCA ACTATGACGA AACCAACGAC AACGATCCTT ATGTCTGTTT CCGTCGACGA
GATATCCGAG CTACCCGTAA AACCCGTCGT ACCGACAACT TTTCCATTGA ACAATTTCAG
AAACTTCAAT TCGAGCTCCG TAGTGCTCAT GCTCTTGCGG ACCGTGTGCT CACCCGTGAA
CGGGAGAAAA AGTCATTGTA TGAGGCCGAA AAGGAACTCT GGGAGGCGAG ATGGAAATTC
TTTGAAACCA AGAGACGCTG GCCTAGCTTG GGTATGACTA GCGATGAGGA ACACAAGATT
ACCGGTAGAC CAACGATCGT CCCTCCTATC CAAATTCCAT CGCTCTCTGG TCAAACTCCT
CTCACATCGG GCCAGTCATC GTCGCATATG CGCAAGAGGA CCGACAAGGA TCGAGAAGAA
AGAGCTCAAC GGGAGAGATA TGATGCACAG AGGAATGCGG AGAGATCAGG CATTCTTTCT
GGCAGGTCGA ACGCTCCGGA TGCTCTGAAA GAGAGGTTAC AAGCTCTGCA GCAGAAGACG
GAGGAAATGC TGGCGAGGAA GAAGGAGCAG GATGCGCACT GGGACGACTC TATAGATGTA
AGTTCCATCG ATACCGCTTT TAGACCGTTA CTCATTCGAT CTTCAGTCCC CGTATCAACC
GTTACCACCT TCGAACTCTG TACACGCTTT CCGATCACTC TTCGTCCTTG ATCCTTGTCG
TGCTCAATGC AAAGACTCAG AGACAGGGAA TGAAATCCTT CATCCCGAGT CATTCCGTAT
CCGCCGAGGC CGAGGAGGTA TTGTCCGACT TGACCGACGC ACTTCTATCT ACTCCCATCG
TCGCGGCATT CAACCCACAT CACCATCCGA ATATCCTACA TGGCTGTTCC CTGATATCGC
CCCTCGTCGA TCAGAAAAGA AGAGGCCTAG GTCGATTGAT GAGGTCGAGG AGGAAATGCA
GGAGCAATCG CCCAAGGTAA TGAGGAAAGA CCTAAATGAG ACTTGGAGAT ATGACGTGGA
TAGAGGAGGA GCAGTGGGTG TGGGCATGGG GTTGGAAGAA GACTATGACA GAGTCATCAT
CGATGATCTC GAGGCGAAGT GAGTACTCTG GCTCTGGGCT ATGCTTCACT ACTAATTTAT
TAATAGATAC ATTCGGCATC GCATTTCCTT ACTTCAAGAG AGCGACTGTG CTAAGCTTAG
ACCAGATAAT TATATTCTGG ATCAAACACG CGAAGCTCTT GATGCGGCCG CCGATGCGAA
ACCTCCACCC GCTCCAATTT TCCAGAAGCC TCCAGCGCCT CAACCGAATC CACAACTCCT
TGCAGCGCAC TTGCAACAGC AGCAAATGTT GGCGCAGCAG CAGCAGATGG AGCAGTTTCA
GAGATTCCAG CTTATGGCCC AGCAGCAAGC TATGGCTCAG GCTCAGGCTC AAGCACAAGC
ACAAGCTCAA GCGCAAGCCC AGGCACAAGC TCAGGCGCAG GTGCAAGCTC AGGGACAAGG
ACACCCCCAA GCCCATCTGC AAACACATCC ACAAGGTGTT CCTCAGCCCA ACGGCGTCAA
TTCTCCCATG CCAAATGGTC AACAAATGCT TCCACCGTCC GACGGTGTGA AGCAACTCAA
ACTTCCGCCA CATGCTGTCG CAAGGCTTGG AGCGGCGATG GCGAATGCAA ACGCAAATGC
CAATGGTGGT CTCCATGTAC TGCAACAACA GCAACAACAA CACGCGCAAA CATCTCAGCA
ATGAGCAGCG TCGCCGCTGT CAATTGGTGT GTAAAAAGAG GAAGATTGCG CCGGCCGCGT
TGTACTATGA CCAGTTATCC TCATCGCTCC ACGCACGTCC GTCATCATAG AAACCCGACC
TCTCAATCAC ATATCACCGC TTTTCAGACC CATGATATCT CAGCATAAAT ACCCCTTTCT
CATTCAACTG AAACTATTAG TGTTCTTGAT AAGCTCTCGC CCAAATAAGG GAAGAGCAAA
GGTGTAGGCT GGAGTGAAAG AAGATGTAGC TGTTAGACCA AGTAGTGTCT AGGGATGGTT
TAAATAGCCA GGAGCCAAAT TACTCAGTTG TACATTAGAC AATACAAGAA AACGCATATA
AACAGTGGTA TGCAGTTTTG GATACTTTT
 
Protein sequence
MVAPGRMVTS SRRIGRVTNK TKLIIYRGSD KVDTSAAETV LWDQEAGGAG KDSNKHQHIG 
ATGVESGELL EHHLQAALSS ASLLHSSNKP SSPKSVKEAP AAALNYHIPT PDATGLVSDT
VFSQLYQRTK YVEPYNFIRF SDTVEESSCG WGGLGYCMDD ADERWLNDFN SKAEGSSGDV
KSDKEQGRGM RVKGKDREKE KGDAPAPLVI SEDMFEYIMG VFEKYTEENA PMLHTDLSLL
PPFSAVENMF STPISPAFLP SNEIPKELGD LKACARMARN VYPHWKSRRE QRQGKSILPQ
LNYDETNDND PYVCFRRRDI RATRKTRRTD NFSIEQFQKL QFELRSAHAL ADRVLTRERE
KKSLYEAEKE LWEARWKFFE TKRRWPSLGM TSDEEHKITG RPTIVPPIQI PSLSGQTPLT
SGQSSSHMRK RTDKDREERA QRERYDAQRN AERSGILSGR SNAPDALKER LQALQQKTEE
MLARKKEQDA HWDDSIDSPY QPLPPSNSVH AFRSLFVLDP CRAQCKDSET GNEILHPESF
RIRRGRGGIV RLDRRTSIYS HRRGIQPTSP SEYPTWLFPD IAPRRSEKKR PRSIDEVEEE
MQEQSPKVMR KDLNETWRYD VDRGGAVGVG MGLEEDYDRV IIDDLEAKYI RHRISLLQES
DCAKLRPDNY ILDQTREALD AAADAKPPPA PIFQKPPAPQ PNPQLLAAHL QQQQMLAQQQ
QMEQFQRFQL MAQQQAMAQA QAQAQAQAQA QAQAQAQAQV QAQGQGHPQA HLQTHPQGVP
QPNGVNSPMP NGQQMLPPSD GVKQLKLPPH AVARLGAAMA NANANANGGL HVLQQQQQQH
AQTSQQ