Gene CNA02800 details


Gene Information

Locus tag: CNA02800
Symbol:
ID: 3253585
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 727175
End bp: 729248
Gene Length: 2074 bp
Protein Length: 522 aa
Translation table:
GC content: 47%
IMG OID: 638252611
Product: zinc-finger protein zpr1, putative
Protein accession: XP_566683
Protein GI: 58258541
COG category: [R] General function prediction only
COG ID: [COG1779] C4-type Zn-finger protein
TIGRFAM ID: [TIGR00310] ZPR1 zinc finger domain
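
The reported gene length follows from the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption, although it is consistent with the values listed above. A minimal sketch of the arithmetic, using a hypothetical helper named gene_length:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the length in bp of a feature.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record itself).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record.
length = gene_length(727175, 729248)
print(length)  # 2074
```

Because the gene is spliced, this is the genomic span including introns, so it is larger than three times the protein length.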


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.700154
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTTTGAGCCG TCTCCTCGAG AGACGTCTTT GAAACCAAAA TTTTGCCTAT ATGTCTGATA 
CTCTTACTTT TGTCCCCAGT ACATGATTTT GTTCACGACA ACAGTAGACT AACAAATATT
CTTAGGCTTT GAAATCGATC AACCATGTCC TCTGATAAAA CCAACCTTTT CCCCACTTTA
GGTGAGGTGG CGGACCGCAC AGGGAAGGCT GAGAGTTTGG AGCAGGAAGG AGATGACAGA
CAGATGCAGG AGATTGAGAG TCTTTGTATG AGATGTCATG AAAATGTATG TCATTGCCGC
AATTTCGTAC ATGTTGCAGA CATTCGGAGC TGATGCAAAG GGCAGGGCAC GACCAGACTG
CTCTTGACCA GCATCCCCTA TTTCAAGGAA ATAGTGGTGT CTTCTTTCAG ATGCGATCAT
TGTGGTCACC GTGATACGGA GATTCAAAGT GCCGGTGAAA TCCAGCGTAA GTGACTGTCG
CTGGCGGCAG AAACAGCGTG AACTAAATTC CGTTACTTGC TAGCCAAAGG CGTCAGCTAC
ACCGTACACC TTCTCACACG TGCCGATCTC GACCGACAGA TTGTCAAGTC TAATTGGGCT
ACAATTACCA TTCCCGATAT CCAGTTGACT ATCCCTCCTG GTCGAGGGCA AATCAATACT
GTTGAGGGCA TTATTCGTGA CACTGTACGA GATCTTAACA TCAGCCAACC TGTCCGACGA
GTCATGGACC CCGAGACGGG TAAAAAGATT GACGAGCTCC TCGAGAAGCT TAGGGCGGCA
ATTGACATGG AGGAGGATGA TGAAGACGAT GGAGGTGTTG GAATGGATGA CGATGTGAAA
CCCGTACACC ACGAACCATC CAATTCTTCG TCTAAAGAAG AAAAACCTTT CGTCCCCTTC
TCTATGATCG TCGATGATCC GTCTGGCAAT TCTTACTTCC AGTTTAAAGG GTCTCAATCA
GATCCTCAAT GGAACATGAG AGCTTACAGT CGGACATTTG ATCAGAATGT GATATTGGGT
TTGGTCGCTC GACCGGAGGA TATGTCTGAG GAGCAGCCGG AAGGCGTCCC GATTGTCGCT
GCTGACCACA AACTGAGCAG TGCGGAGGAG TTTGAGTCGA AAAGGAACAA GAACGTGATC
AATCGGGATG ACGGGACAGT TGTTCCGGAC GAGATTTACA GCTTCCCTGC TACGTGTTCT
TCATGTGGAC ACCAGCTTGA GACTCTCATG CAGCAGGTCA ACATTCCTTA CTTTCAAGTA
AGTTTTTTTT CTTTGCCTAT GTAGCTCGTC GCTAATTCTT GTCGTTAGGA TATCATCATT
ATGTCAAGCA ATTGCTACGC ATGTGGATAC CGAGATAATG AAGTCAAGTC TGGTGGCTCG
ATCGCTCCCA AGGGTAAAAG GATTACTCTG AAGGTTGAGG ACGAGGAGGA TCTTAGTCGA
GACATGCTCA AGGTGAGCGT ATCTTTGTAC TGTATTACAG CTAGTAACTT ACTGCTTCTG
TAGTCTGATA CTGCTGGTCT ATCAATTCCC GAAATTGACT TGGTGCTTCA ACCTGGTACC
CTTGGAGGCC GTTTCACCAC TCTTGAAGGT CTTCTCAATG AGATTTACAC CGAACTCAGT
ACCAAAGTTT TCCGAGCTGG TGACTCTACT ACCGCTGGTA TCGGACAAAC GGATTCGAGC
GCCGGTGAAG ATGAAGCAAA CTTTGGGGAT TTCCTCAAAG GCTTGAAGGA GTGTATGTCG
GCCCAGAGGC AGTTCACTCT CATCCTTGAC GATCCAGTGT CCAACTCCTA TCTTCAAAAC
CTTTATGCGC CTGATCCTGA CCCGAACATG CAAATCGAGG TGTATGAGCG AACGTTTGAG
CAGAATGAGG AACTTGGTCT TAACGATATG GTCGTGGAAG GGTATAATAA GGAAGCTGAG
GGAACGGCGT AAGGTTCAAA TATCTGGATT GGGCAAGCTG GAATGTTATG CTTTGTAATC
ACATAGGGAA ATTAGAGTAG AAGACGTGCT GCTATCATCG TCTCCGACAT GTGGTTTGCT
TTCATTTATT GTACTTCATA GACATGCATA ATTA
 
Protein sequence
MSSDKTNLFP TLGEVADRTG KAESLEQEGD DRQMQEIESL CMRCHENGTT RLLLTSIPYF 
KEIVVSSFRC DHCGHRDTEI QSAGEIQPKG VSYTVHLLTR ADLDRQIVKS NWATITIPDI
QLTIPPGRGQ INTVEGIIRD TVRDLNISQP VRRVMDPETG KKIDELLEKL RAAIDMEEDD
EDDGGVGMDD DVKPVHHEPS NSSSKEEKPF VPFSMIVDDP SGNSYFQFKG SQSDPQWNMR
AYSRTFDQNV ILGLVARPED MSEEQPEGVP IVAADHKLSS AEEFESKRNK NVINRDDGTV
VPDEIYSFPA TCSSCGHQLE TLMQQVNIPY FQDIIIMSSN CYACGYRDNE VKSGGSIAPK
GKRITLKVED EEDLSRDMLK SDTAGLSIPE IDLVLQPGTL GGRFTTLEGL LNEIYTELST
KVFRAGDSTT AGIGQTDSSA GEDEANFGDF LKGLKECMSA QRQFTLILDD PVSNSYLQNL
YAPDPDPNMQ IEVYERTFEQ NEELGLNDMV VEGYNKEAEG TA
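
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before their length or GC content can be checked against the reported fields. The following is a rough sketch of that cleanup. The helper names clean_sequence and gc_fraction are hypothetical, and the example input is only the first 30 bases of the gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # which removes both the spaces between blocks and the line breaks.
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, used only as an illustration.
example = "GTTTGAGCCG TCTCCTCGAG\nAGACGTCTTT"
seq = clean_sequence(example)
print(len(seq))  # 30
print(gc_fraction(seq))
```

Applied to the full gene sequence, len(seq) can be compared with the Gene Length field and gc_fraction(seq) with the GC content field. The same clean_sequence helper works for the protein sequence, whose length can be compared with the Protein Length field.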