Gene Information |
Locus tag | CNA02800 |
Symbol | |
ID | 3253585 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 727175 |
End bp | 729248 |
Gene Length | 2074 bp |
Protein Length | 522 aa |
Translation table | |
GC content | 47% |
IMG OID | 638252611 |
Product | zinc-finger protein zpr1, putative |
Protein accession | XP_566683 |
Protein GI | 58258541 |
COG category | [R] General function prediction only |
COG ID | [COG1779] C4-type Zn-finger protein |
TIGRFAM ID | [TIGR00310] ZPR1 zinc finger domain |
|
|
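The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (which matches the reported 2074 bp) and that the protein corresponds to a single complete coding sequence ending in a stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int) -> int:
    """Coding bases for a protein: three per residue, plus one stop codon (assumed)."""
    return (protein_aa + 1) * 3


# Values copied from the record above.
start_bp, end_bp, protein_aa = 727175, 729248, 522

span = gene_length(start_bp, end_bp)      # genomic span of the gene
coding = min_coding_length(protein_aa)    # bases needed to encode the protein

print(span)           # 2074
print(coding)         # 1569
print(span - coding)  # 505 bases not accounted for by the coding sequence
```

The 505 bp difference is consistent with the gene being marked as spliced: under these assumptions it would be made up of introns plus any untranslated sequence included in the gene span.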
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.700154 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTTTGAGCCG TCTCCTCGAG AGACGTCTTT GAAACCAAAA TTTTGCCTAT ATGTCTGATA CTCTTACTTT TGTCCCCAGT ACATGATTTT GTTCACGACA ACAGTAGACT AACAAATATT CTTAGGCTTT GAAATCGATC AACCATGTCC TCTGATAAAA CCAACCTTTT CCCCACTTTA GGTGAGGTGG CGGACCGCAC AGGGAAGGCT GAGAGTTTGG AGCAGGAAGG AGATGACAGA CAGATGCAGG AGATTGAGAG TCTTTGTATG AGATGTCATG AAAATGTATG TCATTGCCGC AATTTCGTAC ATGTTGCAGA CATTCGGAGC TGATGCAAAG GGCAGGGCAC GACCAGACTG CTCTTGACCA GCATCCCCTA TTTCAAGGAA ATAGTGGTGT CTTCTTTCAG ATGCGATCAT TGTGGTCACC GTGATACGGA GATTCAAAGT GCCGGTGAAA TCCAGCGTAA GTGACTGTCG CTGGCGGCAG AAACAGCGTG AACTAAATTC CGTTACTTGC TAGCCAAAGG CGTCAGCTAC ACCGTACACC TTCTCACACG TGCCGATCTC GACCGACAGA TTGTCAAGTC TAATTGGGCT ACAATTACCA TTCCCGATAT CCAGTTGACT ATCCCTCCTG GTCGAGGGCA AATCAATACT GTTGAGGGCA TTATTCGTGA CACTGTACGA GATCTTAACA TCAGCCAACC TGTCCGACGA GTCATGGACC CCGAGACGGG TAAAAAGATT GACGAGCTCC TCGAGAAGCT TAGGGCGGCA ATTGACATGG AGGAGGATGA TGAAGACGAT GGAGGTGTTG GAATGGATGA CGATGTGAAA CCCGTACACC ACGAACCATC CAATTCTTCG TCTAAAGAAG AAAAACCTTT CGTCCCCTTC TCTATGATCG TCGATGATCC GTCTGGCAAT TCTTACTTCC AGTTTAAAGG GTCTCAATCA GATCCTCAAT GGAACATGAG AGCTTACAGT CGGACATTTG ATCAGAATGT GATATTGGGT TTGGTCGCTC GACCGGAGGA TATGTCTGAG GAGCAGCCGG AAGGCGTCCC GATTGTCGCT GCTGACCACA AACTGAGCAG TGCGGAGGAG TTTGAGTCGA AAAGGAACAA GAACGTGATC AATCGGGATG ACGGGACAGT TGTTCCGGAC GAGATTTACA GCTTCCCTGC TACGTGTTCT TCATGTGGAC ACCAGCTTGA GACTCTCATG CAGCAGGTCA ACATTCCTTA CTTTCAAGTA AGTTTTTTTT CTTTGCCTAT GTAGCTCGTC GCTAATTCTT GTCGTTAGGA TATCATCATT ATGTCAAGCA ATTGCTACGC ATGTGGATAC CGAGATAATG AAGTCAAGTC TGGTGGCTCG ATCGCTCCCA AGGGTAAAAG GATTACTCTG AAGGTTGAGG ACGAGGAGGA TCTTAGTCGA GACATGCTCA AGGTGAGCGT ATCTTTGTAC TGTATTACAG CTAGTAACTT ACTGCTTCTG TAGTCTGATA CTGCTGGTCT ATCAATTCCC GAAATTGACT TGGTGCTTCA ACCTGGTACC CTTGGAGGCC GTTTCACCAC TCTTGAAGGT CTTCTCAATG AGATTTACAC CGAACTCAGT ACCAAAGTTT TCCGAGCTGG TGACTCTACT ACCGCTGGTA TCGGACAAAC GGATTCGAGC GCCGGTGAAG ATGAAGCAAA CTTTGGGGAT TTCCTCAAAG GCTTGAAGGA GTGTATGTCG GCCCAGAGGC AGTTCACTCT CATCCTTGAC GATCCAGTGT CCAACTCCTA TCTTCAAAAC 
CTTTATGCGC CTGATCCTGA CCCGAACATG CAAATCGAGG TGTATGAGCG AACGTTTGAG CAGAATGAGG AACTTGGTCT TAACGATATG GTCGTGGAAG GGTATAATAA GGAAGCTGAG GGAACGGCGT AAGGTTCAAA TATCTGGATT GGGCAAGCTG GAATGTTATG CTTTGTAATC ACATAGGGAA ATTAGAGTAG AAGACGTGCT GCTATCATCG TCTCCGACAT GTGGTTTGCT TTCATTTATT GTACTTCATA GACATGCATA ATTA
|
Protein sequence | MSSDKTNLFP TLGEVADRTG KAESLEQEGD DRQMQEIESL CMRCHENGTT RLLLTSIPYF KEIVVSSFRC DHCGHRDTEI QSAGEIQPKG VSYTVHLLTR ADLDRQIVKS NWATITIPDI QLTIPPGRGQ INTVEGIIRD TVRDLNISQP VRRVMDPETG KKIDELLEKL RAAIDMEEDD EDDGGVGMDD DVKPVHHEPS NSSSKEEKPF VPFSMIVDDP SGNSYFQFKG SQSDPQWNMR AYSRTFDQNV ILGLVARPED MSEEQPEGVP IVAADHKLSS AEEFESKRNK NVINRDDGTV VPDEIYSFPA TCSSCGHQLE TLMQQVNIPY FQDIIIMSSN CYACGYRDNE VKSGGSIAPK GKRITLKVED EEDLSRDMLK SDTAGLSIPE IDLVLQPGTL GGRFTTLEGL LNEIYTELST KVFRAGDSTT AGIGQTDSSA GEDEANFGDF LKGLKECMSA QRQFTLILDD PVSNSYLQNL YAPDPDPNMQ IEVYERTFEQ NEELGLNDMV VEGYNKEAEG TA
|
| |
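The record does not state how the GC content figure is derived. A minimal sketch, assuming it is the share of G and C bases over the whole gene sequence, and that the sequence is supplied in the space-separated 10-base blocks shown above (the function name is hypothetical):

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, used only as a small example.
example = "GTTTGAGCCG TCTCCTCGAG"
print(gc_percent(example))  # 60.0
```

Passing the full Gene sequence field to the same function gives a value that can be compared with the GC content field, keeping in mind that the database may round or compute it differently.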