Gene Information |
Locus tag | CNJ02920 |
Symbol | |
ID | 3254251 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006679 |
Strand | - |
Start bp | 903644 |
End bp | 907879 |
Gene Length | 4236 bp |
Protein Length | 1060 aa |
Translation table | |
GC content | 51% |
IMG OID | 638253441 |
Product | gata factor srep, putative |
Protein accession | XP_567572 |
Protein GI | 58260324 |
COG category | [K] Transcription |
COG ID | [COG5641] GATA Zn-finger-containing transcription factor |
TIGRFAM ID | |
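The Start bp, End bp, and Gene Length fields above are consistent with 1-based, inclusive coordinates (907879 - 903644 + 1 = 4236). The sketch below shows that arithmetic, plus a simple GC percentage calculation of the kind that could underlie the GC content field. The function names `gene_length` and `gc_percent` are hypothetical and are not part of any database API. That the database derives its GC value this way is an assumption.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    # The record prints the sequence in space-separated blocks of 10 bases,
    # so strip all whitespace before counting.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the Start bp / End bp fields of this record.
print(gene_length(903644, 907879))  # 4236
```

Passing the full Gene sequence field to `gc_percent` would give a figure to compare against the reported 51%, allowing for rounding in the record.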
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.742606 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TCCATCTAAA TCATCTATAC ATACATTATT TACATAGTAT TTCCTATCCA TCCAACTCGA CAGCCACGGC CACGTTGTTT CAAGTCCTGT CCGCTCTCGA AAATTGTCTC GATCGAGGTC GCCGCCGTCT CCACACCCAC TGATAACACT TCCCCAACAG ATCCCACCCC GCTCTGTACA TTCTTTTGCA ACACCATCGA CATCGTGAAA GCCACCTAAA CGTTTTATAG TACCAGTGAG TACTACTCAA TTTTTTTTTG GCAATGTTTT CCCTCGCATA GGGATCCTTT CATTTTTTAT CTTGTGGCTG CTAAGTTGTG CTGTGGGCAC CGCTCTCGCC CGCTGGCGGG GCAGAGTTGT AGATTTTATC TTATATTCTT TTGTCACCTA TCATCCATCC ACCCCTTTTA TTTTTGTTTA AAACATTGCC CATCCTTTAA GCGTGATTGG TATACCATGG CACGATTCGC TGACGTTGCC ACTCAGGTGA CAATGCCCGA AACGCATACC TCCGCGCCTC AGGACAACGA GCATGAACCT TCAATTGGCA GTCGATTCGA AAACCGATTT GGTCCCGGGT GGAGAGTAGG GTTCGACAAT TCTAATGAGC GAGGAGGCGT CGACGAAGAT GCTGCCGAAG ACTCACCCCC TCCTCGTCGA GAATCACTTG ATACATCGGA TAAGGAGGTC GAAGCGGCCG AAATGGAAAC GCCAAATGTT GAGAGAGATG AGTTGGAGTC TGATTATGGG GAGCCAGTTC AACAGCAAGG ACATGTTGAT TCTGAAGAAG AGCTGGATCA GAAAGAATTG ATCAGAAGGG CTGCAGCCAG AAGAAGACAT CAAAACAAGA AAGAAGAGCA AGGTAGTGAT GGTCAGCCAT CCGCGAAAAA GAGAAAACGT AAGTCATTCG CACACTTGGT CCCACCCATG CTGAAAGCGA TTAGCCCTCG CCCCTGCCAA CGCATCTCCA TTGCCTTCTC CGCCTCCTTC CACTGCTGTC ACGGCGTCAG CTCATCCTTC TGCTGGAACT TGTCCTGGTG ACGGGAGATG CAATGGTGCT GGTGGCAAGG CTGGCTGCGA AGGTTGCCCA ACCTACAATA ACTCGATCGC TTCTGGATTG GTCTCCGCCA GTAATTCCCA CGCAGCTTCA CATCCTCCCA ATGTGTCTGA AGGTATTGAA CGTCCCGTTC GAAACATTTA TGATCGCGAG CATCGTCCTT ACGGCTTCGA CCGTTTCATG GAGAACAGTA TGGGTAACGG TTTGGCTCCA AGGGCTTTAC CTCGTCAGAG CCCTGATCAA CGACAGGCTC ACCCTTCTCC CGTAACCACT CAGCCGTTGA TGCATCCGAC TTCCGAGAAG GGAACTCCAA CAAGGTTCTC ACCGGATAGC GACGTCGAAA CTCCTGCCGC TCCTGGTGGT AATGGATCGG GGCTTGCTGC TACTCCCGTT GGAATGAGCT GCAGGAATTG TGGGACAAGT ACCACTCCTT TGTGGCGAAG GGATGAAGAG GGTCGACCTC AATGCAACGC ATGCGGTAAG TAAAAAGGAA AAAAAAAAAT GCGAGCACGT AGCTAATGTC ATTAACAGGT CTTTATCACA AGCTTCATGG TGTTCCTCGA CCCGTAGCTA TGAAAAAGAC TGTTATTAAA CGACGCAAGC GTGTTCCTGC TGTTGGTAGT ACTTCTACTG GCGGTCGTGG CACAAATGCC GAGCTCCCTT CACCTGCTAG TACGCCCGTT TCCGTTCCGA CGGTCACCGC TCCGCCTCCT CACGTGGCAC CGCCTCTTGA TGACAAGGCT 
CACCGTACCT CTCCTCCTTT TGGTCACCGT GCTTCTCAAC CTCACTCGGA ACACCGAATC AACCATTCTC TAGGTCCAGA GGCATATGGT CTTGCCGGTA GGTACGGTAA GCCTTCTACC CCTGCTGGTA TGAATTTGCC TAGCTCCGCC TCCACCTCGT CTTTGAATCT TCCCGAAAGG AAGAAGCCTT GGTGGCAAGA GGGCCGAGAA GGCCGAGACC GTGAGAAGGA AGAGAAAGAC AGAGAAGCCA GGGAACGCGA AGGGGTGAGT CATGTTTCCT TCATAGATTT CCCTTGCTTT TCCCCCTCTT TTCATATGAT CGCCTACCAT CCTTCCATTT GCGAAAAAGT TTCATGGCCT TCTCATCTGA TAAGATGTGC AATTGATATC ATGTGGTGGT GACTGTGATT ATAATGATCC GTCGAGTCAA GTCACCATCT ACCTTCCTTT GTTTTCATTC TCCTGCTGGT TGGCCATGTC TCCCATTGTT TCACCTTTTT ATCCTCTTCT ATGCTCCTTT CATATTTTAC CGTCCTTTGT CCTTATCTCG TTCGAAAGTT ATGGATGAGA AGAGCATAAA CCTATCTTCG TCCATGACCT CGATCTCATA TCCTCCCCTT CTACACGTAA TGATCTGTTA CAATGTTTGA CAGCATTCCT TTACCACCTC ACTTCATACC CAAATCCCAT TCGGTTTCAG CGTCTCAAAA AGTTACAGAG CTGACGAGAT GTCTTTCTTC TCCCAATTCC AAACTTTCGA CTGACCGCCG ATTGCGTGCC ATTCAATTCA ACTATCGCCA TTATCGCCAC ACATCTGTTC CTCCCGAACC TTCGCCTACT GTCGACACTT CTCATCGACG CGCTTCCCTT CCTGGTGCGA CGCCTGAACA AATCGCTCAT CAGCTTGCTG CAGAAGCACT TCTCACGATG GCGCCCGCTG CTAATGGCGG ACCTTCTCCC GAAAAACGAG CCGAAAAGTC CTCCGGTGCT GGATCTGTGC CTGCTTCTCA ATCTAGGCGT ACATCCATTG ACGTCGACAT GGCTGATAGC GAGCCCCGAG GTATTAAGAG AAAGAATGAG GAAGAGTCAC GTGACATTCG CGATCCTCGT GCTTCTATGA GCCTTGGCCT TCACAGTATG GACCGTGACC GAAATAGGTC CAAGGACAGG ACATTTTCTC ATTCTCCTCT TATCAGCACC GACCCTCGCT CTGTTCAGCC CTCCAATTCC CACCTTCCAA GCTCTCGTCT AGGCCAGCCC CCCACGAGCT CAGCATCTCC TTATCCTGCC ACTACCCAAC AGGGTGCACC TAACCGTTAC TCTGTATACG GTCCTACCAC TCGCGACCCA CTTGCCGGTA GCTCCCCTTA CTCTTTCAAC GCTTCGCGAT ATTCCAACCT TCACATGCGT CGCGACCTTT CTCCTTCTGT TGGTGGCGCC ACCTCCAAGC CCAATGTCCT TTCTCCCCCA CGACGTGCAT CACCCGCGCC TGCTCCTGAT CCCCGGGAAC GATTTTATCC CTCACCGTCC GCCGCCGCAT CTGCCGGCTC GGCGCCGGCC AGTGTCAATG CTGCAATGGC TGGCTATGGT CACTACTCTA TGAGCCGCAG GGAGTTGCAG GAACACCGAG AGCAGCTCAA GGAAGGCAAG CGATGGCTCG AAGCCATGAT GGCTAAGACG GACAAGATGC TGCACATGGT GGAGAACAAA ATGGCTTTGA CTGTTGAGAT GAGCTCTGGT CCTTCTGTAG CTGGACCTGG CGACAGGCCA TCTTTGTCTT CCAATGCCAG CCCCGTCCCC CCACCTGGTG 
CAGTCCACAA GATGAGCGAC GATTGGGAGT TTGAGGAGAG AGAAAGACAG AGACAGAAGG AAATTCAGAG GCTTGAGCAG GAAAGGGAAA TGGACCGAGC TGAGAGGGAG AAACGTGAGA GGGAAAGGGA AAGGGAAAGA CCTGGATCTT ATGAGGATGT CCGTGGGCGC CCTAGAGATA AGTCGGAGGC TGAACGTAAC CGTGACATCC TTCTCGCGAG CCGTAGAGTT TCAGCGGTAT CCCCTAACCC GGCTACTCGA GCCGCTGCCG CGTCTCGTGA GAGTGCGGCG TCTTCCAACG GAAGCGCTCC TCAACAAGGT GAAAAGTCTC ACGGCGGTAC GAATGGAGTT TCTGGCGGGA AGCGAGAAGG TAGTCAGTGG GACGGAGAAC CCGTGATGTC TGGTGTGCCT TTGCCTAGAA GAGAGCAGCA AAATGGGATT GGGAGCAGAT TGGGGAGAGG ATTGTGGAGT TTTGACGTTA GGAGTTGATA GAGTTGGAGT TGGCACAGGC GACTAGTGAT TATACAGGCA GTTGCTTTGT TGAAAGTTGA TATACGAGTC GGAATATGGT TTAGATCCTC TTTTTTTTAT CTGTCATTCT ATGTACTAAG GGCAATGCAT AATCGA
|
Protein sequence | MPETHTSAPQ DNEHEPSIGS RFENRFGPGW RVGFDNSNER GGVDEDAAED SPPPRRESLD TSDKEVEAAE METPNVERDE LESDYGEPVQ QQGHVDSEEE LDQKELIRRA AARRRHQNKK EEQGSDGQPS AKKRKPLAPA NASPLPSPPP STAVTASAHP SAGTCPGDGR CNGAGGKAGC EGCPTYNNSI ASGLVSASNS HAASHPPNVS EGIERPVRNI YDREHRPYGF DRFMENSMGN GLAPRALPRQ SPDQRQAHPS PVTTQPLMHP TSEKGTPTRF SPDSDVETPA APGGNGSGLA ATPVGMSCRN CGTSTTPLWR RDEEGRPQCN ACGLYHKLHG VPRPVAMKKT VIKRRKRVPA VGSTSTGGRG TNAELPSPAS TPVSVPTVTA PPPHVAPPLD DKAHRTSPPF GHRASQPHSE HRINHSLGPE AYGLAGRYGK PSTPAGMNLP SSASTSSLNL PERKKPWWQE GREGRDREKE EKDREARERE GVSHVSFIDF PCFSPSFHMI AYHPSICEKV SWPSHLIRSS QKVTELTRCL SSPNSKLSTD RRLRAIQFNY RHYRHTSVPP EPSPTVDTSH RRASLPGATP EQIAHQLAAE ALLTMAPAAN GGPSPEKRAE KSSGAGSVPA SQSRRTSIDV DMADSEPRGI KRKNEEESRD IRDPRASMSL GLHSMDRDRN RSKDRTFSHS PLISTDPRSV QPSNSHLPSS RLGQPPTSSA SPYPATTQQG APNRYSVYGP TTRDPLAGSS PYSFNASRYS NLHMRRDLSP SVGGATSKPN VLSPPRRASP APAPDPRERF YPSPSAAASA GSAPASVNAA MAGYGHYSMS RRELQEHREQ LKEGKRWLEA MMAKTDKMLH MVENKMALTV EMSSGPSVAG PGDRPSLSSN ASPVPPPGAV HKMSDDWEFE ERERQRQKEI QRLEQEREMD RAEREKRERE RERERPGSYE DVRGRPRDKS EAERNRDILL ASRRVSAVSP NPATRAAAAS RESAASSNGS APQQGEKSHG GTNGVSGGKR EGSQWDGEPV MSGVPLPRRE QQNGIGSRLG RGLWSFDVRS