Gene Information |
Locus tag | CNN00550 |
Symbol | |
ID | 3255380 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006683 |
Strand | + |
Start bp | 187237 |
End bp | 189851 |
Gene Length | 2615 bp |
Protein Length | 737 aa |
Translation table | |
GC content | 53% |
IMG OID | 638254471 |
Product | DNA-binding protein cre-1, putative |
Protein accession | XP_568732 |
Protein GI | 58262644 |
COG category | |
COG ID | |
TIGRFAM ID | |
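
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon; neither convention is stated in the record itself. The function names `span_length` and `min_coding_length` are illustrative only and do not come from any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 187237
END_BP = 189851
REPORTED_GENE_LENGTH_BP = 2615
PROTEIN_LENGTH_AA = 737


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def min_coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumed)."""
    return (protein_aa + 1) * 3


gene_length = span_length(START_BP, END_BP)
coding_length = min_coding_length(PROTEIN_LENGTH_AA)

# Does the coordinate span agree with the reported Gene Length field?
print("span matches reported length:", gene_length == REPORTED_GENE_LENGTH_BP)

# Positions in the gene span that are not needed to encode the protein.
print("coding nucleotides:", coding_length)
print("non-coding positions in span:", gene_length - coding_length)
```

Under these assumptions the span equals the reported 2615 bp, and the gene span is longer than the 737 aa protein requires, which is consistent with the "Is gene spliced | Yes" field. The surplus positions may be introns or untranslated sequence; the record does not say which.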
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.141726 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCTAGCAG TCTGGCAGTC CTCAGCACTT CATTTTCACA TTTTCACATT CGGCCGCGTC GCCGACTTTT ATCTTCTTTT TTCATCACTG CCTAGGGGAT AATCATCTCT GCTTGGTGAA ACGTCCTCCT TCCTTCAAAA ATCCCATCAA CCCTCCTTTC GGGCCACATA GACTACCTCT GTGACCGATA TCTAGCGACG GAGTTCATTC GCAGCTCCAC ACATCCAACA ACATCCTTTA ATTCCTTATC TTTACTCCAT CCTGCTGTAT TCGACTACAG CCATGTCATC CACAGACAAC AACTCTCCCA ATAGTGGAAA CACAAGTGAA AATACCCTTA ACAACCTCCC TGCGCCTATC CCCGACCCCA TCACCGGTCG CTTAGATCCC AATGACCCCA CAGTCAAGGC GTTGACGGAG GCGGCGTTGA ATATGGATAA GAGCAAGATT CCCCGACCAT ACAAATGTCC TCTATGTGAT CGAGCATTCT ACAGGCTAGA ACATCAGGTG AGTTGACCAT TTCGACACCT GAAGGAGCCA TGGGGGAAAA GGTAGAGTTA CTGATTGTTC AATGTAGACT CGGCATATCC GAACGCACAC GGGTGAAAAA CCCCACGCGT GCACGCATCC TGGCTGTGAT AAGAGATTCT CACGTTCCGA CGAGTTGACA AGACATGCGA GAATCCACTT GCCAACAGCA AACGAGAATG GGTCCAAAGG AAAACACAAG TATCATGATG ACGAGGTGAG CAAATTTCAT ATCTGACAAA TGGGAACTGC GTTGACGTAG ACAGAATGAC AACGACGACC ACCGATCCCA CAGCCTTCCT CATCTCGGTC CTTCATATAA CATGGATGTC GACCGTTCAA GCTACCCATA TAACCTCCAT TCACTCCAGA TGAGCGGTCC CAGTGGCGGC ATCAGCGATA TCTCTGCTCT CGCCGCGGCC GCTTCCGATC AGCTCATTGA ACTTGAACGA CACGAGGCAT TTCGGCGAGC AGAATGGGAG TTGCGCCATC GACAAATAGC TGGTGCCAGG AAGAGCAATG GTGGAAGTCC AGTGGGGACG CCTGGAGGAC CGTATGGGTT CTCAAATGAG CGCGAGAGGA TGTCTTTGAG CGGTGTTCCC ACACCAAACG GTGGCCAATT GGTGTATCCT GTTTCTGCTC CTCAACCCGC CACTGGCACC CTTCCAGCTG TCCCTGCTGG CACTTTGGCC GACCCAACCT ACCTCGTCCC ACCAACGTGC TGCCATGACG AATGTCACAA ATCATATCGC AAGCGACTCA AGCTCGCAAA GCAAACGGCT GCTTGCCCCA ACTGTTTAAC TTTGGCTCAT CCTGGTAACA ACTCTAGTGG ATTTGGTGGT CTTGGTGGTG CCGGGTACGG AAGCGGTGGT GGCGACAGCC ATCATTCAAG TAGTTCCAAT ACGCCCAAGG ATAGGTCGAC ACATAACTCA TCAGAGGACT TGACCAAGTT TGCGGGTGGC GGACAGAGTC ATTACAGCTT GCAACAAGCC AATTTTGCCC AGGAGCTAGC AACTCTGCAG TTCCAACACC TCCAGGCTCT TCAGCGAGCT CGTTCCGGTA CCTCGGGTCA CAACACTCCT CATTCTGCAC CGCACTCCAA TTCACAATCT CGATCACAAT CTCATTCTGC TTCTCCCGCT ATTCCCGTCC CTCCCAACCA TCTCCGCCCT TACACTATCG ATCTCCACTC ACATCGTGGT GGTTTGAATT CGTCACACGT CTCTGCGGCT CCTAGTCCGG CTTCCAGTGA TGACAGCGAC GAAGAACCCA 
TGAACGAGAT TATCTCTCAT GGACCGTTCG ATTTCACTCC CGCTACAAGC CCGGTGTTGA GTGGTATGCG TCAGATGTCT TTGTGGCAGG GCAAGGCTAT CACAGCTCCA CCTTCGCGAG CGACTAGCCC GGTGCACAGC ATTTCTCGTA ACCCTTCTCG CGCAGGTTCA CCAGTCGAGG GCCACAATGC CAATTCTGGT AGGCATGGCC ACACCTCTCA TTCGGCCCGG GACGCGAAGA ACCGCTCACA TCCTTACACT CACCACTACT CTACTACTCC TAACTCTCCT CACTTTCCTG CCGTCATCAA GTCTCGCATG TCTCCACCCA AGCTCAATCG AACATTAAAC GGTGTTAACA ATGGCAACAA GACTGTTCAG GATATCTTGA ACGGTCCTTC CATCCCTCCG CCACCTAGCG ATAGGATGCT TCCCCCTCCT AACTCTTCCG GTAGCTTCAC CGCAGTTCCT AGCGTCAACT ATTCTATCAC TTCTCAACCG ACTTCTGCTC ACCAAAGCCC CAATACCTCC CGCGCCTCTT CTCCCACACA CCTATCATCA AGTAACGCCG GCGGTCACAA CAGCCACAGT CATATAATCC ACGGCGTCCG CGCGGCGTTT GGGATGACAC CAATAAGCTC GATGTCGCGG GAAGGCAAGG CTAGCGGGCA ACATGTGAGC TCAAGCTACA GTCCTCCGCA TAAGCTTGCA CCGTTGGGGA TGGGAGGTGA AGGTGTGAGG TTGCCGAGTT TGAGCAGGGG TAGTAGTCCG GTGCATTTCG ACCACATAGG GATGGAGCTG GATGGGCATT CGTAG
|
Protein sequence | MSSTDNNSPN SGNTSENTLN NLPAPIPDPI TGRLDPNDPT VKALTEAALN MDKSKIPRPY KCPLCDRAFY RLEHQTRHIR THTGEKPHAC THPGCDKRFS RSDELTRHAR IHLPTANENG SKGKHKYHDD ENDNDDHRSH SLPHLGPSYN MDVDRSSYPY NLHSLQMSGP SGGISDISAL AAAASDQLIE LERHEAFRRA EWELRHRQIA GARKSNGGSP VGTPGGPYGF SNERERMSLS GVPTPNGGQL VYPVSAPQPA TGTLPAVPAG TLADPTYLVP PTCCHDECHK SYRKRLKLAK QTAACPNCLT LAHPGNNSSG FGGLGGAGYG SGGGDSHHSS SSNTPKDRST HNSSEDLTKF AGGGQSHYSL QQANFAQELA TLQFQHLQAL QRARSGTSGH NTPHSAPHSN SQSRSQSHSA SPAIPVPPNH LRPYTIDLHS HRGGLNSSHV SAAPSPASSD DSDEEPMNEI ISHGPFDFTP ATSPVLSGMR QMSLWQGKAI TAPPSRATSP VHSISRNPSR AGSPVEGHNA NSGRHGHTSH SARDAKNRSH PYTHHYSTTP NSPHFPAVIK SRMSPPKLNR TLNGVNNGNK TVQDILNGPS IPPPPSDRML PPPNSSGSFT AVPSVNYSIT SQPTSAHQSP NTSRASSPTH LSSSNAGGHN SHSHIIHGVR AAFGMTPISS MSREGKASGQ HVSSSYSPPH KLAPLGMGGE GVRLPSLSRG SSPVHFDHIG MELDGHS
|