Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB05180 |
Symbol | |
ID | 3255898 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 1473634 |
End bp | 1476575 |
Gene Length | 2942 bp |
Protein Length | 803 aa |
Translation table | |
GC content | 49% |
IMG OID | 638255162 |
Product | hypothetical protein |
Protein accession | XP_569266 |
Protein GI | 58264220 |
COG category | [K] Transcription |
COG ID | [COG5169] Heat shock transcription factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.349279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTCTT TCTCCTACCT TCCTCCGCCT CCTCGTGAGC CCAACTTGGC TCTTTTTCAA CCGTCGTCCT ACCCTCCACC ATCGGCATCT CCTTTTGGCA GACCCAACCT CCCACATCAA ACCTCGCTTC ATCATCAACA CAACAACAGC AATGGCAGCT CTTACCAACC AGATAACAGC CCAGTTATAT TAGACGACTG GGATCCCAAA TCACCTGTCT CCCAATCTAG AACTTATGTC TCACCCGATC CACCTCCTAG GGATGAGGAT GTCTTCCCAC CACCTTACGA GCATCACCAT TCAAACGTCA ACCAACATGA GCAATCACAT TCACAATCAT ATCCACCAAG ACGGCCCATT TCTGTTCCAG CAGCGTACTC GGCTTCCACA TCATCGTCAT CTATGCGCAA AGTGCTGTTC CCGCATCATC ATGGATTAAT AGCGTCTTCT TCAAACATGC CTGCAGGGAG TGATGGTGTC AGAGATGGAG GGTTGCATAT TAGTATAGGA AAAGGTGATA TGACGCCTGA GATGGACCCG TTTTACAATC CTATTATGTC TCCGGTTCCT CAGCAATCGT CGAAAAAGAA GAAGGCCAGG AAGACAGAGG GGAAACAACC GACTTTCTTG ATAAAGCTCT ATTCGTGAGT GATTAATATA GTCATAGGCT GTCATATGGG TGACTGACGG ATTTTGCAGG TTACTGTGAG TAGTACTTGG TAAGGCATAA GTGATGTTAG CTGACTCGAC TATCAAGGTC TCAACCCGAA TATAGTCATG TCAGTTCCGC TTTTGCATTC AGTTTCACGG AATAAATATT GATGCTAGCT GCAGATCATC CGATGGGACG AAACAGGTGA ATTAATAATC ATCGAAAATC CTGAAGAGCT GGCAGACAAG ATCTTACCTG TGGTATATCG ACAAAGCAGG TTCGCAAGCT TCTCACGACA ACTCAATGTA AGTTTCACAC CTGTTTCAGA GGGTAAGGCT GATGGAATGA CAGATTTACG GATTCAACAG AAAGCTTAGC TTGAGGAATG TCGAAAAAGG CATCTGCGAC CCCGATGCCA GCAGTTGGTG TAAGTAACCT TTTATCGCTG CACTGAATCA TCACTTGAAC TTATGTTCAA ACGCAGCTCA TCCCTTCCTT CGGCGAGATT CGACCAAACA AGAGATTACC TCTTTCAAAC GTCGCGTTCC TCCTCGCCCC TCTCAAGCCC AAAAACGTCG CATGTCGATG GGCCTTGGCA TCGGCATCAA GCCATCTTAC GCCGGCGCCT CCAACTGCGA AGATCAAGCC TCACCCACAT CTTCCGAACG CTCGCTTGAT TGGCAATCAC CTCCAGACCC TTATCGACAT CACCTCCTAC CGGACGTTGA CGAGGAAGCG CCGTTTGTAT TTCCAACAAG GGATTACTTT GGTATGGCCG GCCATGCGCC TATGGAGTAC GAAGGATGGA AACATAATGG TACAGCGGCA GGGAATTTAC AAGGTGCGGT AGGACAAGTG GAAGAGGGAT TTTCGCCTAC GATGTCAATT CATTTTGATT ATGGGATACC ACCTGATAGA AATAGGTTGG ATGGGTGTAT GGCGTCGAGT CAAGTCCGGC ATGGAGGAAG CCCGAAAGGA CTTGCGATCA ATATTCCCTG CTCATCGCAC CTTCCTAACC ATCACACTCA ACAGCAGGTT CAACCATCTC TCCCACTTTT ATGTCAACCA GCATCTTTCT CTCTACTCAC CCAGAAATCA CCAACAAATA TTGTTCCACA GAGCGCACCA GCCAATACAG GCTCGTTTCC CATTCCTATA CAGGTGACGC AGCAGCACAT CCGTACTCGG AGCGTACAGG GTGAACCTCC AAGTGCTATG TTGTTCTCCC CATTCGGTGA AGAGTTGGGG GAAGTCCCTG GACCTGCTGG ATTTTCTCAG AGTCTCAATG GTACAAGAGG TTACAGACGT CAAGCGGGAA ATATGGGCCA GCCTGCTATT CTGGCAGCAC CGATTCTTGA TCCTTCTGAT CCATCTACCT GGGCTCGACG CGGGTTCATA GACCTCACCA CGGCAGGCGC TTCCAACCCG CTCCCATTCA ACCCAGTGCC AATTTCTGCA AGCCACGCAA CCTCGCCCCA CTCTTTACCT ACTAATCTGA ATTCCTTGCA CCAACAGCAG ACTGCGTCAC CCTCGGAATT GATGGGTCAG TCAATGAGCG CTGCACTGAG CGATGATTCG CCAACAACGG TCTCGCCAGG GATATACCAA TTGGGCTTTT CATTGCCTGC CTATCCACCT TTGAAGAGAC ACATCTCGTC GCTTAACCCA CCTGTAAGCG CTCACCTCAA CCCGACTGCA AACATGACCT CTAACACTGG TCTCGCAAAG TCGCCTGACA TGGCTAACGG GAATAATGTC CGTCTTTCAG CTATCATCCA GACGAAACAA GATCGACGGC AGTCAATCAG TGCAAGCCCA TATCCGCATT CTGCGCAGTC TCCAAGGCAA AGGCCAGGGG TGCTTAATGC GAGTGATAAC GTGGGTGGGA ATGGGTCTTG GACGGGGATG AATGGAGGTT CGTTGCGGAT GATAGGATGT TCGGGTCGGG GTAGTGAAGG AGGCAGTTCG GCAGTTGATG TAGGTAATGT TGACGGTGGT GAAAGGAAGC AGAATGGGCA TTCGTCTTCT TGAGCTGGTG GAGGTTTTCA ATGGGTTGAT GGTTTGCGTA CTGGTTGCGA GAATCTTCCC GCTACTGTAT CCATATGTTT ATATCCTCTT CGACTTTACA TTTCACATTC CTTTTATTAC GATTTCGAAT TAACAGCCAG AAATAATGTC GATAGCCAAT TTAGTCAACG TTTGTCAGAC AGGAAAGATG GCCGCCCGTC ACGAGACCCT CAGAGTTGTA CATTGTGTTA GATCCCTCGT ATCTTATTGC TTTGCGTATG TT
|
Protein sequence | MNSFSYLPPP PREPNLALFQ PSSYPPPSAS PFGRPNLPHQ TSLHHQHNNS NGSSYQPDNS PVILDDWDPK SPVSQSRTYV SPDPPPRDED VFPPPYEHHH SNVNQHEQSH SQSYPPRRPI SVPAAYSAST SSSSMRKVLF PHHHGLIASS SNMPAGSDGV RDGGLHISIG KGDMTPEMDP FYNPIMSPVP QQSSKKKKAR KTEGKQPTFL IKLYSSQPEY SHIIRWDETG ELIIIENPEE LADKILPVVY RQSRFASFSR QLNIYGFNRK LSLRNVEKGI CDPDASSWSH PFLRRDSTKQ EITSFKRRVP PRPSQAQKRR MSMGLGIGIK PSYAGASNCE DQASPTSSER SLDWQSPPDP YRHHLLPDVD EEAPFVFPTR DYFGMAGHAP MEYEGWKHNG TAAGNLQGAV GQVEEGFSPT MSIHFDYGIP PDRNRLDGCM ASSQVRHGGS PKGLAINIPC SSHLPNHHTQ QQVQPSLPLL CQPASFSLLT QKSPTNIVPQ SAPANTGSFP IPIQVTQQHI RTRSVQGEPP SAMLFSPFGE ELGEVPGPAG FSQSLNGTRG YRRQAGNMGQ PAILAAPILD PSDPSTWARR GFIDLTTAGA SNPLPFNPVP ISASHATSPH SLPTNLNSLH QQQTASPSEL MGQSMSAALS DDSPTTVSPG IYQLGFSLPA YPPLKRHISS LNPPVSAHLN PTANMTSNTG LAKSPDMANG NNVRLSAIIQ TKQDRRQSIS ASPYPHSAQS PRQRPGVLNA SDNVGGNGSW TGMNGGSLRM IGCSGRGSEG GSSAVDVGNV DGGERKQNGH SSS
|
| |