Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI03130 |
Symbol | |
ID | 3259781 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | - |
Start bp | 846868 |
End bp | 848997 |
Gene Length | 2130 bp |
Protein Length | 494 aa |
Translation table | |
GC content | 47% |
IMG OID | 638258805 |
Product | conserved hypothetical protein |
Protein accession | XP_572639 |
Protein GI | 58270966 |
COG category | [K] Transcription |
COG ID | [COG5169] Heat shock transcription factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAGAAG ATGAGGACAT CGCTAAGACA GGATTAATCT ATTGGTCTGC CAATGGCACA ACCTTTACGT GCCCTAATCC CACCGAGTTC TCAAAGTACG TCAATAGACT ATTGGAGTTC ATTTTTACTA CCGAGATGCC TTGGATACAG CTGACATAAT TTTTTTCTTT TTTGCAGAGT GGTTTTGCCT CGATATTTCA AGCACAATAA TTGGCAAAGC TTTGTACGCC AATTAAACAT GTACTCGTAT GTATTACCAC ATCGCGATAA CCTTGTCGGT GGCTGCATAT GCTTATCATG ATATTCCTTA CGGCAGGTTT AATAAAGTTC GTCTTTTTCT CCGTTAGCGT CCGATTTCGC ATCTTTTACC TTTCGCCGTG CATCTAATCC TAAGCCTATT AGGTCAACGA TATATATTCA ACTTCCACCG ATCCTCAAGC TTGGGAATTC AGACACAGCC TTTTTCGCCG TGGCGAAGCG CATCTGCTTC CCAGCATTAA GAGGAAATCG TCCCGACCCA GTGCCCCCGA TGGCTCTAAT ACTGTCACGT CACCCACTGA TGAACTGCCT CCTAGTACTT CTAACCCGAT TAAACCTGTG GCAGGCTGGA TGAGAGATGC TGTACCAGTC CCTTATCGCA TGCCATCTCC TCCTCATATC CATGGCCAAC CTCAAAGGTC AGCCACCTAT CCTTACAGCG ATGGGTTTGC TACTCGCAAA GACGATGGTC GCTCTCCAAC GCGTGGTATG GCTTGGGATC CACTTCCCGC AGTTCAACGC ATGCCCCCGC CTCCAGATAA TCAGATCCCA ATACGTTATC ATCCAGATCC TAATCGTCCG GTTCTTACAA CACAGCGCTT CTATCACCCT GGATATCCAG AATCACCTCT ATATGGGCCG GCGCATTCGC CCAGTGCAGA AACTCTTCTC AACCAAATGT CTGTCTTGGA AGACAAGGTG CAAAAGCTCA CAGACGTGCT GAATAATGAT CGTATTGAGC ATGTGCGAAA CAACCTCGAC TTCACGAGCT ACCTCTTACA GATGATTGGA TGGGCTGCAG GTGATCAGCG TAAGTCCTCT GTGGCAATGT ATCCCGGAAT GCACATGTTA ACGAGAGTAT GCGATCAGAT GCTTCACCCG AACTGCGGGC GCTACAAGAT ACTCTGAGTC GCCAAAACGC CGATATGCGC CATAAGTATG AAGCGTTCAT GGCTTCTGAC GCGTTAGCTA TCATGGCGAG TGGTGGGGGA CGAGAGCGCT CCGATAGTAG GGACAGTACT CGTGAGCGAA ACGCTCGCCT GGGCTGTAAG TCTCCCATCA TTTCTATCAC CTTGTCCCAT CTGATGTGCC GTTGTAGTTG AGATACCGCC TTTTCCAGGA CATCCTCACC CTTCTATTAC GGACCTTCGC TTGCCTCAAA CCGCCCAGTC TAGTTCATCA GCTATCTTAT CCCAACGTGC AGCTCCTCGA ACTTCTCCTC GAAACTCTAC GTCCTCTGAT CTTCTATTGA CACGGCCTTC AACTAGTGAG TCTATAAGAG AAAGGGAGAT ATACCCAACC TACTTCCCTC CTCAAACCTC TCATGGTATT GGACCCGCTT CGTCAATCCC TTCGAGAGGT CCGGAAACTA TAACTCCATC TCTATATGGT GGCGGACCAT CAGTACCTCC TCCGCTCTAC AGGCCGGCAC CCATCGTTGA AAAGCACCGA GAGATTGAGA AAGGAGAGGA ACACAAAGAT ATGAATGGGC CTTCCACGGT GGAGCTAGGT AGAGAAGAGC GGGATAACGA AGACTCGCGT ATGACGATGG CAGGGGAAGA AGCAGAGAGC AAAACGGGAT TACGAAACCT CCTCAATTGA CGTTATCTGC CTGGTGAACA GCGCTCCCAA GAATAATAAT TTCAGAAAGG AACATTGTGT CGAATAGCGA AGATGGGATG AAGGAAGGAA AGAAGGTTAC ATCTAAAAAG GAAGATCTTG GCCAGCTCGT TTGTGTGTGC TCTTGATAGT TGCGAAACGG GCATTTATAT AAACCATATC TTATCAGGCA TCTTGAGTCA TTTTACTTGC CATTCACCAC AGCTAATTCA AATAAGTTTA CGTCAACTGC ATTGTTATTA TTATTATAGT
|
Protein sequence | MLEDEDIAKT GLIYWSANGT TFTCPNPTEF SKVVLPRYFK HNNWQSFVRQ LNMYSYVNDI YSTSTDPQAW EFRHSLFRRG EAHLLPSIKR KSSRPSAPDG SNTVTSPTDE LPPSTSNPIK PVAGWMRDAV PVPYRMPSPP HIHGQPQRSA TYPYSDGFAT RKDDGRSPTR GMAWDPLPAV QRMPPPPDNQ IPIRYHPDPN RPVLTTQRFY HPGYPESPLY GPAHSPSAET LLNQMSVLED KVQKLTDVLN NDRIEHVRNN LDFTSYLLQM IGWAAGDQHT LSRQNADMRH KYEAFMASDA LAIMASGGGR ERSDSRDSTR ERNARLGFEI PPFPGHPHPS ITDLRLPQTA QSSSSAILSQ RAAPRTSPRN STSSDLLLTR PSTSESIRER EIYPTYFPPQ TSHGIGPASS IPSRGPETIT PSLYGGGPSV PPPLYRPAPI VEKHREIEKG EEHKDMNGPS TVELGREERD NEDSRMTMAG EEAESKTGLR NLLN
|
| |