Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK02750 |
Symbol | |
ID | 3254694 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 809866 |
End bp | 812739 |
Gene Length | 2874 bp |
Protein Length | 757 aa |
Translation table | |
GC content | 52% |
IMG OID | 638253766 |
Product | fmHP, putative |
Protein accession | XP_567872 |
Protein GI | 58260924 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.187782 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGTATAAGAA CAAAACACTG TCGTCGTCGC AACCCGTTCC CGCATCTTCG TCGTCCATCG CCCACGGACG CCCTCGCACC TCTTCGCTTT TTGATACACA GGCTGTCTCG GAAGGCGCTA GCGGATCAGT CAGGCGTCTT TTTCAGGATC ATATATTTTA CGGTAGTCAC CCATCATGCT TCATTCCCTC CAGCCACAGA ATTTTCCAGA CGACCTCGGC CGCTCGAACA GCGTGTATTC TTCGCTCTCG CCTGCTACAA ATTCTCTGCC TTCCAGCTCC TTTTCAGCTC GCCCCCACTC TGCTTTCGAA ACCACCACCC CGCGAGCTCG CCACTCATTT TTACTGCCCG CTGTTGATCG CTCCCCTTCC TCCGACGGTC ACTCTCCTCA GCAGTCACAG GCGTTGCGCC GCCCACCCTC TTTACCAAAC GCCGCCATTC TTTTAGAGCA CAGCCCGGTC CGCAATAGCA CTCTTTGCCG GAGCCATTCT ACCAGATCTG CAAGGACCAA GTCTATTCGA CGCAAGCCTG TCCCGGCCAT GGTGCTGCCT GCGGCCGATA AAGAAGTTAC AATCGGCGTT GGTCTTCCAC CTAACCATCC ATTCGCCAAT GTCGCTTCAC AAGCTATGAC GAGAAGCAGC AGCTCGTACG GTTACGAGGA CATGCGCTAC GGTACGGAGA AGATCCATAA CAGATTGGGA ACCAGTTCTC AGCTAGCCAA CAGGGCGGTA AGTGCGGGAG TGATGACTCC AGTGCCTCCG CACCGGGTGA CGAGCCGACC ACATCTCGAG CGTGCCTGTG CAAGTTCCCC CTCCACTCCC ATCCGCAACA CTCAACCGCC TTTCCCTCCG TCATTCTCCG TACATCGTTC GGCAGATCCT CCGCCCAAGC CATTTCCCCG CAGAGACACA TCTCCGCATC TCCCCGAACA ATATGGGCAG CGTCTGCACT CCCGTTTCCC GGAGTACACC CTTTCCGTCG ATCGTCCTCT GCCTCCTGCT CCGACAGACA CCGCGTCCAT CCGATCCACC GACACCGCAC CCCCCTCCGC TGCCACTCAT GGAGGTCGTT CCTCCGCTGA CAGTGCAAGG ATCATGACGC CCATCCATGG TTCCACCCAC GAGCAAGTCG GGCACAGTCG GGAGTCATCT TGTAGGAGTG GACACAATGT GTCCTTATCT GTCCCTCCTA TTATGCCCAA GAGATTGATG TCGGACGATA TCGTCGTCAG AAGCAAGGGT TCGAAAAAGG TTCAGCAAAA GATCAAAGTT CCCAAGAAGG TAATTCTGCT CGGAGGCGAG ACTCTTGCAG ACGTCGAGTT TAGCGTCGAT AAACTGGTTT CAAGGGATCG AGTGCTTGAA GCTTCGACAT GTTTTGTGAG AGATGAGAAT GGTCTTCCGG TCTGCTTTGG TGATCTCCTT CCGCCCCCTG GTCCTGTCGA AGCTGGCAAA CCTACTCCCA AAACTGTCGT CTTCTTCATC CGTACGTTCT GGTGTGGTCA ATGTCAAGAT TATACGCTTG CCTCTATCTC GGTCCTTTCT CCCGAGGCAC TCGAGAAGGC AGGTATCAAG GTGGTCATCA TTGGTCATGG GAGCTGGAAG GTCCTCAAGG CGTATAGGAG GTTGTTCAAG TGCCCGTTCC CTATCTATGT GGATGGACCG AAGAAGCTGT ACTCGCTCAT GGGGTACGTT GTCATGCTCG ACCACTAGAT GTTTAATACT GACCCATCCA TATAGCATGA CGAAAGGTGC ACCGAAGACA GCACCCTGGG GTCATTTCTG GAAAGGTCGA GCAGAGTATC ACCAACGTGC TGTGCCTGGT CAACTCGTCC ATGGTATCTC TGTAGGTCTT GCCTTTTATA ACGTTGACTT TGCTGAAGGC TCACGCTCGA CTTGCAGAAC GCCCTCTTCA AGATGCCTGT CAAGCCTCCA GGCGACCTTA CCCAATTAGG TGGTGAATTC ATCTTCAGCC CTGGGTCTGT CTGCGAGTTC GCACATCGCA TGACTCACGC CTCTGGTATG TCATATCGTC CTCTTCTCTT CGTGACGCTG AGCTGACAAG ATTCCCTTTT TCTACATGAA CTATCAGACC ACATGGAAGC TCCCGAAGTC ATTCGTCTTG CAGGTTGTGA CCACCCAACT GTTGAGGAAA CTAAAGCGGT CGAACTCGCC GAATCGCAAA AGGAAGAGCT TGAGAAGCTT CGCATGGAAA TGGAGAGGTG GAAGAAGGAG AGGGCTGCCG AGCTCGAAGG GTGAGTTTTT TTTCCCTTTT TCCTTTTCCC TTTCTCTTCT CCGTCCACGA CATGCACATA TTGATCAGTG TCTTCTCATA GAATCAAGAT GCGAAAAGCC GCTCGACGTG GGATTCCATA CTCTCGTTCA CTGGAGATTG GCCCCCAAAT GGTCCAAGTC TACGACTTTG AATATGACTT TGAGGACGGT CTTCAAGTAG CGTATGAGGA AGGAAATGAA GATGAAGCAG AGGCGGTGGA CATGGGGGAA CATTGGGATG AGAGGTTTGG GAAGGTCATG TCGCCTGAAC AGCAAGTCGG CAGAAAGGAG GAAAAAGAAA CGTTTGCAGG GATTGGGACA AGAGCGATGG AGACTCGACT CCGACACGAG GCGCGCGGGC AACAACGGGA TGATGAGGTT GAGATGGTTT CTCCGGAAGG AAGTGGAAAC CTTAACGTTC ATATTGGGCC GGTCTCATAA TTGGACTTGA AGACATTTAA CGAAGAATCA TGGTTTTCAT ATGACGATAT CATAGATGAT TTTTGTTTCC CTTTTTTCAA ATTGAAGGTG TACTGTTATC AATATAATGT CGAATATAAG CTACGAAGAC AATATACATC TCTCATGAAT TTAGACTGCT CTGG
|
Protein sequence | MLHSLQPQNF PDDLGRSNSV YSSLSPATNS LPSSSFSARP HSAFETTTPR ARHSFLLPAV DRSPSSDGHS PQQSQALRRP PSLPNAAILL EHSPVRNSTL CRSHSTRSAR TKSIRRKPVP AMVLPAADKE VTIGVGLPPN HPFANVASQA MTRSSSSYGY EDMRYGTEKI HNRLGTSSQL ANRAVSAGVM TPVPPHRVTS RPHLERACAS SPSTPIRNTQ PPFPPSFSVH RSADPPPKPF PRRDTSPHLP EQYGQRLHSR FPEYTLSVDR PLPPAPTDTA SIRSTDTAPP SAATHGGRSS ADSARIMTPI HGSTHEQVGH SRESSCRSGH NVSLSVPPIM PKRLMSDDIV VRSKGSKKVQ QKIKVPKKVI LLGGETLADV EFSVDKLVSR DRVLEASTCF VRDENGLPVC FGDLLPPPGP VEAGKPTPKT VVFFIRTFWC GQCQDYTLAS ISVLSPEALE KAGIKVVIIG HGSWKVLKAY RRLFKCPFPI YVDGPKKLYS LMGMTKGAPK TAPWGHFWKG RAEYHQRAVP GQLVHGISNA LFKMPVKPPG DLTQLGGEFI FSPGSVCEFA HRMTHASDHM EAPEVIRLAG CDHPTVEETK AVELAESQKE ELEKLRMEME RWKKERAAEL EGIKMRKAAR RGIPYSRSLE IGPQMVQVYD FEYDFEDGLQ VAYEEGNEDE AEAVDMGEHW DERFGKVMSP EQQVGRKEEK ETFAGIGTRA METRLRHEAR GQQRDDEVEM VSPEGSGNLN VHIGPVS
|
| |