Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB05740 |
Symbol | |
ID | 3255915 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 1609152 |
End bp | 1612047 |
Gene Length | 2896 bp |
Protein Length | 793 aa |
Translation table | |
GC content | 49% |
IMG OID | 638255216 |
Product | conserved hypothetical protein |
Protein accession | XP_569315 |
Protein GI | 58264318 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00996322 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGAACTATAC TTGGTCTTCA TTGTACAATT TCTACCCTCT ACAGCCAGCC GGGATGCTGA AGGTCAACTT TTCCGCCGTA TAATAACGCC CAAAAAGAAG ACAATGCACG GCCCACATCT CTCCTATTAT CTCTCGGTCC ATTGAGCCGC CGAGGCAGGA CTTTCGGTTC ATGATTCGGA CCGGTGCAAT TACAACATTT CGGGCTCGTC ATCCGCCGTA CGAAAATGTG TTCGACTGCT CTCAAAACCA TGGGCTACCA CAAAAACACG AACAATCAGA TGTCGAATTC GTTGCCTGAT TGAGCTTTGG TCTTGATCAA GTTTATCTTC GCTGCAAAGT ACATTCGTAA TGGTCGCTTC CGAAGAGCAA CAGGTAAGTC CCCACCTTCA CGTGTCGGCC TTGCTTAACT CCCCGTCATA GACGATAAAC AATCTCCAAG CAAACTGGGT TTGGGTTCCC AACTGGATCG ACTCCTCGCC TGACAACTCT GCCGCTCGCC TTGTCTCTTT TACCCGCTCT TTTAGCTTGT CTTTTATCCC CTCGTCTGCC ATACTCTACT TTTCTGCCGA TACACGGTAC AAACTTCTCA TCAATGGCGC GAGAGTTGCA GTTGGTCCTA CAAAAGGTCA TTCAAGTATT TGGTACTACG AGACTCTCGA TATCGCACCG TATTTGAAGG AAGGACATAA TGATGTCGAG TTTTTGGTGA TTAGGTACTT TGCGAGTTCG AGAGGGGGGA TGCCCTTTGA GAGGACGACG TTCCCCGGGT TGACGGTGGT TGGGGAAGTT GGAAATGTGA ACTTGGCGTC AAAAGAGGGG TGGAAGGCGG TGGTGGATCA GAGTAGGGTG TATCCTACTG GTTTGGTGGA TGATGTTTTC CTTCATGTAA GTTTTTTTTT CCTTATAACG CCTATGGAGT ATCGTTGACT TGGAACAGAT AAACGAACGG GTTTCAGCTA CTTCTTCCCG AGCCACACCC TTGACACCCA TCCCATACTC GCTCAAAACT GTCAACGGCG AACTGCCCCC ATGGCGTCTC CGTCCGAGAT CTATCCCTCT ACCCGAAAGT AGCCCAGTCG CCGTCAACAA CATTCACGCA TGTCAAAGCC CCACTCCTTC TTCCGACTGG CTCACCTTCT TCGACAGTTC TAACCCCCTC ATTCTTCCAG CTGGAACATC TCATTCGCTT GACATTCAAG CCGAGGTCCA TTCTACAGCC TTCACAAAAT GGATTTTCTC CTCCGAGAAA GGTTCAGAGG TGAAACTGAG GTTAACGTAC TCTGAGGGAT ATGAACTCGA TCCGAGACAG TATCCATGGT TACGCACTAA AGGCGATCGC CTCGACTCAA AGAATGGTCA CCTCCTTGGG CCTTACGACG AGGTCACCCT CCAACTTTCT CCAGGCCAGT CAGTCATATA CGAACCTTTC TGGTTCCGGA CCTTCCGCCT CATCCGAGTC CAGATTGAGG TGGGAGACCA GCCAATCAAA CTGGTGTCGT TTGAAGCTAT GCAGGCCAAT TATCCGATGG GAGTTAAGGC CGAGTGGAAG GAGCCGGCAA TGAGGGAGAA TGAGAAGATC TGGGAGGTAT CGATTAGAAC TTTGAGGAAT TGTATGTTTG ATGGGTACTC GGACTGTCCA TTCTACGAGC AATTGCAGTA CGCATTATTT TCACAGTCTG TTTTACGTTT GAATTCAACG CTAACTCGTC CTTTCAAAGA TATTCTGGTG ACAGTCGATC TGTCGGGTTA TTCCACTACC TTCTTTCAGG CGACGACCGT CTCATGCGTC AAGCAATTAC AAATTTTGCA GCTTCAGTCA CTCCCGAAGG TCTCACCCAA TCCCGTTTCC CGTCACACGT CCCGCAAATC ATCGCCGCCT TCTCTCTCTA CTGGATCTTA CAGATATCCG ATCATCATCT ATACTTTGGC GATACACCTT ACACTAAATC ATTCGTCCCC AAGATTGATG GTGTCCTAGC ATTTTTCGAC TCACATATCG ATGGGCTTGG GCTTGTAAGT GGGATTTCGG AGGATGTGTG GCAGTATTGT GACTGGGTGA CGAGCTGGTC TGCGACGGAG GATCATCCGG ATAAAGGCGT ACCTACCTCT GGGCGAAAAT CGAATCGACA TACCTATCTG AGTCTGCTCT ACGCGTACGT TCTGAAGCAA GCGGCTCGGC TTCTGAGACA GGTGGGAAGA GCGGGAAATG CCACCGAGTA TGAAGAGCGG GCTGAAGCTG TTGTCAAGGC TATCAAGAAA CATTGTTACG ACGGAGAGTT CTTCACAGAC TCTACAGCTG ATATCGCGAA CGATTCGGCG TACTCCCAAC ACTGTCAAGT ATTCGCTACC CTTGCGGGCG TTATCCCTCC TTCTGAAGCA TCGCAGCTCC TTACAAACGC GTTCTCCAAC CCCAAATTCT CCAAATGTTC CTATGTCATG ATCTTTTACG CCCTTCGCGC CTTCGCTGTC GCCGGCGACG AGACTTATGA GCATTTCTAC AAGACCATTT GGAATCCATG GAGGAAGATG TTGAAAAACA ACCTTACCAC CTGGGAGGAA GACGATGTGA GGCAGCGGTC AGACTGCCAT GCATGGGGTA GTGTACCGGT CTATGAGTTT TGTGCGGAAG TCGCTGGGGT ACAGCCGCTA GAACCTGGAT GTAAAAAGAT CCTCTTCAAA CCTCGCCTAT CTTTGAGTGA CGAGTTGGAG GCAAAGATTG CGCTGGGGAA GGATAATTTG GCGGTGGTGA AGTGGTGGAA GGAAGGAGAG AAAAAGGTTG TGACATTAGT CTTGGAGAAG CCAGTGGTGG TTGTGGTTAA GAAGCCGGGA GAACAGCAAG AGAAGGAGAA TGATGAGCCG GTGACGAATC TCAGGTTAAT TTGTAAGATT TAAATT
|
Protein sequence | MVASEEQQTI NNLQANWVWV PNWIDSSPDN SAARLVSFTR SFSLSFIPSS AILYFSADTR YKLLINGARV AVGPTKGHSS IWYYETLDIA PYLKEGHNDV EFLVIRYFAS SRGGMPFERT TFPGLTVVGE VGNVNLASKE GWKAVVDQSR VYPTGLVDDV FLHINERVSA TSSRATPLTP IPYSLKTVNG ELPPWRLRPR SIPLPESSPV AVNNIHACQS PTPSSDWLTF FDSSNPLILP AGTSHSLDIQ AEVHSTAFTK WIFSSEKGSE VKLRLTYSEG YELDPRQYPW LRTKGDRLDS KNGHLLGPYD EVTLQLSPGQ SVIYEPFWFR TFRLIRVQIE VGDQPIKLVS FEAMQANYPM GVKAEWKEPA MRENEKIWEV SIRTLRNCMF DGYSDCPFYE QLQYSGDSRS VGLFHYLLSG DDRLMRQAIT NFAASVTPEG LTQSRFPSHV PQIIAAFSLY WILQISDHHL YFGDTPYTKS FVPKIDGVLA FFDSHIDGLG LVSGISEDVW QYCDWVTSWS ATEDHPDKGV PTSGRKSNRH TYLSLLYAYV LKQAARLLRQ VGRAGNATEY EERAEAVVKA IKKHCYDGEF FTDSTADIAN DSAYSQHCQV FATLAGVIPP SEASQLLTNA FSNPKFSKCS YVMIFYALRA FAVAGDETYE HFYKTIWNPW RKMLKNNLTT WEEDDVRQRS DCHAWGSVPV YEFCAEVAGV QPLEPGCKKI LFKPRLSLSD ELEAKIALGK DNLAVVKWWK EGEKKVVTLV LEKPVVVVVK KPGEQQEKEN DEPVTNLRLI CKI
|
| |