Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNL04650 |
Symbol | |
ID | 3254919 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | - |
Start bp | 298782 |
End bp | 301631 |
Gene Length | 2850 bp |
Protein Length | 847 aa |
Translation table | |
GC content | 48% |
IMG OID | 638253936 |
Product | expressed protein |
Protein accession | XP_568008 |
Protein GI | 58261196 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.793676 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGTCTT TTTGCTTTCT TTCATCCCCC AAATTGCCCT CCATGCTATT CCACAGCGAC AAACGCCCCC GCAGCGCCCT CACAGCAATA TTCACAGAGT CAGGACACGC AGACATCCTC CTCGGCATAT TTAACAACCT CGCCATACAA CATATCCCCG CTCTTCTACG CGTTAGTGCT CTTTCTCCTC TCAGCCCAGT CTACAAGCTG ACACTACACA GGTTAACAAA TACGTAAACT CAGTATTTAC TTCTTCTTCC CAGCTCCAGC TTCGTTATCG AAAGTCCTAT CATTCCGTTT CCCCTCATAC CCAATCGTCG GCCCCCAGCC GACTCACAAA CTCTGCCGAA GAGCTCGGTC TTCTTATTGA ACAAGAAAAG CGCCTATCCA ATCTCTGCCC GTCAGAGATC AGGTGCTTAC ACTTTCCCAA TGCTGAAATA GCTCAAGTTC AACTTAACCA CATCCTCGTA ACAGAGCGCA TTAATAGGGA GATCGTATTT CCAAATGCCC CCGATTCGAA CTATCTCTGG GATGGATGGT CTGTTTGGAG GATCAGAAAC GCCTACGAAT TTGATGGAGA GTCAAGAGGT GGGAAGAAGG CGCAAGGTGT ATGGAAATGG AAAATGAACT TCGGTGCGCC AATCGACAGT GCTACCATGT GTGTAGAAGA CAATGTCGTG GCTGTCGCCT ACGCTGTGTA GGTACCTTCT CTTTTCTTTT CTCTCCACGA CACTCAGATT ACCATAGCAA CCACCCACCA TGTGACGAAA TTTCCTACAA CTCTACACTC AAGGTAGCCC ATCGAATCTA CTTTTACAGT CTCATTCCGC CGGTCGGCAC ACCTCAACAA TCTGATGGTA TCTTTCAACC ACCGATTCCC CACCCAGAGG CCAAGCTGCC ATATGTTGAA GTTCTTGTCC CAGCAAAGCA CCATATGCAC CATATTGAGC TTCAACTGGG CCCAGGTGGT AAAGTTGGTT TACTTCTGGT TTCTGCAGAC ACGACAAGGC AAAATTTTGT AGGCCTTTGG GACTGGAAAG CGGGTGTGTC TATTGGTGTA AGTCTGTACC GTTTTCCTTC GAGACAAAAA CTGACATGTG GTAGAAACTT TCTCCTACGC CCGACGTCCC TGTGTGCGAA GATTTCCGCT TCTTGGGCCC CTTTATTCTT TCTTCTGTTA TGAGGGACCT TGTCAAGAAG CCTGAAGACG AGCTGATGGT GCTTCATAAC AGGGAATTGG ATTCCTGTGC ATCCGGAGGT CGCCCAAGTT GTCTTTCGGC TTCCACCCCG CAGAGGCCCA AGCCTGCCAG TGTCTACGAT GATATTGATT ATGATTCTGA GGAGGACACC TTTCAGGGCG TTTCAGAGGC TGTTCCCCCG CCCTTGGAGA CGGTGTACAG CATCGACACA CACGCTCCCT TACCCACAGA ACGAGGTACT CGGCCGTTCA AGCGCAGCTA TATAACACAT TTCGGCCCAG GCTCCGAAAT TGATGAAGCA TATACCTGGG ACTGGGACGA AATTCCGTTT TGCATGCCTC TCGCTTCCCT TCAACTTCCT ACTTTGAATA CAACCCCTTT GGGTGTTTAC GAAACCCCTC TAGATTCTAT CATGGATTCT TTACATGTCG ATACTGACTT CCATCCAGTT CGCCTCGCCT TTGACAGCTT CTCGGTTGAT CCTGCGCTCT TGCAGGGCGA GCGATTGGGT GTTATACCCT TTACCTTGTC TGGTGATACC AGTGATGGTC TTGGTGCGCC AAATGTTGTC AGATGTAAGG GCATAATCGA CATTACGACC CTTCTTGGAA AGACTATCGA AGCGATGCAA TGGCACATGG CGAAAACAAA GGGTATAGAT ATCAAACTGT TAGCTCTCGG TGCGAAGCTC AACAAAAAGA CATGGAGAGA TCAGTTAATA GAGGCATTTG AGCTGTTCCT TCGTTCTGGG CAGGATGATG GGTGGGAAAC TGAGGAAGAT GAGCCGAGCC GGTACAAGTC CAAAAAGAGC AGATCAAAGA GAATCAGCAA AAGAGCTACA ATTATATCTC CGAAACAAAC CAAATCCACT TACAAGTGGG AAAATCCACA TGTTGTTCCC TGGAAATCTT GGGATTCCGG GGTCAGCATA CGTTTCAACC ATCGCAATAT CGTAACGGCT TGGGGTACAC GAGCGGTCAT GGTTGAGCCT GTGCCAGATG CAAAGCCCTT GGTGAGGAAG AAGGATGGGC AGCTCATGAG GCAATACTGG ATGACCTTGC GGGATTACTC TCGACACACT TTCCATGATC ATCCTGGACG TCCACGTCAA GTCTTGGGAG GTCTTCAACA TCCATATTTA TCTCTCGACA AAGCTTCATT ATCCACTGCC TCTTCATCCA CATCACCTTC TCCCCCTGAA AGCGGTGCCA AACGTGAGAA ACCGCGCGTC ATCAAATGTC CGCTAGTGAA CCCTGTCCCT GTCGCCCAGA TGGACAACAA TTTGATTGAA CACAGTACCT CGGAGATTCA GGATAAAAGA CTTTGGACGA AGCGCATATT TTATGATGAG CTCTTGCAGT CGAAATTGAT GTATAAGGAA GTCAGAGCAA AGATCTTGTT GGATCAAAAG AGGCCATATC GCTCGTCTGC TTTCGATGGC AAGATGGTCG TTATCGGTGT GGTAAGTTTC TTCCTTCTAT TTTGCTAGTT GAAGATTGAG GCTGATTGAT GGGGAAGGAA TCCGGCGCTC ATATATTTAC ATTCTGAGGA GAGTTCGGAA GGGCAAGCCA TTTGTCAGAG TCTGATGTGC ATGATGGCGG CTTGAGTGTG TTGCTTGCAA TGCAATTAAG AAAAGAATAA TGGAGTTTTT
|
Protein sequence | MLSFCFLSSP KLPSMLFHSD KRPRSALTAI FTESGHADIL LGIFNNLAIQ HIPALLRVNK YVNSVFTSSS QLQLRYRKSY HSVSPHTQSS APSRLTNSAE ELGLLIEQEK RLSNLCPSEI RCLHFPNAEI AQVQLNHILV TERINREIVF PNAPDSNYLW DGWSVWRIRN AYEFDGESRG GKKAQGVWKW KMNFGAPIDS ATMCVEDNVV AVAYAVNHPP CDEISYNSTL KVAHRIYFYS LIPPVGTPQQ SDGIFQPPIP HPEAKLPYVE VLVPAKHHMH HIELQLGPGG KVGLLLVSAD TTRQNFVGLW DWKAGVSIGK LSPTPDVPVC EDFRFLGPFI LSSVMRDLVK KPEDELMVLH NRELDSCASG GRPSCLSAST PQRPKPASVY DDIDYDSEED TFQGVSEAVP PPLETVYSID THAPLPTERG TRPFKRSYIT HFGPGSEIDE AYTWDWDEIP FCMPLASLQL PTLNTTPLGV YETPLDSIMD SLHVDTDFHP VRLAFDSFSV DPALLQGERL GVIPFTLSGD TSDGLGAPNV VRCKGIIDIT TLLGKTIEAM QWHMAKTKGI DIKLLALGAK LNKKTWRDQL IEAFELFLRS GQDDGWETEE DEPSRYKSKK SRSKRISKRA TIISPKQTKS TYKWENPHVV PWKSWDSGVS IRFNHRNIVT AWGTRAVMVE PVPDAKPLVR KKDGQLMRQY WMTLRDYSRH TFHDHPGRPR QVLGGLQHPY LSLDKASLST ASSSTSPSPP ESGAKREKPR VIKCPLVNPV PVAQMDNNLI EHSTSEIQDK RLWTKRIFYD ELLQSKLMYK EVRAKILLDQ KRPYRSSAFD GKMVVIGVES GAHIFTF
|
| |