Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF01910 |
Symbol | |
ID | 3258305 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 555080 |
End bp | 559485 |
Gene Length | 4406 bp |
Protein Length | 1053 aa |
Translation table | |
GC content | 47% |
IMG OID | 638257316 |
Product | cleavage/polyadenylation specificity factor, putative |
Protein accession | XP_571518 |
Protein GI | 58268724 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.305657 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAAATTAGC TTGTATTCAG TGTTTCACCA CATTCCTTCT TCCGCCATCC CTTTACAGAG CAAGGAACTG TCGCCCGCAT TCCAACCATG TCTCTCTACG CAGATGACGA CCCGACCAAA GCCCTCCAGC TCACTGACGA GCAGCCCACT CTCCAACCGC CTACGGACCC GCAGCAAGCT CTTAATGCGG CTCTTGCCCT TGATTCAGAG TCACCGGAAC AGCAAGAAGG CCTTCAAAAT GCGGCTCTAA GGTTCGAAGA ACATCCAGAA AGGCTGCCAG AAGTGATACC GCGCTTGCTG GGGCTTATCA CAGAGGGTGG AGATTCGTTG TTGCGGTTTT GGACACTGGA CATGATTGCT TTAACGGTCG GAAGAAGTGG CTTAATGCTG GACGTAAAGT TACATGGTGC GTATTAAGAG AAACGTTTTA ACTGAATGCT GATGGTTGCC TAACATGACA GTCGCTCAAT ACTGTTTGAC GGCATTGAAC AAACTGCTTC ATTCCGACTC GGTACCGACA ATTAAGGCTG TCATTCCCAT TTTATCCACC ATATATCCGA TACTTTTTCG GCTACTTGCT ACTTCTAGAC CAAGTCAACA AGTCTTGGAT CTGTTTAATG TGGCGAAGTC CAAAATCATT TCATTTGCTT TGGACCCAAA CGCACGACCA TTTAATGTCG GTATCAAGGC TGTAGCGTGG AAATTCGTGC AAAGAGTGCT ATTGGCTGCT ACAAGAGCTG CCGGATCAGA TCCAAGGGTA CGTCTTTCCC CTTTGACGAG TTTATCGCTG ATAATTATAC AGTTACCGCA AAGAGGAGCA GGAAGCAATG ATGTCAATGT ATCGCACATT GTGCCCAACG GCTGTCTGTC TGCTTCCGAT ATAGAAAAGG AGGGCACTCT TCTTCAGACC CAGCTTGTAA CTCATCTGTA CTCTTCGTCA GACCCAGCCA TCCTCCATCC GCTTATTAAT ACCCTTCCGG CTATAGCGAA AATGCGGCCT ACCCTTGCTC CCCTTGTTGT ACATTCTCTG GCGAGCTGGA CACCCTCTGC TCTTGCCGCT GCTGGTAGAT CAGCTATGGA AATCCGAGCA GTAGATAAAA CCGTGAGGAT TGCAATCAGT CATCTGCTTC GTCACCCCCC CTTGTCGGCG CATGTTGCCC AACTCAACGA CGCCCTTGTT CGTCAGAAAG CTAGGATGGA GGCCGCTTTC CTAGCCGAGG CTTCGGCGCG TAAAGAGCGC CGTCAAAATC TTAAGCATAC AATTGACCCG GGGGCTGCAT CCGAGGCCGA ATCAAGCGAG CAAGCAGCCA AACGGGCCAA GTTGGACGGG CTGGGTGTGG GAAGTGGAAT AGGCAAAGGA CCCGAGATTG ATGTGAGCGG GATGAGGCTC GAAGAAGTGG TAGAGGTTGT TCTCAGCAGC CTAAGGGGTG TGAGTTCGGA ACTCCTTACA GGAACCATAG AGGTAAGTTC ATCCCTTAAT GTTATCCCAT ACAGAGCCTG ATTCGAGACA GAACGCAAAG CGAGCTCTGC AAGAAAACTC AGCCGATGCC CAGCCGCTGT TGGCAGCTGC ACTGGGTATC AGCAATGTCG TCAAGGAAGA AGAGGAAGAA GAGATCTTGA ACCCGCTGGA CATGGAGGAT GACGACGACG ACTTATTGGT CAGTTTCTTT TTTTCTCCTA GCTCCAGTCT CATCCATCAT GTCCTGACAT TTGAGGCAGA TGGACGGCCC AGAACCGCTT ATGGAAGAAG AAGAACCAAC ATCCTTCACG GATTTCGTTC TTCCTGCCCC TGAACCTCTT GAGTCGTCTG ACAAGGAGTT CATTTTTTCA GACACAATCG AACGAATATG GCAGACAGGT GCCGACCTCG CGAGTCTCCC GGACCCTAAA GATTCTGATG CAACGAAGTT GGCCGTAAAG CCGAAAGAAA TGTGGATGCT ATTACTAGCA CGGTTAGCAA CAAGGGGGGC AGATGTGAAG AGGAAGGTCA TTTGCGATTT CGTCATTGCC GATTTTGCTA ATCGGTAAGT CGAATGGTTT GCCCAAGGAA AATCAAGCTG ACGGATTGTA GATCAAAATT TGCGTCGGTG TGGCTTAACG AAGAATGGTA CAATGAAAAA ATCGGTGTTT CTTCGCCTGG CCAATATCTG TCCAACCTCG AAGCTATCGT AACTGCTTAC CTCCCAAAGG TTGACTCAAA AGACAAATCT CTTTCGACCT TCATCCTGAC GCTTCCCGCG ATCCCTCCTT CTCTCATCTC GACTCTTGAG ACGATCTGTC AAGAACCTGA AAGGGCTCTT GTTGGATTTT TAGCTTTACG AGATATTGTA GAGGCCAGGC CACCCGTCAG ACCTCAAGCT CTCCAAACAT TATTAGAGCT TTGTACACAC CCTGACAGGA AAATTCGCGT GATGGCGATC ATCACCACCG TCCGGCGATG GGGCAATGAT TCTCCAATGA TGCCATTCCT TACCAAATAT GCGCTCGGAG TTCTTTGGCG ACTCATCAAT GATGACATCA AGAGCGAAGA CGTGGATATG GAGGAGGGCG AACAGGCTGA CGAAAAGATC CAGTCAAAGT TCTTGGGAGA ACCTAATGCG GACAACGTGC AACAACATGT GGAGTTAGCT TTCGCTCTTG CAAAGCGGAA GCAAGATTTG CTAGATGACA TATTCAGGCT TTATCCTCGC CTCGAGCCTG CAGTGCAAGA CGTTGTGGAG GCACAGTTGA TGCCATTGAT CCAAAGTCTG GGTGCTACTG AAAAGCTACT GGAGATTCTG CGAAAGTTCC CTAATGGAGC TGACAAATTG GTAATGCGCG TTGTGGGAGT GCTGAGTGCA GAAGGAAGCA AGACGCTGGT AACTTTGATG AAGACATTGC TAAGTGAAAG AGATTTGGAT CCGAGATTCG TGATACCGAT TGTGGGAGAT CTGGACAAGG TCAGTGGATC AAGATATTAC CGTTTAGATC ATCAGCTGAT GATTTTGTAG GCCGAAATCG AAAAACAACT CCCTCGTATT GTATCCCTCC TGGGAGACAC AGATTCCAAA GATATGGTTA AGACTGCTTT TGCTTCAATG CTTCAAAAAA TGACTCCATC TGATCTGATG GTCGCATTGC ATCAAGAAGG CGCCCCGTTG AAGTTGACTA TTGAAGGTAA ACCATTACAG TCTCGTGATA AGCACGGCAG CTGACGAAGA ATAGCTATCG GCATATGTTT CTCTATGACC ACCGTTTTTC GATCTGATGT TCTTGCGAAT GCCATGTCTC GCATTGCTGA TCTTCCTACT ATCCCCCTCA TTTTTGTTCG TACTATTATC CAAGTTGTTA CTACCTACAA GTCCCTTGCG CCCTTCGTTG CGAACCACAT CCTCCCCAAA CTCGTTACCA AAAAGATTTG GGAAATCCCC CAACTTTGGG ACGGCTTCAT CATGCTTGCC AAAAGAATCG CTCCGGCCAG TTTTGGGGCT TTATTGCAGC TGCCTAAAGA ACAGCTTAAG CAAGTGGTTG AGAAGCAGCC GGGATTGAAA TCGGGCCTCA AAGGATTTTT GTCGAACAAG CCTGGTAGCA AGGCTGCGAT GGCTGAGGTG GGTCTTCATT AACGCATCAT TGATATCAAG AAAGGCATTG ACTCGTAATA TCAGATCTTC GGTGATGATT GAACGACTTT ATCGAAAGGC ATTTGATGTT GAAGCCATCG GTCCGGAAAA TGGTTTGAAC CTGAAGTAGC AATTGAAACA ATGGCAAGAA GTTTGCTGTG TCTTGAAGCA GTGTCTTACG AGTGTTAACG AACGATGACC AATCTACACT TCAGAGAAAT GATGATAGAG CCAGGTGCTG TACGTGATAG GGATTATAAT GTACAAACCT GGTTCAATTG CTTCAATTTG TCTCTCGCTG TGCTTCAAAA TCAACATTGT CGCTGTTTTT GACAATTTTT AGTTTTCTGT TCCCTAATCC AATACCGATG AAAGAGCAAA AGCGAATTAG ACCTGCTTTA ATGTTTTAAT CGGTAGGGCG AAAATGGAAA CAAGATCACT AAGTCGATTG TACCATTGAG AGACGCCATC ATCAACGGCG ACACTACTGA GATCATGATC GCATTGAGAA GCATCACGCT TTATTTGCTT CAAGCGAAAG CCTCAAGGCG AAATGAAGAT GTGAGTCTAG CGGAGTGGGT TATTGGCGAT TATCCATGCA AGTTCGTAAG TGATCCATCT CTCATCATAC TAAGCAAAAT GATCATTGTG AAAGCTAGAG ATGTTGTCAT TTTAGTGACG ACTCCAGCAC CGAGTCAAAG GAAGGGCCTC CATCGCCATC ACTACCGGAA GAATAATCCT TTGTTTTAGC CAGGAGGGGA GGGGCGTTGG TTCCTAGTAG TACTTAAAAC AGTAGCAGCG TAAGAA
|
Protein sequence | MSLYADDDPT KALQLTDEQP TLQPPTDPQQ ALNAALALDS ESPEQQEGLQ NAALRFEEHP ERLPEVIPRL LGLITEGGDS LLRFWTLDMI ALTVGRSGLM LDVKLHVAQY CLTALNKLLH SDSVPTIKAV IPILSTIYPI LFRLLATSRP SQQVLDLFNV AKSKIISFAL DPNARPFNVG IKAVAWKFVQ RVLLAATRAA GSDPRLPQRG AGSNDVNVSH IVPNGCLSAS DIEKEGTLLQ TQLVTHLYSS SDPAILHPLI NTLPAIAKMR PTLAPLVVHS LASWTPSALA AAGRSAMEIR AVDKTVRIAI SHLLRHPPLS AHVAQLNDAL VRQKARMEAA FLAEASARKE RRQNLKHTID PGAASEAESS EQAAKRAKLD GLGVGSGIGK GPEIDVSGMR LEEVVEVVLS SLRGVSSELL TGTIENAKRA LQENSADAQP LLAAALGISN VVKEEEEEEI LNPLDMEDDD DDLLMDGPEP LMEEEEPTSF TDFVLPAPEP LESSDKEFIF SDTIERIWQT GADLASLPDP KDSDATKLAV KPKEMWMLLL ARLATRGADV KRKVICDFVI ADFANRSKFA SVWLNEEWYN EKIGVSSPGQ YLSNLEAIVT AYLPKVDSKD KSLSTFILTL PAIPPSLIST LETICQEPER ALVGFLALRD IVEARPPVRP QALQTLLELC THPDRKIRVM AIITTVRRWG NDSPMMPFLT KYALGVLWRL INDDIKSEDV DMEEGEQADE KIQSKFLGEP NADNVQQHVE LAFALAKRKQ DLLDDIFRLY PRLEPAVQDV VEAQLMPLIQ SLGATEKLLE ILRKFPNGAD KLVMRVVGVL SAEGSKTLVT LMKTLLSERD LDPRFVIPIV GDLDKAEIEK QLPRIVSLLG DTDSKDMVKT AFASMLQKMT PSDLMVALHQ EGAPLKLTIE AIGICFSMTT VFRSDVLANA MSRIADLPTI PLIFVRTIIQ VVTTYKSLAP FVANHILPKL VTKKIWEIPQ LWDGFIMLAK RIAPASFGAL LQLPKEQLKQ VVEKQPGLKS GLKGFLSNKP GSKAAMAEIF GDD
|
| |