Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK00710 |
Symbol | |
ID | 3254589 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 228538 |
End bp | 233458 |
Gene Length | 4921 bp |
Protein Length | 1531 aa |
Translation table | |
GC content | 51% |
IMG OID | 638253560 |
Product | hypothetical protein |
Protein accession | XP_567633 |
Protein GI | 58260446 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAACA ACAACCAAGA GCAGAACCCT TTTGCTGGCT TCAACATGGC ACAGTTGAAT CAGCTCAATC CTGCATTGTT CGCCTCCCTG GCGAACCAAA TGGGAATGAA TAACCAGGCA GGTCCATCCC AACCACAGAA TCAAGCCCAG CAGCCAGCAA TGGGAAACCT CAGAAACATC CCTCCTCAGT TATTACAGCA ATTGCAACAG CAGCAACAGC AGAGGCAACA ATCTGGTCAA CCTCCATCCC AGGCATCTAT CAACGACACT TTGCAAAACA TCATGCTGCC TACCTTTAAC CGCCTTCAAA GCCAGAACAG TCAACAAATG CAACAAGAAA GACTACAGAC CTCAAATAAT CCTGCCATCC AGGCCCAGGT TGCCGCTCAA CAAGCAAAGG CGCGCCAAGC CGCCCAAGCT GTTCAGGCAG CACAAGCCCA GGCTCATGCC CAGGCGCAAG CTCAGGCTCA AGCTCAAGCG CAAACACAGC AACAAAATTT GGGGATAGGT GTTCAAGGCA TGACAGGGAT GCAAGGCATG GGAGGATTGC CAGGGATGGG AGGGATAGGG GGTTTTGGTG GAATGGGGCA GAACCAAACG CAGCAGCCAA ATCAGCCGAA TCAAGTGCAG CGGAACCAAA ATCAGTTTTC AGTACACTCG CACCAACCGG ACCTTGGTCT CAATCAATCA TCACAACCTC AAAGTAGCAA CACGAATATT CAACAAGGAA ACATAGGCCA AGGTCAGAGC GGTCAATTTA ACCTGGGGAT GAACATGGGA ATGATAGGCG ACGATTCGGT TCGTCGTGCG GCTCTACAAA GGTTCGTCAA GTTGCTCAAC TGTGAAATCT GATTCTTACT TTCTACACAG TATGTTGGCA CAGTCAACTC AGAACCAAAA TCAAAACCAG CCCCCTCCGC CCCAACCTGC GCCATCCAAT CCTACACCTC AAATCAGCCC CGAAGTCCAC GAATTCATGC GCACTCGGTC GGACATTGCT TCCGCCGTTT TCAAGAAACA CGGGAACCCG CAGGCTGCAT TGGAAGATCT CCAGAGGATT GTTGGGATGT TGAAGGGTCA AAACCAGGCT CAAAACCAAC CTCAAGCTCA GACTGGGAAT AATATGGGTG CAGCTGGTGT TCCGGTCGGT ATTAATGGAC AGATGTTCAA CCAGAGAATT GCATCTGGAC AGACTCAATC TGCTCAGAAC CAAGGTCAGA GCCAGCAGGC TGGTCACCAA AGAATACCTT CCAGTGGCAA TTTTGATCTT CCAGGGTCGC AGTTCGCCAA CACCTTGAGT CAACTTGAGG CTGTAAGTAA TGTTCATCTA ATACATAATG AGTGCTAACA ACAAACAGCT TCAGCAGGTC AAGGAAAGGC AACAGCGCGG GGGGCTGAAA ACTCCTCAAA TGCCCCAAAG CAACACTCTA CCAATGACTT CTCCGCAAAT GGGCAACAAT GCCAATAACC TCAACCCTCC ATTTAACTTA ACTCTACCCA ATCAACCTAA TAATGCCCAA AATCCTGTTC AAAACAGGCA AACGCCTCGG ATGTCGAATA ACATCCCGAC TCTTCCTTCT GGTCCTGCTG CTCCTCGGCC ACCGCAAGGC ACAAACCCTG TTCAATCTGG CGCTCGACCG CTACCTAATC AAGTGATTGC CACTTGGCCA CTCGATAAAC TTATTGGTGC ATCTTCGAAT CTCAGTAAGA AGATCATTGA GTCGGAAGCT GCGCATGGAA ATATCATTCA GCCAGGAAAG CCTTCAACTA TGGGTCTGGG TGTGCCCGGA AGAACGGCTG CAGAGCAAGC AGCCAAGTTT CAATTGTTGG TCATGATTGC TGAGATCAAG AGGCGAGCTG TGCCCGTCCC GGATGATGCG CTGAACGTTG CTGCTAGTCT GTAGGTTGAC TTGTTTACGT TCTTGATCAA TTACTGATGT TGAATAAGAC TACCTGTGGC AAATGCGAAA GAGCAAATTC TACAGATGGA GCACAGCCAG CTTAGCGAAG TTGCCCGAGC CACTATGAGC CAACTCTCGC GCCAAGCTCA AGCGCAAGCG CAAGCGCAAG CAGCTCAAGC TCAAGGTCAA AACCAGTCTC AGTTAGGACA AGCGCCCGGT CAGCAAGGGT CGCAAGGAAC TCTAAATGCT CAACAAATGG CCCGCTTTCT GCAACAACAG CAACAGCAAA GTCAACAGCA ACAACAGAAC CAGCAACAAG CCTTGCGAGG CTCACAGGCT AACCCCATAG AACTTGTCAC TCCAACCATG ACCAACACCC AGCTTCCGCA TTTCCCTACC CCTAGCCAGA CTTCAGCAAC GATGGCCAAT TCCGCCTCTC AACATCCTCA GCCTAACCAA CTTCAGGGCA ACCAGTCCTC CCAACCTCCA CAGGCGAACC CTAACTCTAC TCCTCAATTG TCTCAAGCTA CGACACAGTC CAACGCAGGA CCGGGAGGTC CGGATATCAA CGACATCCCG GAGGAGAGTT TCTACGGTTA TTTGAGACAG ATGATGGCTA AGAACGGCAT AACCAGTGGG ATACCGACCA TCGAAGGACG ACAGGTCAAC CTGTACAAGT TGTTCCAGAT GGTAATCAAG AATAACGGAT CTAGCCATGT AAGTCCACGA GGTCATCGAA AGACATCAGG TTGATAATCA AAGCAGATTG AGCCTATGCG CTGGACGTTC GTTGCTGGCC AACTTGGGTT TGCCACTGAG CCCACTCAAC CTGGTCAACC ACCTGTTTCC AACATGCAGG TAGCTCAACA AGTGCGCCAC ATCTACATCA GTCTTCTTCA ACCAATGGAA GATGCCTATA TGACGATGAA GCGAAACAAA ATGGCTCAAG CAAATATGAG GAGGGCAGGT ATGCCACCTG GTCAAGGATC AACCCCCGGA CAGGCGCCGG GTCAAGTGAG ACCTCCGATG AACGTCGCTG GTCCTAGTGT CACACCAACG CTGGCATCCC AAAATCAAAA TCAAGGCCAG CCTTTATCAG AGCAGCAAAG AAGGTTTTTG GAAGCGGCCA AGAACGCTGG TGCTGGAAAT GTAGGTACCG GTGTTGGGCT TGGAGAGGCG TGGCAGGGTC AGGCGCAGCA AAATGCTGGC CAAGGAAGCC AGTCTCAATC AATTCAGCCA CCAACTCAGC CTCAGACTCA ACAGCCTGCG GGCCACCAGA CTGCAACACC GACTGCTCAA AGACTGGAGC TTTCCGCATT GAAGATCTTA GAGTTCATCA GGATGCAGGA ATCGCTCATT CGGAACGGGC TTGGTGAGTG AACTCAGGCC ACGATCAGCG TGTCTGTTGC TGATTTATTG ACAGAAAAAC AACCTAACAC GGACGTCAAT CCCGACATTT ACCGAAACGA GCTTCGCACT ATTCTTCCGA TTGCAAAAGA AGCAGAATCA AGATTGCCCA TCTACTTATT GATGATTTCG GATGGCGGGT CTGTTGAACC TCAGTCTGTG TTAACGATTA TCAACATGGT AGGTACTTGC GATGATTGAA GATGTCAATA GCTGACAGAA AACAGTGTAC CACGCCTTCC TACGCCGCCC TCCTCGCTGA ACGTAATCGT TTCATCTTCA ATCTCGCCGA TTTCCCTCGT CTTCGTGCCG GTCTTAGCCA GTTTCTCTCG AAAGCACAAG CTACCTATCG CACAATGGCT GAAAAGCCTG TTGGTCAGGC GAAGCTCAAG AGCATGGTAT CAGCTGTTCA GATGGCAAAG GGATTGATAG CTCAAAAAGA TCGAGAGAAG CAGGCCCAAA CCACACCCAA TGCCGCTCCA GCGAATGTGG CTGCTGCGGG GTCAGGTCTG GGCCTCAACC TTGGCAGTAC TCAAGGGGCA ACTGGCATTG GACTTGACGT TCAACAAATG GTCCAACAGG CTCACAATCA CGCCGCCTCT CAGCTACAGT CTAGTCAAAA CCAACAACAA CAACAGCCGC CCGCTCAACC TCAAACTCTC CTCCCACCGC CTGCCCTTCC TAAAGACAGT CCTTCGAATC TCCAAGAGGC TATCCGACAC AAGGGCCTGC GAGTCGAAGA CCTCAAACCG CCACCGTCCA AACGTCAAAA GAGCAAGGGT TCTCCCGCCA CACCCGCAAA TGCTCAAGTA CCAACACCAG AGTCTGCAAA AACGCCGGCT AACACTGCCG TTATTGGAAC CCAAGGGACA CTAGGGGAAA GTCCAAAAGA AAAGAAGGCC ACGAAGCGGA AGAGACAATC ATCAACTGCC ACAGCAGCTG GCACCGAAAA GCCAGTCAAG CAGACGAAGG CAGAGGCTGC CAAGGCCAAG AAAGCTGCTG TTGCGGCCGC TGCGGCAGTC GCTATTCCCG CTACGGTACC AGAGGATCCT GTTAATGCTC TTGGTATCCA GCTTGCGGCA GATGAGGCTG CTACCAAAGC AGAGATTGCT CAGCACAAAG CTTTCTTTGA TGACCAGAGG GCTTTGGCTA GCGCTGCAGC CCCTGGGGAT GGTGAGAAAA AGGTTGGAGG GGAAGATGCG ATGGCGGTCT TCACAAAGAT ATTTGAGGCG CATCAGGCAG GTATTCAGGC GGATATGGTC CGAACAGATC CTGTCGCCCA GTCAGCATCC ACGGCAGCGG GCCACATGCC CCCGGTAACC AGCGCCGGTG ACCCCAACGA CCAAGATTTA TTCGATACCT ACTTTGATGT GACTTTGTTC AGCCTCGCGG CCGATCTTCC TACACCCGAT CTTGTCATCG AAGCAACCCC TCAGGAAGTG GATGGCGAGA GTCCAGAAAG TGTAAGGACA GTTGGAAGTA CCGCTGGACA CACAGTGCCT GGCAAAGAAG TTAAGTCAGG TTCTAAAGAA GAGAAAGGGA AGGAGGAGGG GACAAAAATA ATATTAGGAA GCCCTGGAAG TATGGCGTAT AACGGAGGCA TCGAATGGTG A
|
Protein sequence | MSNNNQEQNP FAGFNMAQLN QLNPALFASL ANQMGMNNQA GPSQPQNQAQ QPAMGNLRNI PPQLLQQLQQ QQQQRQQSGQ PPSQASINDT LQNIMLPTFN RLQSQNSQQM QQERLQTSNN PAIQAQVAAQ QAKARQAAQA VQAAQAQAHA QAQAQAQAQA QTQQQNLGIG VQGMTGMQGM GGLPGMGGIG GFGGMGQNQT QQPNQPNQVQ RNQNQFSVHS HQPDLGLNQS SQPQSSNTNI QQGNIGQGQS GQFNLGMNMG MIGDDSSTQN QNQNQPPPPQ PAPSNPTPQI SPEVHEFMRT RSDIASAVFK KHGNPQAALE DLQRIVGMLK GQNQAQNQPQ AQTGNNMGAA GVPVGINGQM FNQRIASGQT QSAQNQGQSQ QAGHQRIPSS GNFDLPGSQF ANTLSQLEAL QQVKERQQRG GLKTPQMPQS NTLPMTSPQM GNNANNLNPP FNLTLPNQPN NAQNPVQNRQ TPRMSNNIPT LPSGPAAPRP PQGTNPVQSG ARPLPNQVIA TWPLDKLIGA SSNLSKKIIE SEAAHGNIIQ PGKPSTMGLG VPGRTAAEQA AKFQLLVMIA EIKRRAVPVP DDALNVAASL LPVANAKEQI LQMEHSQLSE VARATMSQLS RQAQAQAQAQ AAQAQGQNQS QLGQAPGQQG SQGTLNAQQM ARFLQQQQQQ SQQQQQNQQQ ALRGSQANPI ELVTPTMTNT QLPHFPTPSQ TSATMANSAS QHPQPNQLQG NQSSQPPQAN PNSTPQLSQA TTQSNAGPGG PDINDIPEES FYGYLRQMMA KNGITSGIPT IEGRQVNLYK LFQMVIKNNG SSHIEPMRWT FVAGQLGFAT EPTQPGQPPV SNMQVAQQVR HIYISLLQPM EDAYMTMKRN KMAQANMRRA GMPPGQGSTP GQAPGQVRPP MNVAGPSVTP TLASQNQNQG QPLSEQQRRF LEAAKNAGAG NVGTGVGLGE AWQGQAQQNA GQGSQSQSIQ PPTQPQTQQP AGHQTATPTA QRLELSALKI LEFIRMQESL IRNGLEKQPN TDVNPDIYRN ELRTILPIAK EAESRLPIYL LMISDGGSVE PQSVLTIINM CTTPSYAALL AERNRFIFNL ADFPRLRAGL SQFLSKAQAT YRTMAEKPVG QAKLKSMVSA VQMAKGLIAQ KDREKQAQTT PNAAPANVAA AGSGLGLNLG STQGATGIGL DVQQMVQQAH NHAASQLQSS QNQQQQQPPA QPQTLLPPPA LPKDSPSNLQ EAIRHKGLRV EDLKPPPSKR QKSKGSPATP ANAQVPTPES AKTPANTAVI GTQGTLGESP KEKKATKRKR QSSTATAAGT EKPVKQTKAE AAKAKKAAVA AAAAVAIPAT VPEDPVNALG IQLAADEAAT KAEIAQHKAF FDDQRALASA AAPGDGEKKV GGEDAMAVFT KIFEAHQAGI QADMVRTDPV AQSASTAAGH MPPVTSAGDP NDQDLFDTYF DVTLFSLAAD LPTPDLVIEA TPQEVDGESP ESVRTVGSTA GHTVPGKEVK SGSKEEKGKE EGTKIILGSP GSMAYNGGIE W
|
| |