Gene CNK00710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK00710 
Symbol 
ID3254589 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp228538 
End bp233458 
Gene Length4921 bp 
Protein Length1531 aa 
Translation table 
GC content51% 
IMG OID638253560 
Producthypothetical protein 
Protein accessionXP_567633 
Protein GI58260446 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAACA ACAACCAAGA GCAGAACCCT TTTGCTGGCT TCAACATGGC ACAGTTGAAT 
CAGCTCAATC CTGCATTGTT CGCCTCCCTG GCGAACCAAA TGGGAATGAA TAACCAGGCA
GGTCCATCCC AACCACAGAA TCAAGCCCAG CAGCCAGCAA TGGGAAACCT CAGAAACATC
CCTCCTCAGT TATTACAGCA ATTGCAACAG CAGCAACAGC AGAGGCAACA ATCTGGTCAA
CCTCCATCCC AGGCATCTAT CAACGACACT TTGCAAAACA TCATGCTGCC TACCTTTAAC
CGCCTTCAAA GCCAGAACAG TCAACAAATG CAACAAGAAA GACTACAGAC CTCAAATAAT
CCTGCCATCC AGGCCCAGGT TGCCGCTCAA CAAGCAAAGG CGCGCCAAGC CGCCCAAGCT
GTTCAGGCAG CACAAGCCCA GGCTCATGCC CAGGCGCAAG CTCAGGCTCA AGCTCAAGCG
CAAACACAGC AACAAAATTT GGGGATAGGT GTTCAAGGCA TGACAGGGAT GCAAGGCATG
GGAGGATTGC CAGGGATGGG AGGGATAGGG GGTTTTGGTG GAATGGGGCA GAACCAAACG
CAGCAGCCAA ATCAGCCGAA TCAAGTGCAG CGGAACCAAA ATCAGTTTTC AGTACACTCG
CACCAACCGG ACCTTGGTCT CAATCAATCA TCACAACCTC AAAGTAGCAA CACGAATATT
CAACAAGGAA ACATAGGCCA AGGTCAGAGC GGTCAATTTA ACCTGGGGAT GAACATGGGA
ATGATAGGCG ACGATTCGGT TCGTCGTGCG GCTCTACAAA GGTTCGTCAA GTTGCTCAAC
TGTGAAATCT GATTCTTACT TTCTACACAG TATGTTGGCA CAGTCAACTC AGAACCAAAA
TCAAAACCAG CCCCCTCCGC CCCAACCTGC GCCATCCAAT CCTACACCTC AAATCAGCCC
CGAAGTCCAC GAATTCATGC GCACTCGGTC GGACATTGCT TCCGCCGTTT TCAAGAAACA
CGGGAACCCG CAGGCTGCAT TGGAAGATCT CCAGAGGATT GTTGGGATGT TGAAGGGTCA
AAACCAGGCT CAAAACCAAC CTCAAGCTCA GACTGGGAAT AATATGGGTG CAGCTGGTGT
TCCGGTCGGT ATTAATGGAC AGATGTTCAA CCAGAGAATT GCATCTGGAC AGACTCAATC
TGCTCAGAAC CAAGGTCAGA GCCAGCAGGC TGGTCACCAA AGAATACCTT CCAGTGGCAA
TTTTGATCTT CCAGGGTCGC AGTTCGCCAA CACCTTGAGT CAACTTGAGG CTGTAAGTAA
TGTTCATCTA ATACATAATG AGTGCTAACA ACAAACAGCT TCAGCAGGTC AAGGAAAGGC
AACAGCGCGG GGGGCTGAAA ACTCCTCAAA TGCCCCAAAG CAACACTCTA CCAATGACTT
CTCCGCAAAT GGGCAACAAT GCCAATAACC TCAACCCTCC ATTTAACTTA ACTCTACCCA
ATCAACCTAA TAATGCCCAA AATCCTGTTC AAAACAGGCA AACGCCTCGG ATGTCGAATA
ACATCCCGAC TCTTCCTTCT GGTCCTGCTG CTCCTCGGCC ACCGCAAGGC ACAAACCCTG
TTCAATCTGG CGCTCGACCG CTACCTAATC AAGTGATTGC CACTTGGCCA CTCGATAAAC
TTATTGGTGC ATCTTCGAAT CTCAGTAAGA AGATCATTGA GTCGGAAGCT GCGCATGGAA
ATATCATTCA GCCAGGAAAG CCTTCAACTA TGGGTCTGGG TGTGCCCGGA AGAACGGCTG
CAGAGCAAGC AGCCAAGTTT CAATTGTTGG TCATGATTGC TGAGATCAAG AGGCGAGCTG
TGCCCGTCCC GGATGATGCG CTGAACGTTG CTGCTAGTCT GTAGGTTGAC TTGTTTACGT
TCTTGATCAA TTACTGATGT TGAATAAGAC TACCTGTGGC AAATGCGAAA GAGCAAATTC
TACAGATGGA GCACAGCCAG CTTAGCGAAG TTGCCCGAGC CACTATGAGC CAACTCTCGC
GCCAAGCTCA AGCGCAAGCG CAAGCGCAAG CAGCTCAAGC TCAAGGTCAA AACCAGTCTC
AGTTAGGACA AGCGCCCGGT CAGCAAGGGT CGCAAGGAAC TCTAAATGCT CAACAAATGG
CCCGCTTTCT GCAACAACAG CAACAGCAAA GTCAACAGCA ACAACAGAAC CAGCAACAAG
CCTTGCGAGG CTCACAGGCT AACCCCATAG AACTTGTCAC TCCAACCATG ACCAACACCC
AGCTTCCGCA TTTCCCTACC CCTAGCCAGA CTTCAGCAAC GATGGCCAAT TCCGCCTCTC
AACATCCTCA GCCTAACCAA CTTCAGGGCA ACCAGTCCTC CCAACCTCCA CAGGCGAACC
CTAACTCTAC TCCTCAATTG TCTCAAGCTA CGACACAGTC CAACGCAGGA CCGGGAGGTC
CGGATATCAA CGACATCCCG GAGGAGAGTT TCTACGGTTA TTTGAGACAG ATGATGGCTA
AGAACGGCAT AACCAGTGGG ATACCGACCA TCGAAGGACG ACAGGTCAAC CTGTACAAGT
TGTTCCAGAT GGTAATCAAG AATAACGGAT CTAGCCATGT AAGTCCACGA GGTCATCGAA
AGACATCAGG TTGATAATCA AAGCAGATTG AGCCTATGCG CTGGACGTTC GTTGCTGGCC
AACTTGGGTT TGCCACTGAG CCCACTCAAC CTGGTCAACC ACCTGTTTCC AACATGCAGG
TAGCTCAACA AGTGCGCCAC ATCTACATCA GTCTTCTTCA ACCAATGGAA GATGCCTATA
TGACGATGAA GCGAAACAAA ATGGCTCAAG CAAATATGAG GAGGGCAGGT ATGCCACCTG
GTCAAGGATC AACCCCCGGA CAGGCGCCGG GTCAAGTGAG ACCTCCGATG AACGTCGCTG
GTCCTAGTGT CACACCAACG CTGGCATCCC AAAATCAAAA TCAAGGCCAG CCTTTATCAG
AGCAGCAAAG AAGGTTTTTG GAAGCGGCCA AGAACGCTGG TGCTGGAAAT GTAGGTACCG
GTGTTGGGCT TGGAGAGGCG TGGCAGGGTC AGGCGCAGCA AAATGCTGGC CAAGGAAGCC
AGTCTCAATC AATTCAGCCA CCAACTCAGC CTCAGACTCA ACAGCCTGCG GGCCACCAGA
CTGCAACACC GACTGCTCAA AGACTGGAGC TTTCCGCATT GAAGATCTTA GAGTTCATCA
GGATGCAGGA ATCGCTCATT CGGAACGGGC TTGGTGAGTG AACTCAGGCC ACGATCAGCG
TGTCTGTTGC TGATTTATTG ACAGAAAAAC AACCTAACAC GGACGTCAAT CCCGACATTT
ACCGAAACGA GCTTCGCACT ATTCTTCCGA TTGCAAAAGA AGCAGAATCA AGATTGCCCA
TCTACTTATT GATGATTTCG GATGGCGGGT CTGTTGAACC TCAGTCTGTG TTAACGATTA
TCAACATGGT AGGTACTTGC GATGATTGAA GATGTCAATA GCTGACAGAA AACAGTGTAC
CACGCCTTCC TACGCCGCCC TCCTCGCTGA ACGTAATCGT TTCATCTTCA ATCTCGCCGA
TTTCCCTCGT CTTCGTGCCG GTCTTAGCCA GTTTCTCTCG AAAGCACAAG CTACCTATCG
CACAATGGCT GAAAAGCCTG TTGGTCAGGC GAAGCTCAAG AGCATGGTAT CAGCTGTTCA
GATGGCAAAG GGATTGATAG CTCAAAAAGA TCGAGAGAAG CAGGCCCAAA CCACACCCAA
TGCCGCTCCA GCGAATGTGG CTGCTGCGGG GTCAGGTCTG GGCCTCAACC TTGGCAGTAC
TCAAGGGGCA ACTGGCATTG GACTTGACGT TCAACAAATG GTCCAACAGG CTCACAATCA
CGCCGCCTCT CAGCTACAGT CTAGTCAAAA CCAACAACAA CAACAGCCGC CCGCTCAACC
TCAAACTCTC CTCCCACCGC CTGCCCTTCC TAAAGACAGT CCTTCGAATC TCCAAGAGGC
TATCCGACAC AAGGGCCTGC GAGTCGAAGA CCTCAAACCG CCACCGTCCA AACGTCAAAA
GAGCAAGGGT TCTCCCGCCA CACCCGCAAA TGCTCAAGTA CCAACACCAG AGTCTGCAAA
AACGCCGGCT AACACTGCCG TTATTGGAAC CCAAGGGACA CTAGGGGAAA GTCCAAAAGA
AAAGAAGGCC ACGAAGCGGA AGAGACAATC ATCAACTGCC ACAGCAGCTG GCACCGAAAA
GCCAGTCAAG CAGACGAAGG CAGAGGCTGC CAAGGCCAAG AAAGCTGCTG TTGCGGCCGC
TGCGGCAGTC GCTATTCCCG CTACGGTACC AGAGGATCCT GTTAATGCTC TTGGTATCCA
GCTTGCGGCA GATGAGGCTG CTACCAAAGC AGAGATTGCT CAGCACAAAG CTTTCTTTGA
TGACCAGAGG GCTTTGGCTA GCGCTGCAGC CCCTGGGGAT GGTGAGAAAA AGGTTGGAGG
GGAAGATGCG ATGGCGGTCT TCACAAAGAT ATTTGAGGCG CATCAGGCAG GTATTCAGGC
GGATATGGTC CGAACAGATC CTGTCGCCCA GTCAGCATCC ACGGCAGCGG GCCACATGCC
CCCGGTAACC AGCGCCGGTG ACCCCAACGA CCAAGATTTA TTCGATACCT ACTTTGATGT
GACTTTGTTC AGCCTCGCGG CCGATCTTCC TACACCCGAT CTTGTCATCG AAGCAACCCC
TCAGGAAGTG GATGGCGAGA GTCCAGAAAG TGTAAGGACA GTTGGAAGTA CCGCTGGACA
CACAGTGCCT GGCAAAGAAG TTAAGTCAGG TTCTAAAGAA GAGAAAGGGA AGGAGGAGGG
GACAAAAATA ATATTAGGAA GCCCTGGAAG TATGGCGTAT AACGGAGGCA TCGAATGGTG
A
 
Protein sequence
MSNNNQEQNP FAGFNMAQLN QLNPALFASL ANQMGMNNQA GPSQPQNQAQ QPAMGNLRNI 
PPQLLQQLQQ QQQQRQQSGQ PPSQASINDT LQNIMLPTFN RLQSQNSQQM QQERLQTSNN
PAIQAQVAAQ QAKARQAAQA VQAAQAQAHA QAQAQAQAQA QTQQQNLGIG VQGMTGMQGM
GGLPGMGGIG GFGGMGQNQT QQPNQPNQVQ RNQNQFSVHS HQPDLGLNQS SQPQSSNTNI
QQGNIGQGQS GQFNLGMNMG MIGDDSSTQN QNQNQPPPPQ PAPSNPTPQI SPEVHEFMRT
RSDIASAVFK KHGNPQAALE DLQRIVGMLK GQNQAQNQPQ AQTGNNMGAA GVPVGINGQM
FNQRIASGQT QSAQNQGQSQ QAGHQRIPSS GNFDLPGSQF ANTLSQLEAL QQVKERQQRG
GLKTPQMPQS NTLPMTSPQM GNNANNLNPP FNLTLPNQPN NAQNPVQNRQ TPRMSNNIPT
LPSGPAAPRP PQGTNPVQSG ARPLPNQVIA TWPLDKLIGA SSNLSKKIIE SEAAHGNIIQ
PGKPSTMGLG VPGRTAAEQA AKFQLLVMIA EIKRRAVPVP DDALNVAASL LPVANAKEQI
LQMEHSQLSE VARATMSQLS RQAQAQAQAQ AAQAQGQNQS QLGQAPGQQG SQGTLNAQQM
ARFLQQQQQQ SQQQQQNQQQ ALRGSQANPI ELVTPTMTNT QLPHFPTPSQ TSATMANSAS
QHPQPNQLQG NQSSQPPQAN PNSTPQLSQA TTQSNAGPGG PDINDIPEES FYGYLRQMMA
KNGITSGIPT IEGRQVNLYK LFQMVIKNNG SSHIEPMRWT FVAGQLGFAT EPTQPGQPPV
SNMQVAQQVR HIYISLLQPM EDAYMTMKRN KMAQANMRRA GMPPGQGSTP GQAPGQVRPP
MNVAGPSVTP TLASQNQNQG QPLSEQQRRF LEAAKNAGAG NVGTGVGLGE AWQGQAQQNA
GQGSQSQSIQ PPTQPQTQQP AGHQTATPTA QRLELSALKI LEFIRMQESL IRNGLEKQPN
TDVNPDIYRN ELRTILPIAK EAESRLPIYL LMISDGGSVE PQSVLTIINM CTTPSYAALL
AERNRFIFNL ADFPRLRAGL SQFLSKAQAT YRTMAEKPVG QAKLKSMVSA VQMAKGLIAQ
KDREKQAQTT PNAAPANVAA AGSGLGLNLG STQGATGIGL DVQQMVQQAH NHAASQLQSS
QNQQQQQPPA QPQTLLPPPA LPKDSPSNLQ EAIRHKGLRV EDLKPPPSKR QKSKGSPATP
ANAQVPTPES AKTPANTAVI GTQGTLGESP KEKKATKRKR QSSTATAAGT EKPVKQTKAE
AAKAKKAAVA AAAAVAIPAT VPEDPVNALG IQLAADEAAT KAEIAQHKAF FDDQRALASA
AAPGDGEKKV GGEDAMAVFT KIFEAHQAGI QADMVRTDPV AQSASTAAGH MPPVTSAGDP
NDQDLFDTYF DVTLFSLAAD LPTPDLVIEA TPQEVDGESP ESVRTVGSTA GHTVPGKEVK
SGSKEEKGKE EGTKIILGSP GSMAYNGGIE W