Gene CNK00140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK00140 
Symbol 
ID3254466 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp40603 
End bp44073 
Gene Length3471 bp 
Protein Length883 aa 
Translation table 
GC content50% 
IMG OID638253508 
Producthypothetical protein 
Protein accessionXP_567586 
Protein GI58260352 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.518899 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTATTAAAAG GAAAATAATC ACCTGTCTTA TAATAGTAAG TTGAGCACTT GCACTCGCTC 
GCACCATTTG GTTCCCAGTT CATGATCGAG CTCCCACTTT CCAACCTATG AAAAAAGAAA
AAACCACAGC TGATACCAAC CAATCCAGAG AAACAAAATA GCAATCGGGT TTCGTCTCCC
CACGAATCAC CACAGACAAA AAAGAAGGAA AGAAAAAGGA TGTTCGAGTC CCACCTAAAC
TTAAACCCAC ACGGCGCGAT TGGCCAGCCT TCACCTTCTA TAGGTGTAAT CGGCGAACAA
CCTCAAGCTA TTGATAATCT CTCGCCCGCG CCCGCGCGAC CGGCTGCACA CCCTCGGCAG
GTGGCAGCGG CCAGTCTATC TCAAGGACAA TTAGGAGGTC GACCTAGGAC GCCGATCAAT
CGTGATTGGG ATAGGGATAC GGATACGGAT AGGGATAGGG ATGTGGTGAT GGGTGAAAGG
AGAGAGCAAG CGGATGTAGA GTCGGATTTG ACGAATTTGT TGGTGCAAAC GCGATTACAT
CGGACGGGGC TCTTTTCCGA TCGGGAGGTA CACTCTCAGC CTCATTCCCA TTCCCAGTCC
CAATCGCAAT CGCAACCACA ATCAGAAGCA CCCTCCCAAC GCCAACGTCA ATACCAATCC
CAATCCCGAT CCCAACCCCA ATCACAATCA CCATCACAAG GAAAGGATGA TGTAACAGGA
GGCATTGACG ATTTTCGCAA ACGTTGGGAA AGAGAGAGCC GGGAAGGTGG AAAGAGGTTT
GATGAGGAAA CTTGGGATCG GAGTCTCTTG GGCATGGGCA TGGGTATGGG CACGGTGCAG
AGGCAGGGAA GAGGGTCGCC TGCTCCTGGA GGCTTTGGTA TAGGTCTGGG TTCGGGTTTG
GGTTTAGGTG GAAAAGCGAT GGGTATAGGA GGGACTGTGG GCGGGAGCAC GCAGTCCAAG
ATGACTACAG GGACGTTTGC TGGTTATCAA CCTTTTATGG GCAAGATCTC GCCCGTCTCT
TCTGCTCAGT ACCGACCTTC TCTCTCTCCC AGTAATCATC ATCTCCCATC TTCGACATAT
GGGAATCAAG CACCGTCCCA ACCTCAATCG TATACCCAAC CTCAGACTCC AAGCCATACA
GCGGCTCCCA ATACTTTGGT GGCCGGGGCC GGGACCGGCG GGCAAAACGA CACGACTGCA
ACCCAGATAA CTCGCCACCT CGAATCCCTA ACCATCCTCC TTAACCCATT GTTGGCCCAG
GCCGATGAAG TAGAAAGGCT CCGTAAGGAA GTCGAGATGT GGAAATCCGA GTGGGCACGG
GCGGAGAGGG AGAGGAAGAG GTTGGAGGAA AAGGTGGGTG GGATGGGCAT GGAGTTGGAG
AAGGCAGTTG GGAAGAGTAG GGATGTGAGT TTTATTTTTA TTTTTTATTT TTCTCCTTAC
CGCTCCGCCT CTTATTCTTA CCTTCCCCTC ACCTTTCCTT TTTCACCGCG CCATCTCCAA
GCGTGTACAA TAAAATAGGA GTGAACTGAT GCGTGTCGAC TTTAGATTGC TGGACCATCG
TTCACTGCGG TATTGATCGA CGGAAACGGT CTTATAGTGA GTGCAAACCC TCCCAATCGA
AGCTTTACGC TTGTCTTCTT TACAAGATGC TAAACGTCAA GATTCTTAGT TCCAAGATCC
ATACCTTCAA GCAGGGTTCA AAGGCGGTCA GCTCGCAGCC CATCATCTCC TCTCTTCCAT
CCCCAACCTC GCACCTGGTT CACCCTCCTC GAAGACCACT CATACCGGCA TTATTGCGAA
AGAAGTCACT CTAGGCCTTG ATAGTCTTCC TGTTAATAAT AATAAGGGGG GAGATGACGA
TGATTATGGT GGGAAGAAGA CGGAAAAAGG AGGGAGAGAG ATGGGGAGTG TGGTGGTCCA
GATTTTCGTT AATAAACAAG GTCTCGGTGG GGCTCTTATC AAGGTAAGCA CGCCGAATAT
GGGATTGCTA TTCTGCGCAG TTCTGTACGC TGACAAGCAG GTATTTAGTC TGGTATTGTA
CCCTCGTGGA ATGTATACGA CCAGTTCTGG CAGGGTTTAT CTTCCTCACA CGAACTTTTC
ACAGGTGTGT TGTTCACTTT TCCCTTTCAT GTTACTTTGT TCAAACCGTA CTTTTACTGA
AGTGAATTTT TTGGGAGATA GTATGTGACG TAGGGCAAGG TAAAGAAGCG TCCGATGCCA
AGATCAGGGA ATACCTCAAC TTATACGCAA GTAATGCGCA ATGCCGGTCT ATCATCCTTG
GTGCTTCGCA TGATAATGGG TACGCCAACG TACTTTCCTC GTACGTCCAG ATTCAAATTT
ACCGCCGTTG AGGCGCTAAC GCGTCCATGC CAATCGCCAT TCATATCTCA GATTGCAAAC
AGGATCACGT CTCTCTAATG TCGTCCTTCT CAAGGGTTAC GCTACGCTCG CGCCGCAGCT
CAAGACATAC TCTAGCCGTG TCGTCTCTAT CCCTGATTTA TTCAGGCTCG AAAAAGTACC
ACCCCCTCTA CCGTCTTTTA CTTCTTCTAC TACTGCCGAC GTTGTCTCGT CTATCCAAGC
GGGAGGTTCG CCCGACCTCC TTTCCGCGGT CACGGGTCTC GCGGGCGTAT CCTTCTCCTC
CATTGTTGCT AGTAGCCCAA AGGACAAAGA AGCCGATTTC GCGGGACCTT ATGGCGCCAA
CGTCCGTGAT AACACCAACA CTGCAACATC AGCACGCAAT ATCAGCTCGG CGGATATAGG
TAAAGCGAGG ATAAGCACAC CGAAAAGGGA CGAGTATGAA GAGAGTGAAG AGGAGATTGA
GGAATTCGAG TACGAATGGG GGAGTGGTGC TCAGTTTAAG GGAAGGTCGG ATCTGGCCTC
AGGGTTCGCT CCAGGTAGCG CGAAGAAGAA GAAAATCCCA GCTCCTTTTT CCCGCGAGGC
AAATTTTTCT CGTGTAGGAG GAAAAGATAG GGAGATGGAT GATGAATGGA CAGAAATGGC
GCCTAAGAAG AAGGTTAAAG GCAAAAGGAA GGAGGCGGCA GAGTATGTGC GGACTTTGAA
ACCTCGACCT TGCCATACGT GAGTCATACA TTCCTTTCTT ACATTCTTTG ATAATACGAT
AAGGGCTGAT GAGGTACGGG CGTAAGATTT TATTTGGGCC CGCGGGGGTG TAAGAATGGG
GACGACTGTC AATATGGCCA TGAGTATAAG CTCAATGCCG CCCAGCTCGA CGAACTCGCC
CGCCTGGCAA AGTGCATCAT GTGCCCATAC GTCAAGGATG GACGATGTCG ATACTCGGAT
GATGATTGCG TCTATGGACA TCAATGTCCC AACCCTGATA AATGTGTCTT GTACGTTTCA
CTGTTCTGTC TCCCAGGCGG GAGTAAGGGG CCGTTCACTG ACAGCTGGAT ATTTAGCGGC
GAAACTTGTA GATTTTACGA GTTGCCCAAC GGACATGGCG AATTGAATTA A
 
Protein sequence
MFESHLNLNP HGAIGQPSPS IGVIGEQPQA IDNLSPAPAR PAAHPRQVAA ASLSQGQLGG 
RPRTPINRDW DRDTDTDRDR DVVMGERREQ ADVESDLTNL LVQTRLHRTG LFSDREVHSQ
PHSHSQSQSQ SQPQSEAPSQ RQRQYQSQSR SQPQSQSPSQ GKDDVTGGID DFRKRWERES
REGGKRFDEE TWDRSLLGMG MGMGTVQRQG RGSPAPGGFG IGLGSGLGLG GKAMGIGGTV
GGSTQSKMTT GTFAGYQPFM GKISPVSSAQ YRPSLSPSNH HLPSSTYGNQ APSQPQSYTQ
PQTPSHTAAP NTLVAGAGTG GQNDTTATQI TRHLESLTIL LNPLLAQADE VERLRKEVEM
WKSEWARAER ERKRLEEKVG GMGMELEKAV GKSRDIAGPS FTAVLIDGNG LIFQDPYLQA
GFKGGQLAAH HLLSSIPNLA PGSPSSKTTH TGIIAKEVTL GLDSLPVNNN KGGDDDDYGG
KKTEKGGREM GSVVVQIFVN KQGLGGALIK VSTPNMGLLF CAVLYADKQV FSLVLYPRGM
YTTSSGRVYL PHTNFSQVKK RPMPRSGNTS TYTQVMRNAG LSSLVLRMIM GSRLSNVVLL
KGYATLAPQL KTYSSRVVSI PDLFRLEKVP PPLPSFTSST TADVVSSIQA GGSPDLLSAV
TGLAGVSFSS IVASSPKDKE ADFAGPYGAN VRDNTNTATS ARNISSADIG KARISTPKRD
EYEESEEEIE EFEYEWGSGA QFKGRSDLAS GFAPGSAKKK KIPAPFSREA NFSRVGGKDR
EMDDEWTEMA PKKKVKGKRK EAAEYVRTLK PRPCHTADEL NAAQLDELAR LAKCIMCPYV
KDGRCRYSDD DCVYGHQCPN PDKCVFGETC RFYELPNGHG ELN