Gene CNN00100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN00100 
Symbol 
ID3255401 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp33927 
End bp36072 
Gene Length2146 bp 
Protein Length701 aa 
Translation table 
GC content54% 
IMG OID638254426 
Producthypothetical protein 
Protein accessionXP_568519 
Protein GI58262218 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.519073 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGCAG ATTATCCTAT GTATTCTCAC CCACCTCCTC CGCCGGTTCA TGGACACGAG 
CCAACACATC CGTACGCCAT GCCTCCACCA CAATATCAAA ATTACTATCG AGACTATTAC
CAGCCGCCAC CGCAACGCTT TCAGCGCGAA CACGAGTATA CTCATGGCCA TGGCCAGTTA
CCGCCACCTC ACCCTCATCA TCCTTATTCA CATCCGTATT CACACCCAAC CGGCCCTCAA
CCGCACATGA TGTCCTACGC AGCACCTCCT TCCTGGCGTC CTGAGCCTTC TTCACCAGTC
CGTCATCAGT CGATTCATCC TCCTGCGACC AATCGTGCAT CTGTGCTTGA ACAGTCTACA
TCTCGCGCGG AAGAGAAACT AAAAGCCGAG AGTTCACATG CAGCCGCAGC GGAAGATGAC
AACCCGCCTC CTAAAAAACG AGGTCGTCCG CGCAAGCATC CACTTCCCGT CATTCCGCCC
GACATGAAGA AGGATGACCG GTCGTCTGCA TCGAAAGAAG GAGTCAAAAA GCGTTCAAGG
GGCAGCGAGG GGCATTTGAA TGCCGATCAG ATGGGCTTGG AGGCGGGAAG TGATGAATTG
GAAGCTGGTT TGACTTTGGC AGGGTTGAAA AGAAGGGAGA GTGCGCCTGC CCCGGTTCCA
GATGCGACTC CAGAATCCGC CAAGAAAGTG AAGCAAGAGA ACGACGATGG AAAGGATGAG
AAGAAGGAAA AGGAATCTCA GGTCAAAAAA AGGGGTGAAG GGCTGAAGAA GAGTTGTGCA
GAATGTAGAC GATTGAAAGC AAAGTGCGAT CGGGTGTTTC CATGTTCCAA CTGTAAGTCT
TCTTGTCAGG GAGCGACGCT GACATTTCAT AGGCAGACGG AGAGGGTGTG CATTGGTTTG
TCCGGACGGT GATCTCAGTT GCATGAACGG AAAGCGACTT GTTCTCGCCT CGACAAAGCA
ATTGCACGAA AGGATTCAGC AGCTCGAAGC AGCTCTCTTA CAAGCCCACC GCTCGACGAC
TTCCACCACC CACCCGCTTT TGGCGCCAGA GTACCTCGAT GGGGGCTTTG CGTCTCTGCC
CAGTGCGAAC GCTGATAGCA AAGGTGAAGA GACGAAATCA CCCAAATCAC CAAAGTCCCC
ACTTCCCGAA GGATCACCGA TACATGCCAT GTCTACGCCC AGCTTTACGG TCGCGACGCC
GCTTTCGTCT GCAGCGGCCA ACCTTCCTCC TTCTCGCCGC ATTGCCGTTC AGTCCCTTTT
GACGGAAGCT TCTTCTGCGC CTGAAGGAAA GCGCGAGGAT GAATGGGCGG GAGAGAATGC
GGCTCCGGCG ATGATCATCG GCACGGGCCA TGCCCACTCT CACCCCCCTT TTTCCCCTGA
AGATCTCAAT CAGCGACGCC TTGTGTTTGA GCGGCTAAAG CGGATTATCA AAGTTCTTCC
GTCTAGCGAG GTGACGGCAC AGAAGGCAGA GAGGTTCTGG AAGACGTCAC AATGGTACCA
GACCATTCTC CAACGAGAAG AATTTGAGAA TGTCTACTTT CCTGCCGTCT ATTCGCCCAC
GCCTGCGAAT CCGCTTTCAC CACACAAACT TGCTGTCGTA CTCATGGTCC TCACGCTTGA
AGCTTACCTT GACCTCGAGC AAGACGAAGA CGCGCCTCTT GTCGCCACGT ACTGGGACGC
GGTCCAAGCA TGCTTTGACA CCCGTTTCGG CTGGGCGGCG AGCATAGCGG GTACGCAAGC
GTTGGCGCTC TGTACGTTGT TTGTCGGTTT TGGGTGGCGC GGTACAAAGG CGAGCAACTT
TTACTGGTTA CGGCAGATGA CTTCATCTGC TCTCCAGCTG GGACTGCATC GCGATCCGCA
CCCGTCTTTC CCGGCGGAAG AGCGAGAGTT TAGGCGTCGG GTGTGGCACG AGGTGTACGT
CCTTGACTGT TTAATCTGTC TCAATCATGG GCAAAGGGCA TCGATTCCGG TGGAGTATAT
CGAGACGGCG TACCCCAAGG GGGTGCCTCC GCTGGCGTAT AAAAAGTACG ATTTCATCAG
ACTGGTGAAG AGTCGGGTGA TTGAGGTGGG ATGTTTGCCA GATAGTGCGC CGGCGACGTG
GGATAGGGTG GAAGATGTGA AACGGTTGTT GATGCAGTTT GAGTAG
 
Protein sequence
MPADYPMYSH PPPPPVHGHE PTHPYAMPPP QYQNYYRDYY QPPPQRFQRE HEYTHGHGQL 
PPPHPHHPYS HPYSHPTGPQ PHMMSYAAPP SWRPEPSSPV RHQSIHPPAT NRASVLEQST
SRAEEKLKAE SSHAAAAEDD NPPPKKRGRP RKHPLPVIPP DMKKDDRSSA SKEGVKKRSR
GSEGHLNADQ MGLEAGSDEL EAGLTLAGLK RRESAPAPVP DATPESAKKV KQENDDGKDE
KKEKESQVKK RGEGLKKSCA ECRRLKAKCD RVFPCSNCRR RGCALVCPDG DLSCMNGKRL
VLASTKQLHE RIQQLEAALL QAHRSTTSTT HPLLAPEYLD GGFASLPSAN ADSKGEETKS
PKSPKSPLPE GSPIHAMSTP SFTVATPLSS AAANLPPSRR IAVQSLLTEA SSAPEGKRED
EWAGENAAPA MIIGTGHAHS HPPFSPEDLN QRRLVFERLK RIIKVLPSSE VTAQKAERFW
KTSQWYQTIL QREEFENVYF PAVYSPTPAN PLSPHKLAVV LMVLTLEAYL DLEQDEDAPL
VATYWDAVQA CFDTRFGWAA SIAGTQALAL CTLFVGFGWR GTKASNFYWL RQMTSSALQL
GLHRDPHPSF PAEEREFRRR VWHEVYVLDC LICLNHGQRA SIPVEYIETA YPKGVPPLAY
KKYDFIRLVK SRVIEVGCLP DSAPATWDRV EDVKRLLMQF E