Gene CNI03210 details


Gene Information

Locus tag: CNI03210
Symbol:
ID: 3259533
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 871322
End bp: 874555
Gene Length: 3234 bp
Protein Length: 991 aa
Translation table:
GC content: 50%
IMG OID: 638258813
Product: hypothetical protein
Protein accession: XP_572927
Protein GI: 58271542
COG category:
COG ID:
TIGRFAM ID:
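
The start, end, and length values in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a coding region that ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 871322
END_BP = 874555
PROTEIN_LENGTH_AA = 991


def span_length(start: int, end: int) -> int:
    """Feature length under 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def min_coding_bp(protein_length: int) -> int:
    """Bases needed to encode the protein plus one stop codon (an assumption)."""
    return (protein_length + 1) * 3


gene_length = span_length(START_BP, END_BP)
coding_bp = min_coding_bp(PROTEIN_LENGTH_AA)
# Whatever is left over is not translated (for example introns or flanking bases).
noncoding_bp = gene_length - coding_bp
print(gene_length, coding_bp, noncoding_bp)
```

Under these assumptions the span matches the listed gene length, and the coding requirement is shorter than the gene, which is what one would expect for a gene the record marks as spliced.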


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.353593
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTCTTATTCA ATCGTCTAGA CCCAGACACG ATGTCAGTGC CCTACGAGGC CCATAATTCT 
CCACCACCGC CGCCCCCGAA GCATGCCTCA TCTTCGTCTT CCGGCAAAAA GGCAAAGCAA
AAACGACCAT CTTCCTCGAA TGGCACTCCC GATCAAAGCT CGGCAACGGC GGTAAGGGCT
TTCTCGGCTT GCCGGAACTG CCGGAACAAG AAGGTGAAAT GCCTTCCTGG ACCACCCCTT
CCCAGTTCGA GTATCACTTT GCCAGCATCT CCGACTTCTC TGGAAAATCC CGGGGCGAAC
CTAGGACCTT GTCAACAATG TCTTCAATCT GGTGCCGAGT GCATTTATCC CCCGACTAGA
GATAGAGCTG CGTACAGCAG ACAATATGTG GCGAATTTGG AAACAAGGGT ACAAAGTCTT
GAGATGGTGC AGGCGAGATT GATGCCTTTA CTCGAAACTT TCGAGGCGAG TACGCATAGC
GGAAAGCCGA TGCCTATGCC GCTTCCTCCG GTACCGGCCA GAAGGACTGA AACACCAGCT
CAAGAGATGA ATGAAGACGT CGAAGAAGGA GAGGATGAGA TTCCCGAGGA TAGCGCGATG
CAACCTGCTT CAGACAGTGA AGATGCTGGT CAGATCACAC AAGATGATAG GGGAAACTAT
CGATGGATAG GTTCGTCGAA TACACTTTCT CTTCTTGACT CCTTCTCTGG TCGGCAATTA
GCAAGTGGTC GACCATTGCC TTCTCGTACG CAATCATCTA CAACCCAAAT GGATGCAGCC
ATTTCCAGGG ATCGCACACC GTCAAACATC GCGTCTACTC CTACTCGAGA ATCAAATCCT
TATTTTGGTC CAGTGGCTGG TTCAGGTGTC GTCAAAGCCT TACCTCCTGT TGATGAGGTG
CAGTATCCTT CTGCAGAGAA GTCACTGGAA ATGGTCGATG CTTTCTTTCA AGAGGTGCAT
CCTTGCTTAC CTGTCCTTTT GGAACACGAA TTTAGAAGAG ATTTCCGGGC GCTGATGGAG
GCAAGGGCCA GAGGTAATCT CTCGTGGGGC GGTGGAGTGA GTCATCCGCT TGGAGCCAAA
CAGTCGCCAC GTTGACGGTG TGTCTAGTTT ATTTCAGTCG TGTTTGCCAT ATTTGCCCTG
GGTGAAAGAG TAATTGTCAC ATCAAGAGCG TGGAGGAGAG AAATGGCTAA GGCTGAAGGT
GATGATGATG ATCATGAGAC TGTCTTGCCT GGTGAGGCAG AGGCCGGTGT AATCTGGTAT
GAGAGGTAAG TGCTGCTAAG CTTGCGGAGA GATGCGGGAT AATAATGCTT CTGCTTCAGA
GCTCAAATCT TACATTACAC CACTTTAAAA GACGTCAACA TCCACCAGGT CCAATGCCTT
ACTCTTCTCG CTGCATTCCA AGCAAGTGTA AATGCTATGC CCATGTCATG GCTTCTTGCC
GGACAGGCTA TCCGTGTAGC TCAAGACTTG GGTTTACATC GATCAACCGC CCGGCTTCCC
TTATCGTTTG CAGAAAAACA GCTGCGTTCA CGATGTTGGT GGGCCATCTA CGGTTTGGAG
AGGATGATGT CAATTTCTCT CGGCCGGCCG CTGGGTGTGG ATGATCTTGA TGTGGATGTA
GCATACCCGC TGGAGGTTGA CGATGCCGTG CTGGAGAAGA TGGCGATGGA GAACCTGCAA
GCTTTACCGC CTGAATTCGA GAAAGAGCCC GAGGCTTCGA CAATGAGTGG GTTCATCGCG
CTCACAAAGC TCTGCAAGAT TGCTGGGCGA GTTGTGCATC TGCTCTATCG GCCTTCAAAT
GGAAGGTCGG TGAGTGATCC TTCGTGGGCG GTACAGCAGC AGAATGCGAT CAATAAATTA
GACAAGTTAC TTAGAGATTG GTTAGCAAAC GACGTGGTAA GTTGTCACGT GCACTGTATT
CATGGGTAGC TGACTAGTAA TAGCCTTCAA AATACAAAGA TCCTTCAGAA ACGCATTCGG
TATCCCTTCT TTCCGCCATT TTATCCAACT CTTACTTTAC TGTTCTCGTC ACCCTTCACC
GAAACTTCTT GCCCTCATCA CCCGATTATC CTCGACCTAA ACCTCCTCCC TCTTCTCAGT
CGCTCGCTCA TTGCGTCGAC GCTGCCAGAT CTGTCATCCA CATCGCCTCT CAATCTCGCA
CCCTCGTACC ACCCTCTCAT CATCTTGCAA TGTATTGTCA ATACTTGTGG TCGTCAGCAG
TCATCCTGCT GTTATGTGAG ATCCAAGCGA GAGATGAAGT GGTCATCGAA GCGGTCGGTT
CGCAGGTGGA GGCTTGTAGG AAGTGCTTAC AGGCTTTGGA ACCTGTTTGG CCTGGGTCAA
AAAAGTTGAA GGAATTGTTA AACGATGTGG CCAGTCGTGC GAAAGAAGTG ATGGTTTCAA
AGTCATCAGA CAAAAAGCGC AAGTCATCTG CGCATAAGGA CAAGGACAGA GAGAGGCAGA
TGCTACATCC TTCCCAAAAC CATCAGTCTC GACCGTCAAC CGACTCGCCT ATCCCGCAGC
ACGCTGAGAA TCAGTGGCCG TCTCACTCTG TTTCTCCACC AGAGAAGAGA CAACGTGTGT
TTGAACTTTC TGACACTCGC ACAGCTTCGA ATGAGGATGA ACCGGCTCAA AACCCCCAAG
CATATTATTC AGTGTATCCT ATGACCACGC CGATAACGTC TCAGGCTTCA CTCCAGTTTA
CTGAGCCAAT GCCTACTTAT GATATGGTAT TCGACCTTGG AGGAGTCACC TTTGACGGAT
TAGAGTTGTT GCAAGGCTTT AGCGGGGGCG CTTCCAATTT TTGGAATAAC TTCAACTTTG
GCATGGATGG AGCTGGCAAT GGTAGCGCTT CTGGAGGCTC TGCGCCAGTG GCAAGAACCG
GGCAGCAATT CCTGCCCTCT GGTCAATTGA CACCCAACTC TAATGGTGAC GGTTCGAGGC
CTTCGTCATC CAGCTGGCAA GGGCAGCTAT CACGGATGGT CAGCCAGAAT GGGCAGCAGT
ATGTGCAAGG ACAGGGACAG GATGGATATA ACGGCGCAGG GAGTGGAAGC GGTGGCGTTG
GGACGCCTAA TGCGCATCGG GGAGCAACAT TCTGGGAGCA AGTGACTGGG AGTACATTCG
ACTGGCAGGC AGACCCAAAT GTGCCTTTCA ACATCTAGCT TCGAATTCAT TTACTGCATC
CGTATCTTCA AATCATACTT ATTGTATTTT AGATTTGTAT AATTTGTGAT TCTG
 
Protein sequence
MSVPYEAHNS PPPPPPKHAS SSSSGKKAKQ KRPSSSNGTP DQSSATAVRA FSACRNCRNK 
KVKCLPGPPL PSSSITLPAS PTSLENPGAN LGPCQQCLQS GAECIYPPTR DRAAYSRQYV
ANLETRVQSL EMVQARLMPL LETFEASTHS GKPMPMPLPP VPARRTETPA QEMNEDVEEG
EDEIPEDSAM QPASDSEDAG QITQDDRGNY RWIGSSNTLS LLDSFSGRQL ASGRPLPSRT
QSSTTQMDAA ISRDRTPSNI ASTPTRESNP YFGPVAGSGV VKALPPVDEV QYPSAEKSLE
MVDAFFQEVH PCLPVLLEHE FRRDFRALME ARARGNLSWG GGFISVVFAI FALGERVIVT
SRAWRREMAK AEGDDDDHET VLPGEAEAGV IWYERAQILH YTTLKDVNIH QVQCLTLLAA
FQASVNAMPM SWLLAGQAIR VAQDLGLHRS TARLPLSFAE KQLRSRCWWA IYGLERMMSI
SLGRPLGVDD LDVDVAYPLE VDDAVLEKMA MENLQALPPE FEKEPEASTM SGFIALTKLC
KIAGRVVHLL YRPSNGRSVS DPSWAVQQQN AINKLDKLLR DWLANDVPSK YKDPSETHSV
SLLSAILSNS YFTVLVTLHR NFLPSSPDYP RPKPPPSSQS LAHCVDAARS VIHIASQSRT
LVPPSHHLAM YCQYLWSSAV ILLLCEIQAR DEVVIEAVGS QVEACRKCLQ ALEPVWPGSK
KLKELLNDVA SRAKEVMVSK SSDKKRKSSA HKDKDRERQM LHPSQNHQSR PSTDSPIPQH
AENQWPSHSV SPPEKRQRVF ELSDTRTASN EDEPAQNPQA YYSVYPMTTP ITSQASLQFT
EPMPTYDMVF DLGGVTFDGL ELLQGFSGGA SNFWNNFNFG MDGAGNGSAS GGSAPVARTG
QQFLPSGQLT PNSNGDGSRP SSSSWQGQLS RMVSQNGQQY VQGQGQDGYN GAGSGSGGVG
TPNAHRGATF WEQVTGSTFD WQADPNVPFN I
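
Both sequences above are printed in blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of each sequence is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string without whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of each sequence, copied verbatim from the record.
first_dna_line = "CTCTTATTCA ATCGTCTAGA CCCAGACACG ATGTCAGTGC CCTACGAGGC CCATAATTCT"
first_protein_line = "MSVPYEAHNS PPPPPPKHAS SSSSGKKAKQ KRPSSSNGTP DQSSATAVRA FSACRNCRNK"

dna = clean_sequence(first_dna_line)
protein = clean_sequence(first_protein_line)
print(len(dna), len(protein))
print(round(gc_fraction(dna), 2))
```

Passing the full gene sequence to `clean_sequence` and comparing its length and GC fraction with the Gene Length and GC content fields is one way to confirm that the text was copied without loss.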