Gene CNI02050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNI02050 
Symbol 
ID3259691 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006694 
Strand
Start bp563659 
End bp567317 
Gene Length3659 bp 
Protein Length908 aa 
Translation table 
GC content49% 
IMG OID638258691 
Producthypothetical protein 
Protein accessionXP_572648 
Protein GI58270984 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.313076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCGCAACTG CATCTCTTCC TCCTTGTTGT TGCTCCATCC AGCCACTACT GTCTCCCAAT 
CCTGCTACAC TCAGTCCATA GTGGCAACTA CAGCTTTCAT GTTGAACCCG TCAGTGGCTG
CAACGTTTGC CGGGGGAAAC AACTACCACA GGGGTAACTC CGCTGCACCT CTCAACATGC
TCTCCCATCC GCCCTCCAGA CTTTCTCCGC CTCCAGGGGC GGATATATAC GGCGTGACAG
CATCTTCATC AAGTCAACAA CTGCCAGTGG CAGGTTCTTC CGCTACGTAT CCCACACAGC
TGCCTCAAAC AGCCGCACCG CTCACGGATA CGAAGAAGCA AGGCCAGAAC CTTCCTAAAC
GAGGTTACCG GGCATGTGTG AGTATCCAAA AGGCCCAAAT TCTTACAATC AAACTGACAT
GAGCCTATTG CTTGCTTCTG ATCAATCGGA ATCACCTGAA ATCGATAGAC CAACTGTCGA
CTCAGAAAAG CTCGTTGCGA CCGTGCGTGA CCCTTTCTTC CCAAAGTTGA ACGTGGCTGA
GACCAGGGTA GTTGGGGATG TAAACTCCCC GTCAGAACCT CCCTGCTCGC GTTGTCGAAG
GGAGCAGCGT GACTGTGTGT TTCTTCCTAG CGTGCGTCAA CATATCACAT CTTGCGACCT
TTCGACTGAC AGTGTGTTTG GTGCCGCCAG AAACGGCGTC GCAGAACATC TGTAGCTGAC
CCTGCTTTAC GGGAAGAATC TTTGGATCTT CCACAGACTG AAATCTATCC TGTCCAAGCA
GCAGCAAGGC CATCAAGTGC CAATGCACAT TCAGGCGGGG CTCAAGGGGC TCCCAGCGTG
TCTCAGTCTC TCAGTCAACA AGGTCCCCCT CTTGCCTCTG CCTCATCCGT CAATTCTTCT
TTCCCCCAGC TTCCTAAGCC GGCGGCCAAT AAGGATGATT CCTCCAGCAG GATGCCCTTC
ACTCAAACCG GAATCTGGAA CGAGACAGTG CAACAACCCA CCAGTACTGA AGATCAAGGT
AAACAACAGG TGTTTCCTTC TTCTGCTACA TATCCGACCA TGCATTCCGT GGGCTCAACT
CAACATCACA CTGACACCAC TACTCCAGGC TCATTGGGCG GATCATCTGC CGCTTCAACC
ACCGCTTTGT CACCACAGTA TCGCACCAAA AAACGGAAAA CTGAGCCCAG CAATCGAAAG
ATTGTCAACG CCAACTTGAG CAACGAGATG GATGCTTTGG AGATCCTGGC GAATGCTGCA
ACTGATGGGG ATGATGAGGG AGAAGATGGT AAAAGGAAAC AGCCACACGA TCCGAAGAAG
GTCACCTGGA ATGTTGGAGA GCAATACAAA GCGCCTATGA GGGAGTTGAG CGATTTTCAT
TTGATAAAAG CTGGCATTTT AGACGAACAT GGTCTTCATG CCATGGTAGA CAACTTCTTT
CGGTACTATC ACCCAGCACT TGTACGTGAC ATATAATGGG TCTTCGTATC ACCGCTGACT
TTTTGGCAGC CTATCTTCCA AACAGCCCGT ATCCCGAGGT GCAGGGAACA ACTACTCGAT
CTCGCTCACA ATGATAGTTT CCTTCTTACT TGTATAGTCG CTGTCGCGTC TAGACATCCG
TCTGATAATC GATACAAAGA TGTTCATGAC AAAACATGGG CAGTGATTAG AGACGCCATG
AGCGATTATT CTTTCACCGG TCTACCTGGA TCGGTTGGTT TTGTAGAGGG AGTGCTTCTT
CTTGCAGAAC ACCTCCCTAG AGAAAGGGGA ACACCCCCAC GAGGAACAAG TGTAGATATG
CTTGCCGGAC CTGGGACTGA GTCGGCTGGA GTGCATGGCA CAGACAACAG GAGAAGCTGG
TCTTTGATCG GCTTGGCCAT TCGTGCGGCA TATCTCCTGG GGTGTGAGTA TGATTAATAG
CATGACATGA CCCGCGCTAA CGTTTACGTC TAGTGGACCA AATTTCGTTG GAGATTGACG
AATCCAGTCG TACTCCTGAT GTAGAGCGAG CTAGAAGTGT TTGGACCTGG TGTTATTTAT
ATGATCGAAC GATAGATAAG TATTACTTCA CTTCATCAAA TAACCTCTGG CTAACATTCT
CGGCACGTCT TCGAACAGGT CTCGCTTTCT GGTCTAGAGG TCCTGCTCTG TGTTTTGTGG
GCTATTCTCA CATATCACAA ACGGGAGAGG TAGCTGCCAG ACTCAACTTC CCTCTGCAAT
TATCTCCTGG GGCAGAGCTA GATGGGAGAG CTCATGATGA CTCGGCATCG CTTATGCAAT
CCCTTGTGGA GCTCACTCAA GTGATGACCA ACGCCCATGA CATCCTGTAT CCAAGCAAGC
TTAGAACAGA AGTGTTGGTC AAACAGGGAG AGTACTTCTT GTTCTTGGAC CATTTCAGGC
GAGGTGCGTT CGGTCTATCG TTACTGGGGA CTGTAAATAT TAATTTTCTT TTTCAGCTCT
TGACAGCTAC CGTACTATCT GGAAACCCAA ACAATGGTCG AATCGTACTC TTGAAGAGTT
AAGCTGGATG ACATTTCATT ACGTTAGGTG AGTAAACCAA GTATAGTCTC GCGATACAAT
GATTCGGCTG TTGATGTTTA TCGCAGATTG TACATTTCTT CATTTGGATA TTCGGCACAT
ATAAAGAGAG CTCAATGGCG TGGAGATAAG GAGGCTTCGA CCGGTCGAGA TGGCTCTCGA
CAGGCAGTGC AGATATTTCC TCGAGGTATG TCTGACAGTC AAACCTTACT CGGATCAAAT
GCTGATAAAG CTCTTAGGCT CTGCAACGTC CCCCGATGCT CTTTACATCT ATGATTCCAT
TGCAGCTGCC AACGAGATTC TACATATTTC TCTGCGTCTT GCGCAAATGG GTAGTCTCCG
TTATCTTCCT TCCCGATATC TCATCAACAT ATCGTATGCC GCTGTCTTTG CGTTAAAGAG
CAGTTACTCT GGCGCTGTAG AAGAAAAGGA TATGCATAGG TAAGCCACAT CGCAGACTTA
CTGAAGATCG CGCTAAAGCA TCGCAGAATC AGAGAATTAG TCGACCATGT GTGTACGGGA
TTGGTGCTAT CCTGTCCTGA TAAGGACCAT CCGGCCGTTC GTTACGGACA AATGTTGAGA
ATGCTGGCTA AACGACTGGA AGAGCTCCAT GATGCGAGTG TAAGTCACAA TTAGTGCCTC
TTTAGACCTG TGTATTCACG TCCTTTTGTA GGCGGTTCCG TCTCGTTATC CCTCGCCTGA
GCCCGTCGAA ACGACTTCCA ACTCAAATGC ATCCCCCCAA GCAAGCCAAG ACCAACCATC
TGTTTTCCCT TGGCCAGCAC CTCCCAACGC TTCGACCGAG CATAGCGCCC CTACTCCAGC
TTTTCAACTT CCCCCCTTCC CCGACATGTC TTTTCTTACG AACAGCCAAG TTAATCCCGA
TTACCTGGGC GAAAATAATA CGTCGAATTC AGAGAAACAA GCAGCAATGT TTGACTATGA
GGACACCAAA TTTGATTTCG ATCTAAAGGG TTTCTGGGAT GACTTCTCGC TAGGCGAAGG
AAGCGGTTTT CCGTTTAGAT GATCACCTAA CTGACAGGCT GTTGACTCGA GTGTAACCCA
GTTAATGCCT TTTTGACTTT GTAAATATAG AATGACGTCT TGCATGTACA TATCTATGC
 
Protein sequence
MLNPSVAATF AGGNNYHRGN SAAPLNMLSH PPSRLSPPPG ADIYGVTASS SSQQLPVAGS 
SATYPTQLPQ TAAPLTDTKK QGQNLPKRGY RACTNCRLRK ARCDLGDVNS PSEPPCSRCR
REQRDCVFLP SKRRRRTSVA DPALREESLD LPQTEIYPVQ AAARPSSANA HSGGAQGAPS
VSQSLSQQGP PLASASSVNS SFPQLPKPAA NKDDSSSRMP FTQTGIWNET VQQPTSTEDQ
GSLGGSSAAS TTALSPQYRT KKRKTEPSNR KIVNANLSNE MDALEILANA ATDGDDEGED
GKRKQPHDPK KVTWNVGEQY KAPMRELSDF HLIKAGILDE HGLHAMVDNF FRYYHPALPI
FQTARIPRCR EQLLDLAHND SFLLTCIVAV ASRHPSDNRY KDVHDKTWAV IRDAMSDYSF
TGLPGSVGFV EGVLLLAEHL PRERGTPPRG TSVDMLAGPG TESAGVHGTD NRRSWSLIGL
AIRAAYLLGL DQISLEIDES SRTPDVERAR SVWTWCYLYD RTIDKYYFTS SNNLWLTFSA
RLRTGLAFWS RGPALCFVGY SHISQTGEVA ARLNFPLQLS PGAELDGRAH DDSASLMQSL
VELTQVMTNA HDILYPSKLR TEVLVKQGEY FLFLDHFRRG ALYISSFGYS AHIKRAQWRG
DKEASTGRDG SRQAVQIFPR GSATSPDALY IYDSIAAANE ILHISLRLAQ MGSLRYLPSR
YLINISYAAV FALKSSYSGA VEEKDMHRIR ELVDHVCTGL VLSCPDKDHP AVRYGQMLRM
LAKRLEELHD ASAVPSRYPS PEPVETTSNS NASPQASQDQ PSVFPWPAPP NASTEHSAPT
PAFQLPPFPD MSFLTNSQVN PDYLGENNTS NSEKQAAMFD YEDTKFDFDL KGFWDDFSLG
EGSGFPFR