Gene CNB00820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB00820 
Symbol 
ID3255888 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp234559 
End bp237545 
Gene Length2987 bp 
Protein Length754 aa 
Translation table 
GC content49% 
IMG OID638254734 
Producttranscription factor, putative 
Protein accessionXP_569090 
Protein GI58263360 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.140572 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CACGAAACAT CCTCCATCTT CTCTTCCCAT CTATCTTTTG CCCATATACA CCACAAAGCG 
GGGAACAAGC ATCTCACAGC CACATTCGGC GACACAGATT TACGTAGTCG CCTCTTTTCC
CACCCCCTTC TACCTCCAAG TCTCAACCCG AGACACGTTC CGCCATGGGC AAGAAGGTCA
TCGCCTCTGG TGGGGATAAT GGGCCCAATA CCATCTACAA GGTGCGCATC TACACCAATC
CTATCTGGAT GGCATCCTGG ATTGCTTTCA AAGCATTGCT AACTCGATGT CCCGTTCTAC
AGGCCACATA TAGGTGCGCA GCATTCCTTT AATTCGAGCG GCGCGCCGCC CGATTGTATG
TGTCAAGTTT GCTAACCTTG AACTCAATCT CTTTGATAGT GGAGTGTACG TACCATATCC
TTTGTTTACC TGCTTCTAAA CTGACAGTGT TTCCAGGCCC GTATACGAGA TGGTCTGTCG
CGATGTCGCG GTTATGCGCA GACGTTCAGA CGCGTACCTG AACGCGACTC AAATTCTGAA
AGTAGCCGGT TTCGACAAGC CTCAGCGAAC ACGAGTTTTG GAGAGAGAAG TGCAAAAGGG
AGAGCATGAA AAGGTTCAGG GTGGCTACGG AAAATATCAA GGTTAGTCTA CTTTTTTTCC
TTCAAATGCT CTGTTTTGTC TTGTCACCCA ATCGCGTAAG CTGACTCGAC GTAGGTACCT
GGATTCCTAT CGAGCGCGGT CTTGCTCTTG CCAAGCAATA TGGTGTTGAA GACATTCTCC
GACCCATCAT TGACTACGTT CCTACCTCTG TATCTCCTCC CCCTGCCCCT AAACACTCTG
TCGCGCCTCC TTCGAAAGCC CGCAGGGACA AGGAAAAAGA AACCGGTCGA ACCAAGGCTA
CTCCTTCACG AACCGGGCCA ACATCAGCAG CTGCTCTTCA AGCTCAAGCA CAACTTAATC
GTGCCAAGAT GCATGATTCC ACTCCCGACG CTGATGCTAG CTTCCGCTCT TTCGAGGAGA
GAGTCAGCTT AACGCCTGAA GATGATTCGA GCAGTGATAC ACCGAGCCCA GTCGCGAGTG
TTATGACTGA CCAGGACATG GAAGTCGATA AGATGGGGAT GCACATGAGC ATGCCCAACG
TGACACTGTC CCAAAATATG GAGGAACTGG GAGCTGGCTC AAGAAAACGT AGCGCCGCAA
TGATGATGGA AGATGAAGAC CAATTTGGCC AGCTCCGGTC CATCAGGGGT AATAGCGCTG
TACACACTCC TCACGGTACT CCTCGACATC TTGGTATCGG TATGCCCCCG GAACCAATCG
GCCCGGAGCA ATACACCGAT ATTATCCTTA ACTACTTCGT CTCTGAAACC TCGCAAATAC
CGTCTATCCT CGTCAGCCCT CCTCACGACT TTGATCCTAA TGCTCCCATT GACGATGACG
GCCACACCGC GCTTCACTGG GCTTGTGCCA TGGGTCGAGT ACGCGTTGTC AAGCTGCTTC
TCACTGCAGG CGCGTCAATC TTTGCTGGTA ATAATGCCGA ACAAACTCCT CTTATGCGCA
GCGTCATGTT TTCAAATAAC TATGACATGC GTAAATTCCC CGAGCTTTAC GAACTTCTTC
ACCGATCTAC TCTTAATATT GACAAGCAAA ATCGAACCGT TTTCCACCAC ATCGCCAATC
TTGCCCTAAC AAAAGGCAAA ACTCATGCCG CCAAGTACTA CATGGAGACT ATCCTCGCGC
GTTTGGCCGA CTACCCTCAA GAACTTGCCG ACGTGATCAA CTTTCAAGAT GAAGAAGGTG
AAACTGCTTT AACTATTGCT GCGCGTGCCA GAAGCCGTCG ACTGGTGAAG GCTCTGCTCG
ACCACGGTGC CAATCCCAAG ATCAAGAACC GTGACTCCCG CTCAGCTGAA GATTATATCC
TCGAGGATGA GCGATTCCGT TCATCTCCCG TTCCAGCTCC CAACGGTGGC ATCGGTAAAG
CTAGCACCTC TGCTGCCGCC GAAAAACCTC TCTTTGCTCC TCAGTTGTAC TTCTCCGAAG
CGGCCAGGTT ATGTGGCGGC CAAGCATTAA CCGACATCAC TTCCCACATG CAGTCACTCG
CACGATCTTT CGACGCTGAA TTGCAAGGCA AAGAACGAGA CATTCTCCAA GCCAAGGCTC
TTCTTACCAA CATCCATACT GAGGTTACCG AAAATGGTCG ATCAATCACT GCTATCACCA
ATCAAGCGGC TCCCCTTGAA GAAAAACGAC GTGAGCTTGA GGCTCTACAA GCATCTCTGA
AGACAAGAGT AAAGGACGCT TTGAAGAAGG GTTATATCGG GTGGCTTGAG GGCGAACTGG
TAAGGGAACA ACGATGGGAG AACGGTGAGC TCGAGGGAAA TGAAGAGGAG AAGGCGGCTG
TTCAGGCATT AAGGGATGTT CCTACCGGTG GTCAGGAGGT TGTTCAGGCC GAGGAGGAAA
AGTTAAGATG GGAGATTGAG GAGAAGAGGA AGCGAAGGGC TATGTTTGTG GAAAAATTTG
TCAGAGCACA GACCGAAGTA AGTTTCTGGG CAATGTTGAA GTAGAGCAAT GCTCATAATT
TTGCAGGCTG GTACAAGTGA ACAGATTGCC AAGTACAGGA AACTGGTATC CGCTGGGCTC
GGAGGTGTTT CAACAAATGA AGTAGATGAG TTGATGAACC AGTTATTAGA AGTAGGTTCT
GCGATCTACT AGCCTAATGA GAGTTGAACT GATCTTCCTT GCAGGGTCTC GAAGAGGAGA
ATGATAATCA AGTGTACAAC ACAACCGCTG GAGAATCAGG TCCTTCATCA TGGGTGCAGT
AATATGGTCA TTGGGGATGA AGGGAAGGAA GGAATCATGT GGTCAATAAT TGGAAGTTCT
CAGATCTCTG TTCTGTATTA CCAAAAGGTT TCTGCACATG ATGTGACTTG GTCTTGGGTC
TCTTAAGTGG TCTTTTACTT TCTAGTAACT ATGCGAATGC AAAATGC
 
Protein sequence
MGKKVIASGG DNGPNTIYKA TYSGVPVYEM VCRDVAVMRR RSDAYLNATQ ILKVAGFDKP 
QRTRVLEREV QKGEHEKVQG GYGKYQGTWI PIERGLALAK QYGVEDILRP IIDYVPTSVS
PPPAPKHSVA PPSKARRDKE KETGRTKATP SRTGPTSAAA LQAQAQLNRA KMHDSTPDAD
ASFRSFEERV SLTPEDDSSS DTPSPVASVM TDQDMEVDKM GMHMSMPNVT LSQNMEELGA
GSRKRSAAMM MEDEDQFGQL RSIRGNSAVH TPHGTPRHLG IGMPPEPIGP EQYTDIILNY
FVSETSQIPS ILVSPPHDFD PNAPIDDDGH TALHWACAMG RVRVVKLLLT AGASIFAGNN
AEQTPLMRSV MFSNNYDMRK FPELYELLHR STLNIDKQNR TVFHHIANLA LTKGKTHAAK
YYMETILARL ADYPQELADV INFQDEEGET ALTIAARARS RRLVKALLDH GANPKIKNRD
SRSAEDYILE DERFRSSPVP APNGGIGKAS TSAAAEKPLF APQLYFSEAA RLCGGQALTD
ITSHMQSLAR SFDAELQGKE RDILQAKALL TNIHTEVTEN GRSITAITNQ AAPLEEKRRE
LEALQASLKT RVKDALKKGY IGWLEGELVR EQRWENGELE GNEEEKAAVQ ALRDVPTGGQ
EVVQAEEEKL RWEIEEKRKR RAMFVEKFVR AQTEAGTSEQ IAKYRKLVSA GLGGVSTNEV
DELMNQLLEG LEEENDNQVY NTTAGESGPS SWVQ