Gene CNB04820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB04820 
Symbol 
ID3256082 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp1373675 
End bp1375751 
Gene Length2077 bp 
Protein Length548 aa 
Translation table 
GC content50% 
IMG OID638255126 
Producttranscriptional activator, putative 
Protein accessionXP_568972 
Protein GI58263124 
COG category[K] Transcription 
COG ID[COG5068] Regulator of arginine metabolism and related MADS box-containing transcription factors 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAGGA AAAAGATTGA AATTCGCCCG CTCACTGTAC GTTTTCTGCC GAAGAGCAGA 
ACTTGCGCCT AAGAAAAAAA AGGGGCTAAT AGGTCATCCT CAGGACGAGC GTAACAGAAA
TGTTACCTTT CTCAAGGTTG GTCCACCATT CAAACATGGA CTATCATCAT CCGTAATGCT
CCAGAGTATA TACTGACCGC GAAATAGAGG AAAGCAGGTT TGATGAAGAA GGCCTGGGAA
TTATCGGTAC TTTGTGCCGC AGATGTGTCG ATCATCATAT TTTCTGCGGC CGGCAAGGCG
TTCGAATTCT CAAGCAAGGA GCTGGATAGT GAGATAGACA GGTATCACGA TGTAAGCTAC
CAGAGGATAT CGTTTCGTTG GGTGAGAAAG GAGGCTGATG AGATAATAGT ATGAAGGGAT
GATTGAGAGG AGACGGGCGG CAGAGTTTGC AGCAATGGCG CTGGCGGGAG AGGATGATGA
TGATGATGAA GATGATGATA CTTCAAGGAG AGGCTCTGCG TCCAAAAGCA AGGCGGCGGC
TGCGGCAAAC GGCAACCCTC CTCCTACTAG GAGTCTCAAA GGGAAGGAAA CGTTCAAGCA
CCGAACAGTA CGGCCAGGTG AAGACAGGAA GAGGAAAAGA CAGGACAAGA AACAGCGACG
TAAAAGTGAA CCAAGCGAGA AGAGGAGTTT CATCGATGAG ATCATGAGTG GAGGCGAGTC
TGATAGTGAA GAAGAAGAGA AGCCCAGACG ACGGTCAAAT GCAGAACATG GGAATGGGAA
GCGTATGAGT AGTTTGAGGG AAGAGGACGA GTTGGCTGAT GACGTGCCCC ACGTAAGTGT
GCTCTGATGT ATAACATTCA TAGTTCTGAC TTTATCTTCA GGCATCAAGA CAATCTCTCG
ATGGATTACA ATACGCACTC AGTATGCACG CCTCTCAACC ATCATTCGAA CGCTTTGCAT
CTCGTCATCG ATCTCCTCAC GCCGAATTTC TAGCTCCTGC CGCTTCATCC TCCCAAACAC
CGCTTACAAC TCCCACAATC CATCGTCACC CATCAGACAC CATTCCATAC CCAATGACCA
ACTCTCTTGC CGCTGCTCCG ATGCAAGCAT CTTTAGGTGT CCCACAGCTC GGTGTTCATC
CTTCCTATCC TCACAATCCT AACGGTCATC CCGGTTATCT TGGTTACCCC AAGTCTCTTT
TGGGGATGCA AGCGTCATAT CTACCTCGTC AACCATTTGC TGACGCACCT CAGCCTTCAC
CATACTACGG TGCTCAGGGC CCAGGGTCTC ATCCCGGTGG GCCTCCTCAA CCCGGCCTTC
CAACGCAGAT GCCTGGGGGA CAGCCTATTC AATGGGATCA AAATCTCCTC GCTCGTTATG
CCGAGTTCCA ACTCCAGCAG AATCACCAGC GTCAGCAACG CATACTCTTA GAGAGACAAC
GACAACAACT GGCAGAGCTG GGTGTACCTC TTGACGAGAA GAATTTGTTG GACGAGATCT
TTGGAGGGGT AGGAGCCAGC CGGTCGGTGA GTGGAGGTGC AGATCCAAGT GGGGCAGGTC
CGGGGTCAGC GCTTCTTACC GGACTTGGCA ATACTGCCTC TGAAGGGAGA GAAGAAGGGA
ATAGCCTAGA GTTCATCTGG CCATTGGGTA ATAATGCCGC TGCTGCTGCG CCAATTAGCG
ATGAAGATCG AAACGACGTT TCTTCCCGTT CTGTTATTTC TGCTGGCCAT CATCAGGCGG
CTTATGGATA TGGAAAGCCG CATGTTCAGC AGCAAAAGAC GGGTTGGGGG TTTGATGGGG
CCGGCTTTGA ATCAATGGAG GAAGTGGCCA GCGGGAGTGG AGTGGGTTTA CCAAGTCCTG
TTTCAGCAGG TGCTGGCGGG GGCAGGAAGT AGATGAGGCG ACGATGATCG AGAAGGCAAG
AATCTAATCA TGCGTTTGAT ATGGTATGGC CGTCGCATAC TAGAAAATTT CTTCTAATTC
ACTGAAAAAA ATAGCTTGCG TCGAAAGAAG CGCTTTTGTA ACTAGTCAGT CTGCCTTTTA
CCAACAGTTT CTTTTTCACT CGATTCGTTT TCCGTTT
 
Protein sequence
MGRKKIEIRP LTDERNRNVT FLKRKAGLMK KAWELSVLCA ADVSIIIFSA AGKAFEFSSK 
ELDSEIDRYH DYEGMIERRR AAEFAAMALA GEDDDDDEDD DTSRRGSASK SKAAAAANGN
PPPTRSLKGK ETFKHRTVRP GEDRKRKRQD KKQRRKSEPS EKRSFIDEIM SGGESDSEEE
EKPRRRSNAE HGNGKRMSSL REEDELADDV PHASRQSLDG LQYALSMHAS QPSFERFASR
HRSPHAEFLA PAASSSQTPL TTPTIHRHPS DTIPYPMTNS LAAAPMQASL GVPQLGVHPS
YPHNPNGHPG YLGYPKSLLG MQASYLPRQP FADAPQPSPY YGAQGPGSHP GGPPQPGLPT
QMPGGQPIQW DQNLLARYAE FQLQQNHQRQ QRILLERQRQ QLAELGVPLD EKNLLDEIFG
GVGASRSVSG GADPSGAGPG SALLTGLGNT ASEGREEGNS LEFIWPLGNN AAAAAPISDE
DRNDVSSRSV ISAGHHQAAY GYGKPHVQQQ KTGWGFDGAG FESMEEVASG SGVGLPSPVS
AGAGGGRK