Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB04820 |
Symbol | |
ID | 3256082 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | - |
Start bp | 1373675 |
End bp | 1375751 |
Gene Length | 2077 bp |
Protein Length | 548 aa |
Translation table | |
GC content | 50% |
IMG OID | 638255126 |
Product | transcriptional activator, putative |
Protein accession | XP_568972 |
Protein GI | 58263124 |
COG category | [K] Transcription |
COG ID | [COG5068] Regulator of arginine metabolism and related MADS box-containing transcription factors |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAGGA AAAAGATTGA AATTCGCCCG CTCACTGTAC GTTTTCTGCC GAAGAGCAGA ACTTGCGCCT AAGAAAAAAA AGGGGCTAAT AGGTCATCCT CAGGACGAGC GTAACAGAAA TGTTACCTTT CTCAAGGTTG GTCCACCATT CAAACATGGA CTATCATCAT CCGTAATGCT CCAGAGTATA TACTGACCGC GAAATAGAGG AAAGCAGGTT TGATGAAGAA GGCCTGGGAA TTATCGGTAC TTTGTGCCGC AGATGTGTCG ATCATCATAT TTTCTGCGGC CGGCAAGGCG TTCGAATTCT CAAGCAAGGA GCTGGATAGT GAGATAGACA GGTATCACGA TGTAAGCTAC CAGAGGATAT CGTTTCGTTG GGTGAGAAAG GAGGCTGATG AGATAATAGT ATGAAGGGAT GATTGAGAGG AGACGGGCGG CAGAGTTTGC AGCAATGGCG CTGGCGGGAG AGGATGATGA TGATGATGAA GATGATGATA CTTCAAGGAG AGGCTCTGCG TCCAAAAGCA AGGCGGCGGC TGCGGCAAAC GGCAACCCTC CTCCTACTAG GAGTCTCAAA GGGAAGGAAA CGTTCAAGCA CCGAACAGTA CGGCCAGGTG AAGACAGGAA GAGGAAAAGA CAGGACAAGA AACAGCGACG TAAAAGTGAA CCAAGCGAGA AGAGGAGTTT CATCGATGAG ATCATGAGTG GAGGCGAGTC TGATAGTGAA GAAGAAGAGA AGCCCAGACG ACGGTCAAAT GCAGAACATG GGAATGGGAA GCGTATGAGT AGTTTGAGGG AAGAGGACGA GTTGGCTGAT GACGTGCCCC ACGTAAGTGT GCTCTGATGT ATAACATTCA TAGTTCTGAC TTTATCTTCA GGCATCAAGA CAATCTCTCG ATGGATTACA ATACGCACTC AGTATGCACG CCTCTCAACC ATCATTCGAA CGCTTTGCAT CTCGTCATCG ATCTCCTCAC GCCGAATTTC TAGCTCCTGC CGCTTCATCC TCCCAAACAC CGCTTACAAC TCCCACAATC CATCGTCACC CATCAGACAC CATTCCATAC CCAATGACCA ACTCTCTTGC CGCTGCTCCG ATGCAAGCAT CTTTAGGTGT CCCACAGCTC GGTGTTCATC CTTCCTATCC TCACAATCCT AACGGTCATC CCGGTTATCT TGGTTACCCC AAGTCTCTTT TGGGGATGCA AGCGTCATAT CTACCTCGTC AACCATTTGC TGACGCACCT CAGCCTTCAC CATACTACGG TGCTCAGGGC CCAGGGTCTC ATCCCGGTGG GCCTCCTCAA CCCGGCCTTC CAACGCAGAT GCCTGGGGGA CAGCCTATTC AATGGGATCA AAATCTCCTC GCTCGTTATG CCGAGTTCCA ACTCCAGCAG AATCACCAGC GTCAGCAACG CATACTCTTA GAGAGACAAC GACAACAACT GGCAGAGCTG GGTGTACCTC TTGACGAGAA GAATTTGTTG GACGAGATCT TTGGAGGGGT AGGAGCCAGC CGGTCGGTGA GTGGAGGTGC AGATCCAAGT GGGGCAGGTC CGGGGTCAGC GCTTCTTACC GGACTTGGCA ATACTGCCTC TGAAGGGAGA GAAGAAGGGA ATAGCCTAGA GTTCATCTGG CCATTGGGTA ATAATGCCGC TGCTGCTGCG CCAATTAGCG ATGAAGATCG AAACGACGTT TCTTCCCGTT CTGTTATTTC TGCTGGCCAT CATCAGGCGG CTTATGGATA TGGAAAGCCG CATGTTCAGC AGCAAAAGAC GGGTTGGGGG TTTGATGGGG CCGGCTTTGA ATCAATGGAG GAAGTGGCCA GCGGGAGTGG AGTGGGTTTA CCAAGTCCTG TTTCAGCAGG TGCTGGCGGG GGCAGGAAGT AGATGAGGCG ACGATGATCG AGAAGGCAAG AATCTAATCA TGCGTTTGAT ATGGTATGGC CGTCGCATAC TAGAAAATTT CTTCTAATTC ACTGAAAAAA ATAGCTTGCG TCGAAAGAAG CGCTTTTGTA ACTAGTCAGT CTGCCTTTTA CCAACAGTTT CTTTTTCACT CGATTCGTTT TCCGTTT
|
Protein sequence | MGRKKIEIRP LTDERNRNVT FLKRKAGLMK KAWELSVLCA ADVSIIIFSA AGKAFEFSSK ELDSEIDRYH DYEGMIERRR AAEFAAMALA GEDDDDDEDD DTSRRGSASK SKAAAAANGN PPPTRSLKGK ETFKHRTVRP GEDRKRKRQD KKQRRKSEPS EKRSFIDEIM SGGESDSEEE EKPRRRSNAE HGNGKRMSSL REEDELADDV PHASRQSLDG LQYALSMHAS QPSFERFASR HRSPHAEFLA PAASSSQTPL TTPTIHRHPS DTIPYPMTNS LAAAPMQASL GVPQLGVHPS YPHNPNGHPG YLGYPKSLLG MQASYLPRQP FADAPQPSPY YGAQGPGSHP GGPPQPGLPT QMPGGQPIQW DQNLLARYAE FQLQQNHQRQ QRILLERQRQ QLAELGVPLD EKNLLDEIFG GVGASRSVSG GADPSGAGPG SALLTGLGNT ASEGREEGNS LEFIWPLGNN AAAAAPISDE DRNDVSSRSV ISAGHHQAAY GYGKPHVQQQ KTGWGFDGAG FESMEEVASG SGVGLPSPVS AGAGGGRK
|
| |