Gene Information |
Locus tag | CND03810 |
Symbol | |
ID | 3257017 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | + |
Start bp | 1052013 |
End bp | 1055747 |
Gene Length | 3735 bp |
Protein Length | 1067 aa |
Translation table | |
GC content | 52% |
IMG OID | 638256316 |
Product | transcription factor binding protein, putative |
Protein accession | XP_570167 |
Protein GI | 58266022 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
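The Gene Length value above agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. The following is a minimal Python sketch under that assumption. The helper names `gene_length` and `gc_percent` are hypothetical and not part of any database API, and it is an assumption that the reported GC content is a plain G+C percentage over the gene sequence.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Coordinates taken from the record above.
print(gene_length(1052013, 1055747))  # 3735
```

Passing the full gene sequence from the Sequence section to `gc_percent` would give a figure to compare against the GC content field. The result depends on how the database rounds and on which region it measures.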
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCGCCCCCA TGAGCCACCG AAGACGCCAG TCCACAGCAT CCGCAGCAAC GCCCCGCTCA GCAAGGGAGC ACTCCGCACT CAAGTCGTTC AGGAGCACCT CCATCGCCAG CGCAACCACC AGGATGGACG AAGACGACGA CAGTGGTGAG TACGAGCACA GCGCCCCGGG GAGTGCATGC AGTGTCTCGT CGGCGGGGAG CAAGCCGCAG GTCAGAACGG GGATATAGAG TGCAGCTGCA GAGGTGCAGT GGCAGAGAAG CGTCCGGGTT TTGTGCCAGG AGGAGCCGGG CCGTCGCGGA TATGGATGCG ATAGGTGGCT TCACGAAGCA TCACCCGTCG TCTTGTGCAT GAGCAATAGC TGACAGCCGT GCAGGTGGCC CACGTAGGAA TACCCGTCAC CGAAATCCGT TACCACCAAC CACAGGTCCT CTCTTTCCGC CTTTACCACC CAAACAGCCG AAAAGCGCCC ACAAAGAGGG ATCCATGTCT AAATCACCCG CAAAAGCTAT TGGCGACGCC ACAGAAACTC AGGATGACTA TAATGAAGAA CAAGAAAAGT CAGGCGGCTC GACAGCAGTG GCAGACGACG AAGAAGTGGG TGAAGATGGT CAGGTACGAG AGGGGAATGG CCTGCCCAGC GGCTCAAACA CTAGTCTTAC ACCTCCTCCG ACAAGTCCAG CGGCTCGTGT CGATGTCGAT GTACCGCCCA ACACTGAAAC GACGTCGTTA TTACATCAGG ACCTGGAGGC TGAGCAAGTT CAAGACGACA TGGGTCAGCC CAAATCCATC AACGGCGGTG ATAATGGCGG CGTTGATAAT GACCAACCCC GCGAAGAACA AGAAATGGAT GAAGATGATG TCTGGGAAAC ATATAGACGG CATCGCGCAG TGAAAGGTTT TGTTCGCAAG GATGACAACG TCAAGTCAGA GTTTGGCGAA ATGGAGCGCG ACGACGGAGA CCATCTACTG GATAATCCTG TGCATGTCAA CGGGGACAAA TCGCACGACG AGCACCAGCA CCAGCACATA GATGTTCTTC ATGATGCCGA CGAACCAGAA GACTCCAATC ACCCACCTCG CACCCGCTCG AAACCTATCC CACGCGGCAT GTCCGCCATC TCCGCCTCAA ACGGCAATAC TCCTGCTTCT GCATCGGGAT CCGACGCCAA TCTTGGTCGT GCGGGGTCGC GTCAGGGAAG GAGGAGAAGA GGTGAGGAGC AGTTGCTGCT TGATGATCAT CTTCTCCCTG CAGAAATACG GCGCACTGCG CTTGCGGAGC GTGCCAAAAA AGATGTCGAG AAAGACGAAG AGGAGGAAGT AGAGGTAGAA GAAGAAGAGG AAGGGGAGGA AGACGAGGAG GAGGGTGATG GATTAGGGGA ACAAGAAGAA GATGAGGAAG GAAGGGATAT CACTCGTTGT GTGTGCAAAC GTGAAGGTTA GTGAGGATCA GACGTATATC AGCAGAAAAC GCTAACAACT ATCAGACATT GATGTGATGA TGATTCAATG TGATCAGTGC AACGTTTGGC AGCATGGAGA GTGTATGGGT ATATGGGGTG ACGAAGAAGC TCCAGACGGT GCGTTGTACC TCTTCTATTT TCGTTGTCTT TATGTACTCA TCCTCGCAAT AGAATATTTT TGTGAAGAAT GTAAACCCGA ACGACATCAG GCTCTGAAAA AATGGCTACG TTCTCGAGGA CGCAACACGT ATGTTGCCAG TCGTACCCTT TATATCTGTA TACTGACAAA TCACATTTAG CTCACCATTT ATTCCACCTA CTCCAGAGAT CCTCGAACGT 
CTTCATTCTG CCAGGGACCC TTACCCTCCG ACACAATCCA AAAGATGGAC GGAGTATTCT AATTCCGAAC CTCTGCCGCC GAAATTACTT GCAAGGTCTC ATCATAAGAA ACAGCAACAA CAGCAACAAC AGCAGCAGCA GCAAGGACAA GAGATGTTGA CCGGTACGAC GGAAGGGGGT GATGGGCGAA GGACAAGAGG AAGGCAATCG TCAACCAGGG AAAAGCCTCC AAGTTCGGCT AGCGGTGTCG GCCATGGAGG TACCCCATCG ACTTCCAGCA AAAAGGACGG TAAGAGGGGA GGCCGAAAAC AATCAGTGAA TGGAATGGAG AGCGAGAACG AATCAGAATC TTCTCAAAAC ATAGGGGCGG GAACTAGTCT TTCAGTCAAG AAGAGAAGCA CCATGAACTC GAGGGATGCC GCGTATGAAG AGGCCGTTAA AGCTGCGATG GAGGCAAGCC GGAAGGAAAT GGCTCCTCAA GAGGAAGGAG ATGTTAATTC CGAGACGAAA GAAGTAGAGG TCAAGGAGAA GGACCGAGCC AGACCTGGGA AGAGGAGAAG AAACGAGGAC GAAGAATTAG ACGAAGAAGA GAAAGAAGAT GAAAAAGATA AGCCCAAAAA AGGAAAGCGA AAGAAAGAGG ATGAGACTGG TGAGTTATCA TTCATTAAAA GCAACCAACT TGGTACGATC AGCTGACTCG CCTGACAACA GAATCTGATC CTACATCTGC TGGGCCTTCA AAACCCAAAC ACCCCAACCA ATACACTTAC CGCCCTAAGC CTCCCTCAAC AACCGGTCCC GCGGCTTCCC CTTCTCGTCG ATTAGGTGGC ACACCAGGAC CCACTACAAT CCCTACACAT CACGAGCATG GTACACGCCG CGCTGGAGCT TTAGCCAATG CGCCGGTGGT TTTCCATCCT TTGTCAGAGG AAGGCGCCAG TCAGCTGAAT TGGCACCTAC CTGATTACCT GACACAGTTT AATGACGTGC TACCTAGCGA TAGCCCTGTA GCTTTAGAAG TGCCTGCTCC GCGGGTCATG GCGTATTTGC CGAAGAATCA TTTCCACAAC CAGCGATACG GTCCGTTCTC CGAGGAACGG GACGCTACCG GCAAACTTAT CCTTCCAGAT GATCAACAGG ATCGAGAAGT AGTTGGTCAC CCCACCACCC AACTCGAGCC TCCAACACGC ATCAAATTTC CGGCCAAACG CATTTCAGCG GGAGACGTGA AGAAGCGCGT GAGAGCAGTG TTGGAATATG TGGGCAAAAT GCAGGCGGAC GAGGGAAAGA GGTTAAGAAG GGCAAAGACA TTGGGCATTA CTCCGGTCTC AACGGCGGCC ATCAGATGGA AGGAAAGGCA ACGAAAGGAA AGGGAACGCG AGGAAGGCGC AGGGGATGAC GATGTGGTTA TGAGCGAAGC ATACGCGGAG GACGGACCTT TCCCAGCAGA GTTGGGATTA GCAACAAACC AGCCTCGCTC TGCCCATTTA ATGGCAGAAC TCACAGAAAT GCTTATTGGA TTTCAAGAAG CATTCTCAAG CAATGACTTT GCCGCATTCG AGAATGGTGC CACTGTTGGT TCTGCTCCGC CTACTCCCCA AATACCCGAC ACGTCAACCG TCCCTCCCAC GCCAGTTCTA CCATCACTTC ATTCAGATCA CCCTCACGGT TCATCTGCTG GGGTGATATC GCGGTCGGCT GAGCGTGAAG TCGACGGCAC AGAAGAGACC ACAACGGAGG AAGTAGGAGC AGGTAAGGGA TTGGACGTGT ACAGAGCGGG GATCGTGAAT AAGGTTGTCA CAATTAACAG 
CACTGAGCGA GACGTGGTGA AAAAGGTAGA GGAAATCACT CAGGCATGAT GCGGGATATC GGGTGTTCAT GATTTGTGTA GAATGTACAT GGGCGTATTA TTATGGGTTA TATAACTGCT AATCA
|
Protein sequence | MSHRRRQSTA SAATPRSARE HSALKSFRST SIASATTRMD EDDDSGGPRR NTRHRNPLPP TTGPLFPPLP PKQPKSAHKE GSMSKSPAKA IGDATETQDD YNEEQEKSGG STAVADDEEV GEDGQVREGN GLPSGSNTSL TPPPTSPAAR VDVDVPPNTE TTSLLHQDLE AEQVQDDMGQ PKSINGGDNG GVDNDQPREE QEMDEDDVWE TYRRHRAVKG FVRKDDNVKS EFGEMERDDG DHLLDNPVHV NGDKSHDEHQ HQHIDVLHDA DEPEDSNHPP RTRSKPIPRG MSAISASNGN TPASASGSDA NLGRAGSRQG RRRRGEEQLL LDDHLLPAEI RRTALAERAK KDVEKDEEEE VEVEEEEEGE EDEEEGDGLG EQEEDEEGRD ITRCVCKRED IDVMMIQCDQ CNVWQHGECM GIWGDEEAPD EYFCEECKPE RHQALKKWLR SRGRNTSPFI PPTPEILERL HSARDPYPPT QSKRWTEYSN SEPLPPKLLA RSHHKKQQQQ QQQQQQQGQE MLTGTTEGGD GRRTRGRQSS TREKPPSSAS GVGHGGTPST SSKKDGKRGG RKQSVNGMES ENESESSQNI GAGTSLSVKK RSTMNSRDAA YEEAVKAAME ASRKEMAPQE EGDVNSETKE VEVKEKDRAR PGKRRRNEDE ELDEEEKEDE KDKPKKGKRK KEDETESDPT SAGPSKPKHP NQYTYRPKPP STTGPAASPS RRLGGTPGPT TIPTHHEHGT RRAGALANAP VVFHPLSEEG ASQLNWHLPD YLTQFNDVLP SDSPVALEVP APRVMAYLPK NHFHNQRYGP FSEERDATGK LILPDDQQDR EVVGHPTTQL EPPTRIKFPA KRISAGDVKK RVRAVLEYVG KMQADEGKRL RRAKTLGITP VSTAAIRWKE RQRKEREREE GAGDDDVVMS EAYAEDGPFP AELGLATNQP RSAHLMAELT EMLIGFQEAF SSNDFAAFEN GATVGSAPPT PQIPDTSTVP PTPVLPSLHS DHPHGSSAGV ISRSAEREVD GTEETTTEEV GAGKGLDVYR AGIVNKVVTI NSTERDVVKK VEEITQA