Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CND04780 |
Symbol | |
ID | 3257387 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | + |
Start bp | 1313296 |
End bp | 1316389 |
Gene Length | 3094 bp |
Protein Length | 892 aa |
Translation table | |
GC content | 50% |
IMG OID | 638256414 |
Product | conserved hypothetical protein |
Protein accession | XP_570413 |
Protein GI | 58266514 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.835456 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAGAG ACAGCACCAA CACGGATCGG GACACTCACA GACACGGCTC AAGGCATGAC CGCCACGACA GGCCTCATAG ACATCGAACT CACAGGGATC GGGACAAGCA AGATGCAGAC CAGATTGAAG AGGAGCAGAG GAGAGAGAGA AAACGGCTCA AGAAGGAGAG GAGGACAGAT GATGACCATA CTCTTCAGGT GTTGGACGAT GACCCTAGTA TGTGGGTAGA AAAATCATTG GATCCTACCA ATGCTGTCGC GAACATCCCC ACCGCCGACT CTCTCCCTCT AACGTCGAAT CCTTCTGGTC CGAAAGTTTC TCTTCCCCTC TCCACAGCAA CAGGCTCAGA AGGCCGGCAG CGCGATTCGT GGATGCTGGA GCCATCGGTG TCTTCTGCCA TTGTCCCGAC CCCGAGGGAC GATGTGCCCC ATAGCGCTGT CAAGTCTGCC GCTGATACAG CGGACAGGTA CGGAGATGGG CAGCCTGATG ATAGGACCTC CGTTTCGAAT GTGGACCTCT TTTCCTCCAT GGGAATAGAA CATAAGAGGA AGGATCCTAG ATCGGATAAG CCAGACCCTT CCCAGGTCAG TTTTGCACCT GGCATTCACG GATAAGCTGA TACATGCAAT TTGAAAAGCT CGTGGTCGAC GACCGTTTCG AACTCAACAC CCAACTCCTC GAGGGGAAAA ATGTCGATGA ATACGAAGTT AAAGGTATTG CAACCCATAT ATTACATCTT TTGTCCTTTA CTGATCATAT TCCTAGAAAA AAAGACGACA TTTGGTGGAC CTGGTTACCA GTGGCGAATG ATGAAACTCA AGCGTCTGTA TGAGCAGGCA GAAGAACAGT CTCGACCCGT AGAAGAAGTG GCTCTTGAAC GCTACGGATC ACTTGATGAG TTCAATGAAG CTCTTGAAGA ACGTCGTTAT CTAGACGATC GTGAAGCGCG ACGCAAATCG CGGGGTGTCA GTAGACCTGT TGGGCCCAGT TCCGATTCCT CCAGGCCCAC CACTCCTTCC GGCAACTCTT CCGGTATGCG GACTCCCGAT GCAGGTCGCC GATTCATGTT TGCTAACCGC ACCACTGGGG AGCAAACTTT TGGCACTGGT GGTGGCAGTG GCAATCGACC TGGCTCTCGA ACGGGTTTCC GTCGACCTGG TGAAGATTTA GAACAGGGCA CGACGCCTGT CAGCAGCGCT GGGAGGCTTG ATACACTTCG CCGTGACAAT GGTGGACTTG GAACACCAAA GCTTGAAAGT GGAGTGAGGT CTGGCTCTAG TGGTGTTGTT CCGATGAAGG TCGGTACTCC TATCCCTAGC GTTTTCACTC CTACCACCCT CACCCGCTCA TTCACTGGTC CCTCCCCCCC TGGGCCTGAG CATGAATCCG GTGCTGTGGA TCCAACCTCT TCGAAACCCC CTCTTTCAAC TGAACAGCTC AACAAACTTC AAGCCGCCGT TCTGCGTTCC AAACTTATGG ACGATCCGAA TGCCTCCGCA TTGGAGGACG AATACGAAAT CGAGCGTGAG CGGAGCGAGC GAGCACATGC GGGTGTGGGT GCAGGTGCAG GACTGTGGGA GGGGAATAAT GAGGGGATAC AGGGCCAACT CGGCAGGATG GATGAAAAAG GCAACAGGAT AGAAGTGCAG GTTCTTCCTA CATTGGACGG GAGAGGGAAG CTGTATGATG TAGGCACTGG CAAGGAAGAT GAATCGGTCG TCAGACCGGG AAATCGGAAG CAGAAGGACG CCAAGTTCGA GACCCGTGAC AAGCAAGGCA ACCTCCTACG CTATAACGCG GATGATGATA TTCAATCTCT GGGGGAACTT GTGCGACAAG AACGTTTTGG CGCAGGTTCA TCGGACCAGA AAAACCTGGA TGCTGAGATG GCGAGGGCTA TTGCAACGGA TGGCAAGTTT GAAGATGACC TTGACTATAT GGATGATAAT GCGGATAAGC TGGCCAGGAA AAAGATGAAG AGCGACGCAT TGAAAAGGGC CTTTGCCATA AATGGTAAGT TGAAATTGAC TCCCATGACT CGTTTCTAGG ATGTTTGGCT CAAGGCGAAA TGTTGGAACA GATTATGCTC GAACAAAGAA AGCGCTCGAT ACATGTCCAC TTTGCTACCA AGACGACCGT CCTCCGCAAA CCGCCATTGT TGCTCTTGGT ACACGCACGT ACATGTGCTG CACGCAATAC GAAGAACTTG TACCGGGGCA CTGTCTGATC GTGCCTTTGC AGCACCACCT GAGTATGCTG GAGATGGAAG ATGATGATTG GGACGAAGTC CGCGTGAGTT ACAGTCTAAT AAAAACACGT CGTTAGTAAA TTTATCAGCT CATCCCACAC GCTTTAATTT CCAGAACTTC ATGAAGTGCC TTATGCGCAT GCACGCTCAA TCAAACCATG GCGTTATCTT TTTCGAAACC ATCACCTCCT TCAAATCCCA GCGGCACTCG TACATAGAAG CTATTCCTGT GCCTTTTGAC ATATTCCAAG ATCTTCCTGC CTATTTCCGT GAATCGATTC TTTCTTCTGA AGGAGAATGG ACGCAACACA AGAAGTTAAT TGATTTCTCC TCGAGACCAG GTGGCTTTAG GAGGATGATG GTACCGAACT TGCCATATTT CATGGTTCAG TGGGATTATA AAGGCGAGAA GGGCTACGGG CATGTGATTG AAGGTATCAA AGATAGTGGA GCAGGAGGAG GAGAAGACGA GGAAGGCGAT GTGGGTGGAG CAATGTCCGA GAGCGAGTTC CCAAGGTAAA TTTGAGTTTG CAAGCTTATC AGAATTAGTA CATTAACATG GAAGGCTGAC TGGGAAAAAT TAGATACTTT GCCCAAGAAG TCATCGGCAA CATCCTGGGG CTGGAAGCTC GCAAATGGAG AAGACCGAGG AAAATGGACG TGGCGTTGAA TAAAGAAAGG GCACGAAAGT TGGGGACCCT TTTCCAGCCG TATAATTGGA CTGTGGGAAA CGGCGTTTAA GGACATGGGA AGGTTTACTA GTTCAAATCA GCACCGAAGA CGCATATCGT CTTTCAGTTT GAAATCGAAA ACTAGTTTGT GCTTTGAAGA AATGAAGACA GGCA
|
Protein sequence | MGRDSTNTDR DTHRHGSRHD RHDRPHRHRT HRDRDKQDAD QIEEEQRRER KRLKKERRTD DDHTLQVLDD DPSMWVEKSL DPTNAVANIP TADSLPLTSN PSGPKVSLPL STATGSEGRQ RDSWMLEPSV SSAIVPTPRD DVPHSAVKSA ADTADRYGDG QPDDRTSVSN VDLFSSMGIE HKRKDPRSDK PDPSQLVVDD RFELNTQLLE GKNVDEYEVK EKKTTFGGPG YQWRMMKLKR LYEQAEEQSR PVEEVALERY GSLDEFNEAL EERRYLDDRE ARRKSRGVSR PVGPSSDSSR PTTPSGNSSG MRTPDAGRRF MFANRTTGEQ TFGTGGGSGN RPGSRTGFRR PGEDLEQGTT PVSSAGRLDT LRRDNGGLGT PKLESGVRSG SSGVVPMKVG TPIPSVFTPT TLTRSFTGPS PPGPEHESGA VDPTSSKPPL STEQLNKLQA AVLRSKLMDD PNASALEDEY EIERERSERA HAGVGAGAGL WEGNNEGIQG QLGRMDEKGN RIEVQVLPTL DGRGKLYDVG TGKEDESVVR PGNRKQKDAK FETRDKQGNL LRYNADDDIQ SLGELVRQER FGAGSSDQKN LDAEMARAIA TDGKFEDDLD YMDDNADKLA RKKMKSDALK RAFAINDYAR TKKALDTCPL CYQDDRPPQT AIVALGTRTY MCCTQYEELV PGHCLIVPLQ HHLSMLEMED DDWDEVRNFM KCLMRMHAQS NHGVIFFETI TSFKSQRHSY IEAIPVPFDI FQDLPAYFRE SILSSEGEWT QHKKLIDFSS RPGGFRRMMV PNLPYFMVQW DYKGEKGYGH VIEGIKDSGA GGGEDEEGDV GGAMSESEFP RYFAQEVIGN ILGLEARKWR RPRKMDVALN KERARKLGTL FQPYNWTVGN GV
|
| |