Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB04270 |
Symbol | |
ID | 3255957 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 1245088 |
End bp | 1247812 |
Gene Length | 2725 bp |
Protein Length | 820 aa |
Translation table | |
GC content | 50% |
IMG OID | 638255072 |
Product | conserved hypothetical protein |
Protein accession | XP_569250 |
Protein GI | 58264188 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.302416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCAACG AAATTCAAAA AAAGTCTTGG GTAACATCTC CACAAGATCA ACAAAAGGGC AAGATGAGAA TCACCCCAGA GTATATATCG GTTGGCGCAA ATCGTTCTTC CTCTTGCGCC GCTTGCTCCG CATCCGGTTT GCTCTTCTTT GGTGCCGGCA ACTTCATCGC ACTTTGGGAT TCATCGTCAA ATCGAGGAGT CCACGCAACG CTCCCGGGGC ACAAAGGCCA GGTCACAACC GTCAAACTCC TGCCAGATGG GAGACTCGTT AGCGGCGATA ACATTGGTGA AATAAGAGTA TGGAATTCTG TAAAGGGTGA TGACGAGTGG GAATGTGTGA TGAGCTGGGA AGCCCACAGA GGAGGGTCGA TTTCTGCCAT CGGCGTCCTT GCTTCAAGAG GGAGCCTAGA TGACATGATT ATCACCGGTG GGTCAGACAG TCTGATCAAA AGGTGGAAAA TAGCGGATAA GCCAGAGGAG ATTCAGAAAA TCGATCTGGG AGGCAAGCTG CCATTGGATT TGGAAGTTGG TTACTTGCCT GGATCTGAAG GTACGTAAGA AAGCGGGAAC CGAGGTGGAT AATAATCTGA CTCCGTACAG CCCCAATACT CGCCTTGGGC TGTACTGACC GCCGTATTCA AATATGGACC ATTCGAGACG GCTCATTTAC CCGCGCTCTT TCTCTAGAAG GCCATGAAGA CTGGGTCCGT TGTCTCTCCT TCACACCCTA CCCTTCCGCA TCCTCTTCCT CCCAGGATTT ACTTTTGGCT TCTGGGTCCC AAGATAATTT TATCCGATTA TGGCGTGTCT CGCCCATCGA GCAAGAAGTT GCTAGTCCAA GTGCCGGAGA CGAGGGTCTT GAGATGCTTG ACGAGTTTGA AAAAAGGCTT GCGGGCGAGG CCGGTGGAAA CGTGCAAATA TCAACCAAGG CCCATATCCT CGGTGTTCAA GATGGTGAGA AAAATTTACG GTTCAACATC ACTCTTGAAG CCTTGCTCGT TGGCCATGAG TCTGGTCTTA CCAATGTCCA CTGGTCCCCT ACTCCCACAT CATCTTCCCC CACACCCCTC CTTCTCTCGA CCGCTTCCGA CAACTCTCTC ATAATCTGGA GCCCGTCAAG TACGTCAACT TCTGCGGACG GCATCTGGGT ACCTACGAAC CGATTTGGTG CGATCGGTGG TAGAGGTCTG TCGTTCTATG GCGCGATCTG GGGCAAGGAT GGCAAGAGTG TCATGGCTGG TGGGTGGAAT GGAGGATGGG AAAAGTGGGT TGAGTCAGAA CAAGGATGGG ATGTCCAAAG AGGTTTGACC GGACATCATG GCAGTGTCGA AACTGTTTGT TGGGATCCCA GAGGGGAGTA CCTTTTGTCT GTTGCGTAAG TCGAATCATC AGACTGACTG TCATATACTA AATCTCCATG ACAGCTCCGA TCAAACAGCA CGTATCCATG CTGAATGCAA TCTGCCTTCC TCTTCCACGT CTATCTGGGC CGAAATCGCT CGCCCTCAAA TCCACGGTTA CGACATGACA GATGCTTCAT TTATCTCCCC TCTTCGTTTT GTTAGCGGTG CAGATGAAAA GGTTGCTCGA GTGTTTGATG CGCCCCAAGG TTTCGTCGAG TCATTAAGAT CTCTAGGTAT CAGTAAGAGG GAAGCAGAGG AGGAAAGCAG ACCTAAGGGA GCTACCGTCC CGCCTTTGGG GCTGTCAAAT CGCGCGTTGC AGAAAGGTTA GTCGTACTTG GTGCCTTTCG AAAACGGTAC TGACGTGGAT ACAGCTCCTG TCGCCGGAGA TGCTGTCGAA AAGCAAGGTC AAAATGAAGC TATTATTTCC ATCTCTCATA CTTTCACATC TCTCCCTACG GAAGAAGAAC TCGCTACCTC AACCCTCTGG CCCGAGGTCG AAAAAGTCTA TGGCCACGGT TACGAACTCG TCTGTGCAGC TGCTTCTCAT GCCGGAGACC TTATTGCGAC AGCATCCAAG GCCACTAATG CCGAACACGC TGTGATCCGA GTAATATCAG CCTCCAAGTG GGAGCTAGTT GGTGAACCAC TGGCGGGTCA CTCTTTGACA ATCACGAGCG TTTCTTTCAG TAGGGATGAC AAGAGAATTT TGAGCTGTTC TAGGGATCGA GGATGGAGAG TGTTTGAGAG AAAAGAGGAT GGGGAAGGTT ATTTCCCTCT TGCGGGAGAC GAAAAGGCGC ACGCGAGGAT GGTCTTGGAC GCATGCTGGG CAGACGAGAG AAATGACATG TTCGCGACCG CATCCAGGGA TAAGACTGTA CGTTTTCACG CTTGTTCTGG AGCTTGACAA TGACTAATGC AGATCTTCAG GTTAAAATTT GGACTTCAGC AGTAGCAGAT GGTTCTCAAT GGGCTGCAGC TGGAACAATC AAATTAACTG TAGCTTCTAC AGCAGTGGCC ATGATTAATG ACGGTTCTGA CGGCTATCTG TTGGCTGTTG GAAAGGAGAG CGGCTCTATC GAAGTCTTCA CTGTAGCTGT GAACCGGGAT GGGGTAAAGA GTGATCTACT CTCTACTTTC GATCGTCGGT GAGTAACCGA AAACCATGTT TGCATTCTTG AGTTGCGTTC TTGAGCTGAC CTCTTTGTAG AGTATCGCAT GTGAGCGCAG TGAATAAACT TGCATGGAGG AACGTCGAGG GTGTTTTGAG CTTGGCGAGT TGCAGTGATG ATCGAAGTGT CCGCGTATAC AAGGTCGAGT TATAA
|
Protein sequence | MLNEIQKKSW VTSPQDQQKG KMRITPEYIS VGANRSSSCA ACSASGLLFF GAGNFIALWD SSSNRGVHAT LPGHKGQVTT VKLLPDGRLV SGDNIGEIRV WNSVKGDDEW ECVMSWEAHR GGSISAIGVL ASRGSLDDMI ITGGSDSLIK RWKIADKPEE IQKIDLGGKL PLDLEVGYLP GSEAPILALG CTDRRIQIWT IRDGSFTRAL SLEGHEDWVR CLSFTPYPSA SSSSQDLLLA SGSQDNFIRL WRVSPIEQEV ASPSAGDEGL EMLDEFEKRL AGEAGGNVQI STKAHILGVQ DGEKNLRFNI TLEALLVGHE SGLTNVHWSP TPTSSSPTPL LLSTASDNSL IIWSPSSTST SADGIWVPTN RFGAIGGRGL SFYGAIWGKD GKSVMAGGWN GGWEKWVESE QGWDVQRGLT GHHGSVETVC WDPRGEYLLS VASDQTARIH AECNLPSSST SIWAEIARPQ IHGYDMTDAS FISPLRFVSG ADEKVARVFD APQGFVESLR SLGISKREAE EESRPKGATV PPLGLSNRAL QKAPVAGDAV EKQGQNEAII SISHTFTSLP TEEELATSTL WPEVEKVYGH GYELVCAAAS HAGDLIATAS KATNAEHAVI RVISASKWEL VGEPLAGHSL TITSVSFSRD DKRILSCSRD RGWRVFERKE DGEGYFPLAG DEKAHARMVL DACWADERND MFATASRDKT VKIWTSAVAD GSQWAAAGTI KLTVASTAVA MINDGSDGYL LAVGKESGSI EVFTVAVNRD GVKSDLLSTF DRRVSHVSAV NKLAWRNVEG VLSLASCSDD RSVRVYKVEL
|
| |