Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB04640 |
Symbol | |
ID | 3256036 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | - |
Start bp | 1325646 |
End bp | 1328372 |
Gene Length | 2727 bp |
Protein Length | 830 aa |
Translation table | |
GC content | 50% |
IMG OID | 638255107 |
Product | rRNA processing-related protein, putative |
Protein accession | XP_568954 |
Protein GI | 58263088 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAAACCGGT CAAGATGGCA CCCCAACCGC TCAAGGTGGG AACTTCAAAA CTGTCCAAGA TGGTCAAATC AGCCCCCAAA ACCGCTGGAA AGGCAAAGAA GCGAGCAGTA GAGCAGGTGT CAGAAGAGAG CGACGAAGAA TTTGGGGACC AAGGGAGTGG AATTGATATG AGTGATGATG AGGAAGAGGT TGATGGAGAC GATGAGGAGG ATGAAGATGA AGCTTTTCCT GAATTCGACA GCGAGCTTGA GGATAATGAT CAAGAAGAAG CCAGTGATGA GGAGCAAGAT GAACAGGATA CGTCTGACGA AAGTGAAATC GTGGAGGAGG ACAGTGACTC TGAGTCAGGT TACAACACGT CTGACATCGA GCGAATGTAC GCTTCTGATG ATGATCTCTC GTCCGAAGAG AACAAAGACC TCCCTGTCGA CGAGAAGCTC TCGAGACTTA TCGCCAAGAA CACTGTCAAG CCCGACGACT CTATTGGCAC AGATGACAAA ATCAGTCGTG CAAAGGAAGG GGTAGGGAGA TTGGTGCCCA GCAAACACGT GAAGGGATCA TTTGTGCGAG AGTACGACGA ATATGAGGCT GGATATGGCA GTGAAAGTAG CACCGAGGAT GTGAGTTACA TCTCGCTCGA CAAGACCCAG CTGACAATAG CAGAACCCCA ATACTGTCGG TAACATTCCG ATGGAGTGGT ACGATGACCT TCCTCATATC GGTTACGATG TCAACGGTCG CAAGATCTTC CGGCCTTTGC AAGGCGACGA ACTTGACAAG TTCCTTGCCA ATGTCGAGGA TCCGTCCGCT TGGACCTCTG CCGAAGACAA ACTTCTTCAA CAAAATGTTC AGTTGTCAGA CAAGGAGCTC GATATCATTA GGCGATTGGA GAGGGCCGAA AACCCTGATG CCGACTTTGA CCCCTATCAA CCTACTATTG AATGGTTTAC CGGCGAAGGC AAGGAGAGAG TCATGCCGCT TAGTGCGGCG CCTGAGCCCA AGAGGAGATT CGTGCCTTCC AAATGGGAGC ATAAGAAGGT GAATTCACTT TGAAACTGAT AATATGCCAA CGCTGACTCG TTTGACAGAT TATGAAGATC GTCAAGGCTA TCAGAGAGGG CCGAATCATC CCCAACAAAC CTTCCGCCGA AAAACCTCGC TTCTATCCTA TCTGGTCTGA CGCCGACCAG CACAACCCTC ACGTCATGTA TATGCCCGCC CCTCAACTTC CCCCTCCGAA GACGGCCGAA TCCTACAATC CTCCTGAAGA GTACCTTCCT ACCGAGGAAG AGAAGGCTGA GTGGGAGGCG ATGGACAAGG AAGACCGAAA GACCGATTTC TTGCCCGAGA AGTATGATGC CCTTAGAAAA GTTCCTGGCT ACAAGAACTT GGTGCAAGAG AAGTTCGAGA GATGTCTTGA TTTGTACCTC GCCCCTCGAA CTCGACGAGT CAAGCTCAAC ATTGACCCAG ACTCTCTTAT TCCTAAACTT CCCGCTCCCA AGGAGCTCAA ACCCTTCCCT ATCGCTTCTA CTGTCCAGTA CCGTCATCCC GGAGACACTC GTGTCCGATC CGTTTCCACC AGCCCTGATG GTCAGTGGAT TGCTTCTGGC TCAGAAGATG GTGTGGTGCG AGTCTGGGAC CTGGGTAACG GTCGTGAAGT CTGGAGATGG GATTTACACG CTGGTCCTAT TCAGTACGTT GAGTGGTCAC CTTCTCGTGA GGAGTCTTTG CTTGTGGCTC TTGTCGCCGG TAAGATCGCT GTGCTCTCTC CTCTTGCTCT CGTCGCTCCT CATATTGCCG CCCAAACCCT CACCCACTCC AATACCGCTT TTGCTACTAG TTCTGCGACG ACAAAGCAAG GTGCCGGTAA CGAAGTTAAA GGAATTGAGT CTGTCAAATG GACGAGACCG AGTGAGAGGG AAAGAGAAAG GGGTGTCTTG GTGTATGTGG AAGTTCCTGG TACTCCTAAA CAAGTTACAT GGCACAGGAA GGGGGACTAC TTTGCCACGG TTGCATCCGA CGGTTAGTTC AATGTCGTAA AATGGTACAC TACTAATACA AAATAGCCGC TAACAAATCC GTCCTTATCC ACCAACTCTC CCGCCACGGC AGTCAATCCC CCTTCCGTAA GACTCCTGGC ACAATCCAGC GCGTTGCATT CCATCCTTCT AAGCCTCATT TCTTTGCTGC CACTCAACGT TACATCCGTC TTTACGACCT TGCTGCTCAA AAGCTCATTA GAACTTTACA GTCTGGTGTC AAATGGATTT CATCCATGGA TGTGCACTCC GGAGGTGACA ATTTAATTAT CGGTAGTTAC GATAAGAAAT TAGCTTGGTT CGACATGGAT TTGAGCGCAA AGCCTTATAA AACCTTAAGG TGGGTCTATT CAGTTTCTTT TTGTGGCAAT GTTGACAGAA GATCACAGAT ACCACAACCG TGCTCTTCGA TCCGTTGCCT ATCACCCTAC TCTCCCTCTC TTCGCCTCTG CCTCAGACGA TGGCACAGTC CACATTTTCC ACTGCACCGT TTACACTGAT CTCATGCAAA ACCCGCTCAT TGTTCCTCTG AAGATCTTGA GGGGGCATAA AGTAATCGAT GGTATCGGAG TTTTGGATTT GAGATGGGTG CCTGGAAAAC CGTGGTTGGT CAGCTCCGGT GCGGATGGAG AGGTTAGGCT TTGGTGTTCG TAGAATGTTT AGATAACGGG TTAAGAGATC TCTGGAG
|
Protein sequence | MAPQPLKVGT SKLSKMVKSA PKTAGKAKKR AVEQVSEESD EEFGDQGSGI DMSDDEEEVD GDDEEDEDEA FPEFDSELED NDQEEASDEE QDEQDTSDES EIVEEDSDSE SGYNTSDIER MYASDDDLSS EENKDLPVDE KLSRLIAKNT VKPDDSIGTD DKISRAKEGV GRLVPSKHVK GSFVREYDEY EAGYGSESST EDNPNTVGNI PMEWYDDLPH IGYDVNGRKI FRPLQGDELD KFLANVEDPS AWTSAEDKLL QQNVQLSDKE LDIIRRLERA ENPDADFDPY QPTIEWFTGE GKERVMPLSA APEPKRRFVP SKWEHKKIMK IVKAIREGRI IPNKPSAEKP RFYPIWSDAD QHNPHVMYMP APQLPPPKTA ESYNPPEEYL PTEEEKAEWE AMDKEDRKTD FLPEKYDALR KVPGYKNLVQ EKFERCLDLY LAPRTRRVKL NIDPDSLIPK LPAPKELKPF PIASTVQYRH PGDTRVRSVS TSPDGQWIAS GSEDGVVRVW DLGNGREVWR WDLHAGPIQY VEWSPSREES LLVALVAGKI AVLSPLALVA PHIAAQTLTH SNTAFATSSA TTKQGAGNEV KGIESVKWTR PSERERERGV LVYVEVPGTP KQVTWHRKGD YFATVASDAA NKSVLIHQLS RHGSQSPFRK TPGTIQRVAF HPSKPHFFAA TQRYIRLYDL AAQKLIRTLQ SGVKWISSMD VHSGGDNLII GSYDKKLAWF DMDLSAKPYK TLRYHNRALR SVAYHPTLPL FASASDDGTV HIFHCTVYTD LMQNPLIVPL KILRGHKVID GIGVLDLRWV PGKPWLVSSG ADGEVRLWCS
|
| |