Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE03710 |
Symbol | |
ID | 3257686 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | - |
Start bp | 1053781 |
End bp | 1056589 |
Gene Length | 2809 bp |
Protein Length | 750 aa |
Translation table | |
GC content | 52% |
IMG OID | 638256954 |
Product | vacuolar protein sorting-associated protein vps27, putative |
Protein accession | XP_570941 |
Protein GI | 58267570 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCCGTCTGC TGCCAGCCTC TTGCTTCCAT ACGGCACTCC ACCCCGCCCA TATCCGCTGT CTCCATTGCC CGCCACCGCC CTTCTCCTAG TCCCAACAAT TTGCTTCCGC GCCGAGAGTC CTTTGTCGCC GAGCACAAAC CCAACCGCCA TTCCTCGCTG TAATCCAACG TCGCAACCGC CCCTCCTACA TTAATCATCC ATCATGTCAT GGCTATGGGG AAGCACTACA AACCCGCAGT TCGAAGAGCT TGCTGGTTCG TACTTTACTG TTATGTTGTT ATCTATCGCT CACAGCCCTA TCGCAGAGAA GGCGTGTTCT CCTCTCAACC TCCCGTACCC ACAATCAGAA GACATCGCGA CCGCCTTGGA GGTTGCAGAT ATGATCCGCT CAAAGGCTAT ACAGCCTAAA ATGGCAATGC AAAGCTTGAA GAAGAGGATA GCCAGTAAGA ACGGGAGAGT ACAGATGTAT GCCATCGGAG TAAGCGATAT TTATTCGTGC ACTCGCTGAC GGATTGAAGC TAACCGACCG TGTGCAGCTT ACAGACACCT GTATCAAGAA TGGAGGAGAT CATTTCTTGC TAGAAGTAGC TAGCAAAGAA TTTGTGGATG AGTTGTCAAA CCTCATTAAA GCAACAGTAA GCAACAGTTA CTTTTGTCCC ATTTGTTCAT TACTAAGCTC TATCATAGAC GACTAGCCCG GAAGTCAAGC AGATGCTTAT CAAATACTTT CAACAATGGG CCCTTGCTTT CAAATCCAAG TCAGAGCTAT CCTTCTTTGT GGAAGTCTAT AATGAGCTCA GGGCTTCTGG CAAGTCATCT GGTTGCCTTA TAACATGTTC CGCCCTGACA TCCGTATAGG AATCACTTTC CCACCGCCAC CCGCTCCTGT CCCTTCTCAT CTCTTGACAA CAACCACTGC TCCTGCATGG GTCGACTCTG ATGCGTGCAT GCGCTGCCGC TCTGCCTTCA CATTTACTAA CCGTAAACAC CACTGTCGCA ACTGTGGTCT CGTATTCGAC CAAGCGTGCT CGAGCCACAG CATGCCTTTA CCCAAATATG GAATCACGGA AGAGGTCCGA GTGTGCGACG GCTGCTGGGC TAAAGCTGGG AGGAACAAGG CTGATGCTCC TGCCCCCGCT GTCCCCGGGC GTACGCCAAG GTCTAGAGCG GATCTTGATG CTGATCTTCA ACGAGCCATC GAACTCTCTC TCGCGGAATC TCAACATAGT CAGAACCGCC ACCACAGCCA TTTCACGCCT TCCGAGCCTC CTCTGGCGCA TGGCACTGTT GAAGATGAGG ATGAGCAAAT GCGCCTTGCG ATTGAGGCTT CTCTTCGTGA TATGGAGGCT CGTCCATCAG CTCCAGCGGG CCTCGGTGAA GCGCCAGAAC CAGAATACAG ACCTCTGCCA ACATTCGACC TTTCCCCAAG AGAGAATGAG ACGATCTTGA CGTTTAGCAA CACAATGGAT CAGATGGCGG CGTACGGTGA GCGAGATTTG AGAAGGTTTC CACATGCACA TGTGTTGGCG GAGCAGGCTA ATACAGTGGG CGGGAGGTTG AGGAGGAATG TGGAGGAAAA AAGCACAAAA CAACGTAAGC GCCTGTTATG CCTCAAGAGA GTTGCGTTAA CAATACCACA GAAATGTTGA TGGAAATGCA AGATAAACTG TCCCAAGCTG TCAACCTCTA CGGTCAAATT CTGGATGGAC AACAGGCTTA TGCTGCTAAA CGGGCGCACG AAGAGCAAGC GAGGAGGTAT CAACAACAGC AGAGCTACTA CACCCAACAG TACCAGCCGC AACCGCAGCT GTATGGCCAA TACCCTCCTA ACGGCTACCA AGCATTCGTG CCTCCTCAAC AAGCCTACCA ACCCCCTCAG CCTCAGCCCG AAGCCCAAGC TCAACACGCA CCCTCCCTTT ACCCTACGAT GCCTTATACT ACTCCCAATT TTACTTCTCC TCCCCAGGAA CGGGTTTACC CTCAACAATC CCATTCATCG CCTTACAGCC AATGGTCTCC CGCGCCATCG CATGTTCAGC CGGGATTAGC AAGACAAGCG TCGGTCGTTG TCCCGCCCGT TTCTTCACCC GTTCCCGCTG GTGTGCAAAG ACAGGCTTCT ATGACTTATG GTGCACCTAT ACCTGTAGCC GAGCAATCTC AACGACAACA GCAGCAATAT GCCTCTGCTC CTCCGTTCGC ATCTGGAGCG GCACCCGTCG ACATACCTTC CGCTCCTCCA CCTGTTAATC TCTCCACCCA CCCCAACTCG CCCCAGCGAC ACTCTTACAT CCCTTCCCAT CCCCAAACCC AAACCCAAAC CCAGTATGAA TCTCAACCAC AAGAAATCCC GTCACAGCAA GATATGCAGT ATGGCGCGTC GGCTCCGCCG CCAGACTCTT TAGGTTCGTA TGTGTCGGAA GGAACGGTAG GAAGCGCCAA GTCGGGGCTT GAACAGGAGC ATGCGGCTTC ACAGATTCAA CCCCAACCTC AGCCCCAGGC TTCCGCACAA ACTCAAACAC AGTCTCAATC CCAATTGCAG GCTCAACCCC AGCAAAACCA ATACGCTGCT CAAACACAGC TCCCCGCGGG AATGTATAAC GCTGCCTCTT TCCCTCAACC GTTACCCCCA ACCATCTTCC CAGATGCGCC CGTAGAAGCA CCCAAAGGTT TGGAAAAGGA GGAGAAAGAA GAAGCTTTGT TGATTGAGCT TTGATCTAAT GCAAGAGGAG CGTGATGTTG TCTTCAGAGG TTGTATTATA TTTTTATTTT TTTCCGTGGC CCTAGCAATG CATTACTCCG TGTGTAGGT
|
Protein sequence | MSWLWGSTTN PQFEELAEKA CSPLNLPYPQ SEDIATALEV ADMIRSKAIQ PKMAMQSLKK RIASKNGRVQ MYAIGLTDTC IKNGGDHFLL EVASKEFVDE LSNLIKATTT SPEVKQMLIK YFQQWALAFK SKSELSFFVE VYNELRASGI TFPPPPAPVP SHLLTTTTAP AWVDSDACMR CRSAFTFTNR KHHCRNCGLV FDQACSSHSM PLPKYGITEE VRVCDGCWAK AGRNKADAPA PAVPGRTPRS RADLDADLQR AIELSLAESQ HSQNRHHSHF TPSEPPLAHG TVEDEDEQMR LAIEASLRDM EARPSAPAGL GEAPEPEYRP LPTFDLSPRE NETILTFSNT MDQMAAYGER DLRRFPHAHV LAEQANTVGG RLRRNVEEKS TKQQMLMEMQ DKLSQAVNLY GQILDGQQAY AAKRAHEEQA RRYQQQQSYY TQQYQPQPQL YGQYPPNGYQ AFVPPQQAYQ PPQPQPEAQA QHAPSLYPTM PYTTPNFTSP PQERVYPQQS HSSPYSQWSP APSHVQPGLA RQASVVVPPV SSPVPAGVQR QASMTYGAPI PVAEQSQRQQ QQYASAPPFA SGAAPVDIPS APPPVNLSTH PNSPQRHSYI PSHPQTQTQT QYESQPQEIP SQQDMQYGAS APPPDSLGSY VSEGTVGSAK SGLEQEHAAS QIQPQPQPQA SAQTQTQSQS QLQAQPQQNQ YAAQTQLPAG MYNAASFPQP LPPTIFPDAP VEAPKGLEKE EKEEALLIEL
|
| |