Gene Information |
Locus tag | CNH02580 |
Symbol | |
ID | 3259099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 393758 |
End bp | 396953 |
Gene Length | 3196 bp |
Protein Length | 1037 aa |
Translation table | |
GC content | 51% |
IMG OID | 638258226 |
Product | phosphoethanolamine N-methyltransferase, putative |
Protein accession | XP_572434 |
Protein GI | 58270556 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGAGG TGTCCCCACA GTCCCAATAT GGTGGAGAAC CGGACAAGCC AACAACAAAA TCGAGCTCCT GGCTACGAAA TCTTTCGCGG TCCAACCTTG CTGCTCTCAT CTTGTTTTAC GTCGCCTCCC TTCACGTCAT TGGGTTGTAC GTCTTCACGC AAGGGTTTCT CCTTTCCCGC TTGGCTATCC CCCATGTCTC TCCCGCCTAT AACGTATCTA ACCCCTCCCC CATATCCGCT ACCCACTCTA AAGCGGTCAT CATTGTCATT GACGCTCTTC GTACCGACTT TATCTCCCCG TACCACCCTC AACCGCCTTC ACCCCATCAT CATGGTGTCC TCTCCCTGCC TGCCGAGCTC ACACAGAGTC GACCGGAACA CTCCCTCATT TTCAACTCCT TTTCTGACCC CCCTACCTCT ACTATGCAGC GCATCAAAGG TATTACCACC GGGTCGTTAC CTACATTCAT CGATATCGGC TCAAACTTTG CGTCAGCTGC CATTGAAGAG GATTCGCTCG TCTCTCAGCT CGTAGCGGCG AACAAGAGGG TTGGGTTTAT GGGTGATGAC ACTTGGATGA ACCTCTTTCC TTCTTCTTTC CATCCGAACA TGTCCCATCC GTACGATTCC TTCAATGTTG AGGATTTACA CACTGTTGAT AACAGCGTCA TCACCCATCT TTTCCCTTAC CTCCATCCTT CCAACCAATC TCAGTGGGAC GTCCTTATCG GTCATTTCCT GGGCGTAGAT CATGTTGGTC ATCGTGTGGG GCCTCATAGG GATACTATGA CCGAGAAACT TACTCAGATG AACGAAGTGC TGGAAAAGGT TGTGGATCTG ATTGATGAAG AGACTCTTCT TGTCGTTCTC GGTGACCACG GGATGGACGA TAAGGGTAAC CACGGCGGTG ACTCGGAGAT GGAGACCTCA TCTGCTCTTT GGCTCTATTC CAAAGGCCCT ATGCTAACCA ATCCCGCGGT CGTCCAAGAC AAAGACACCT CTGCCATTTT TAAATCTTTG CCTACCTACA TTTTCCCGAA ATCCACCATG CCTCTCAGGC AGATTAACCA AATCGATATT GTTCCTACTC TCGCACTTTT GCTCGGCGTC CCTATCCCTT ATAACAACCT TGGGTCAGTC ATCCCCGAGT GTTTCACAAA TAATCTGGAA ACTTTGGAGG TGGCGCAAAG GGTGACTGCA GACGGTATAT GGAGATACGT CGAAGCGTAC GGAGACAAGG AAGTCAAGGA AAACTTGAAC AGCGCTTGGC GTCACGCCCA ATCCCAATAT CAAAAGGACA ACCTGTCCGC CTCCATCATC GCTCATCGTG CTTTCTCCCT CGACGCTCTC GTCCACCTCC GATCACTCTG GGCTCAATTC TCCATGCCTC TCATCGTCAT TGGTTCTCTC ATCCTCGGCC TTTGCGCGCT TACCCTCATC GCACTTTACG TCGGTGTGCG TAACAATGGT GTGAATTGGG ACGTTTATGC GCGATTGGCT TTGGAAACGG CTACGATGGG ATCCACTGTC CTCGCGAGTA TTGCAGGAAC CGTAGCCGGG GTGTACACCG CCCGGCCTAT CGTGGCTATC AAAGTTTTCA TCGTTGCCGC AGCCTTGATC TCCGAGGTAA TCCTCATCCT TCCCCTATTC GTCAAATCCT CTCTCTCTCT CGCTCTCCCT CTCCCGACTT CATTCTCCAT CAACAGACAT ATCGGCCCCT TCATTCTTAT TGTCCATGCG TTGTCATTCG CTTCCAATTC ATTCATCATG TGGGAAGACC GGGTGGTTCT CTATCTCATA 
TCTACTATCC CTATCATCTA TATCATCCGA GCACTTTCCG CTCCTACAGC CGATATGCGT CTCAAAATCA TCTTCCTCTC TCTCGCTTTC ACCATCCTCT CTCGTCTGGC TGGAACGATC ACCATATGCA GGGAAGAACA ACAGCCATAC TGCAGCGTCA CTTTCTTCTC AGGCGTGACC GCTACCGCCC CTACATGGGC ACTCATCGCT ATAGTCCCGC TTGCCCTCCA AATCCCTCGT GCCATCGGTA TCACCCTCTC CCGCTCCAAA TCCCTCGCTG GGCCTGCACC TTTTATACTC GGTATCCTTT GGCGTGCCGT GCTCGTCGCC AACTCTGTCT ACTGGGTCCT CGAGTTTTTC GAATCGTTCG ACGGGCTCAA TCCTGCCCGA ATACCGCTGG TCAATTTCCT CAAACTCTGG CTCGCTCGAT GTTCTGCAGG CGCGAGTTTA GGTGCTATCC CATATGTATG GCTCACTTCC CCCCTGTGTA TCTCTGTAGA ACGCACGACG GATCAAGCGA CGGGTGAAGA AGAAGTGACA GTGTTTGGGT TCGCCAATGC CTTTGGGTCG ACGTACATCC TGTATACACT CGCTCCTTTT GCGCTCATAC ATCTAGTGTC GCAGCCAATG GCCCAGTTCA CACTGACCGC CTTTCTTATT GGGCTTCTTG TATACCTCGA ATTGGTCGAT ACGAGACGGG ACGCCATCAT CCTTACAACA TCCTTTGCTT CCTTGGGCAA CAACAAGAAC AACAACAACG CTTCTTCACC AGCGGGACTT GCCTCATTTG ACCCGGCTGA TACAGCGCAA ACCATCGTAC GCCCCTCATT CACAGATGTA GTCCCCATCG CTCTATCTGG GTTCCTCACA TTCTTTGCGA CAGGACATCA AGCCGTCATC TCATCAATTC AATGGAAATC TGCTTTTGTA GGCTTCTCCA CTGCCAGCTA CCTGTTCTCC CCTGTACTCG TCGTCCTCAA CACATGGGGA GGCTTCTTCT TGTCAGCGAT CGCTGTGCCT CTTTTGGCGA TTTGGAATAT CGCCCCCAGA CCACGAGAGA GTATCCCAAC CCTCGCGCAT GCTTTGCAAG TGACGTTGGC GTTCTTGGTG TACCACACCG TGGTCGCGTT CGCCAGTGCA ATCACAGCTG CCTGGTTGAG AAGGCATTTG ATGGTGTGGA AGGTGTTTGC ACCAAGATTT ATGATGGCAG GCGTCACGTT GTTGGTTGTG GATGTAGGGT TGGCGCTGGG ATTGTTTGGA GTGAGAGTGA CCGGGTGGAA GGTCAAAAAG ACGTTTGGAT GCGAGAGTAT ATAAGAATCA AACGAGGAGA ATAATGAAGA AGTTGGCGGT ACATCCAAAT CAAAAGTTGA CAGGAAGTTA CGAAGTTACG ACGTCA
|
Protein sequence | MSEVSPQSQY GGEPDKPTTK SSSWLRNLSR SNLAALILFY VASLHVIGLY VFTQGFLLSR LAIPHVSPAY NVSNPSPISA THSKAVIIVI DALRTDFISP YHPQPPSPHH HGVLSLPAEL TQSRPEHSLI FNSFSDPPTS TMQRIKGITT GSLPTFIDIG SNFASAAIEE DSLVSQLVAA NKRVGFMGDD TWMNLFPSSF HPNMSHPYDS FNVEDLHTVD NSVITHLFPY LHPSNQSQWD VLIGHFLGVD HVGHRVGPHR DTMTEKLTQM NEVLEKVVDL IDEETLLVVL GDHGMDDKGN HGGDSEMETS SALWLYSKGP MLTNPAVVQD KDTSAIFKSL PTYIFPKSTM PLRQINQIDI VPTLALLLGV PIPYNNLGSV IPECFTNNLE TLEVAQRVTA DGIWRYVEAY GDKEVKENLN SAWRHAQSQY QKDNLSASII AHRAFSLDAL VHLRSLWAQF SMPLIVIGSL ILGLCALTLI ALYVGVRNNG VNWDVYARLA LETATMGSTV LASIAGTVAG VYTARPIVAI KVFIVAAALI SEVILILPLF VKSSLSLALP LPTSFSINRH IGPFILIVHA LSFASNSFIM WEDRVVLYLI STIPIIYIIR ALSAPTADMR LKIIFLSLAF TILSRLAGTI TICREEQQPY CSVTFFSGVT ATAPTWALIA IVPLALQIPR AIGITLSRSK SLAGPAPFIL GILWRAVLVA NSVYWVLEFF ESFDGLNPAR IPLVNFLKLW LARCSAGASL GAIPYVWLTS PLCISVERTT DQATGEEEVT VFGFANAFGS TYILYTLAPF ALIHLVSQPM AQFTLTAFLI GLLVYLELVD TRRDAIILTT SFASLGNNKN NNNASSPAGL ASFDPADTAQ TIVRPSFTDV VPIALSGFLT FFATGHQAVI SSIQWKSAFV GFSTASYLFS PVLVVLNTWG GFFLSAIAVP LLAIWNIAPR PRESIPTLAH ALQVTLAFLV YHTVVAFASA ITAAWLRRHL MVWKVFAPRF MMAGVTLLVV DVGLALGLFG VRVTGWKVKK TFGCESI
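The derived fields in this record (Gene Length, GC content) can be recomputed from the coordinates and the sequence text. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive coordinates, which agrees with the recorded values (393758, 396953, 3196 bp), and that the spaces in the sequence fields are only display separators between 10-character blocks. The helper names join_blocks, gene_length and gc_percent are hypothetical.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace separating the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a block-formatted nucleotide string."""
    seq = join_blocks(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp rows of this record.
    print(gene_length(393758, 396953))  # 3196, matching the Gene Length row

    # First two display blocks of the Gene sequence field, as a short example.
    first_blocks = "ATGTCCGAGG TGTCCCCACA"
    print(join_blocks(first_blocks))
    print(gc_percent(first_blocks))
```

To check the recorded GC content, pass the full Gene sequence field to gc_percent. The result may differ slightly from the recorded 51% depending on rounding and on exactly which bases the database counted.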