Gene CNA02920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA02920 
Symbol 
ID3253435 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp767259 
End bp771192 
Gene Length3934 bp 
Protein Length911 aa 
Translation table 
GC content47% 
IMG OID638252623 
Productprotein-vacuolar targeting-related protein, putative 
Protein accessionXP_566706 
Protein GI58258587 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATTCATACG ACTGGCACTC CGTAAGACAT GGGCAAAGAA GATTCACAAC GTTCTGCCAA 
CCATCCCGTA ACAGTCACCC CCCACATATC TTCCACGCAC GATCTTATCC AGCTTCCGCT
ACCTAACACA ATCTCGCACA ATAATGCCCG TTCCGGCGCC TCGTTTGCAG TCCGTGTGAC
GATAGGGACC CTTATCCTTC TCTGGACCCT CTTCAATATT GACGTTGTAA CATCTCCCTT
CAGACAAAAC CCTTCTATAG ACTCTCTCGA TGACTCTTCA GCGGCTGGTT ATCTCCGTGG
TATCGCGACA AAGAGTATGA AATGGGCATT GCCGGCACCG AACCATCAAC TTCACAGTAC
ATATGGTGAT CTGGAAAGTA CGGAGATAGG AGAAGAAGGC ATCAAGATGT CCCTTATTCC
TCCAAAGTTG GCCGAAAAGA TCTTCTTAGA TGTTCCAAGC AACGATAGCG TCGCTGCGTA
AGTCATCTGC CTAAGAAAAC TAACCCAACC TAATGAAGTA AATTTAGAGC TTCGAAGAGG
TATACTGGAT ATGCCCATCC TGCAGGTTCT GGATACGACT ACGCCTCCGC TCTTACATTG
AAGAATGAAT GGGAAAAGGA GCTGGGCTTG CGAGTGTCAG GTCCACAGGA GTTTATCTAC
GATGCCGGAA GTCCAGAAAG CCAAGCTCGA GTGAAGAACG GCATGGACAA ACTTGGTGTT
TGGATTGATA CGGTAAATGA ATTCTGTTCA ATTTTTCCAT CTGAATGTGC GTGAAGCTCA
TCTGGTGACA GTACTATCCG GTCATGAACA CTCCCGTCTA TGCTGCAGCT ACGCTTCTAA
CGGATCCTCC GTTCCATGCC AAGCTCCGCG AAGACATTGT AGACGGAGAT TCTGATTCCG
AGCTTCGAGA TGAGGTTCCC GTTTTCCATG GATTGTCAGT CAGCGGGGAT GTCAGGGGGA
ATTTTGTGTA TGTCGGGTAT GGCCGCAAAA AAGATTTTGA TCTTCTCAAA CAGAGAGGTA
GGCTTCTTAT CTATACTATG CAGCTCCAAA TCGTGGTGTT AACATCAGAT GAGGGGTTGA
TATCGACGGT AAGATCGTTT TAGCCAAGTA TGGAGGCTGT TTCAGAGGTT TGAAAGTCAA
AGGTGAGGTT TTTATGTGGT GTTGGCAACT ATCAGACGTT GAAACATAAG CTCTTAGCGG
CTCAAGAAGC CGGTGCTACT GGAGTGATCA TCTTTACTGA TCCAGGAAAT GATGGCGAGA
TTACCGAAGA AAATGGCTAT GAGGTCTATC CTAAGGGGCC AGCCAGGCAA GTGAGTGCTT
CGGTCAGGTC GATGTAATAC AAGCTTGATA TAAACTCATA CCATTGTGCT GTTTTTAGCC
TAGTAGCGTG CAAAGGGGCA GTGTGCAATT CGTAATTATG CTTTCATTCT TCTCTGTTCT
TGTAAAGGTC ACTTCTGACA CCTTTTCACA GCTCTCCAAG TACCCTGGTG ACCCTAGTAC
CCCAGGGGAA CCAGCTTACA AGAACGCTTC CCGCCTCGAG GGGGGTAATC AACCATCTAT
ACCATCGTGG GTACCGACTT CGCTGGCCTT TCTGTGATTG CCAGCTTCTA ACATTGCCTA
GGATCCCCAT GTCTTATGAG GACGTCATTC CGTTCCTCAA AGCACTTGAG GGCAAGGGTA
TACATGCTTC TGACTTGGGG CCAGATTGGG TTGGAGGTTT GGGTTATCAT GGCGTTGATT
ACTACATTGG TCCCAGTGAT GTGGATCTGC ATTTGGTCAA CGAAGTCAAT ACCAGAGTTA
TGCCCATTTG GAGTAAGTAG GACAAAAGCA ACTGCGCATC CGCTTACCAC ACCGTTTAGA
TACAATGGCT GTCATCCCAG GCCACATAAC GGACGAAGTG ATCATTCTTG GTAATCACCG
AGACGGTGAG TTTACCAACG TGCATCATGT GTTAACCAGC TAATGATCTC TGACCGATTA
GCATGGGTGC TTGGAGCCTC CGATCCTAAC TCAGGTACGG CCTCTCAATT CGAAGTCATA
CGTGGCCTAG GTACCCTTTT GAGGAAAGGT TGGAAGCCCC TCAGGACAAT TATGCTTGCC
AGTTGGGATG CGGAAGAATA TGGGCTAATC GGTAGTACCG AGTGGGCTGA AGACTTCGGA
GACTGGCTGC AGACGAACGG TGAGTTTACG AGTGTATCGG ACGCATAACC ATTCAGATGA
CCTGATCATC CTGAAGCTGC TGCTTATCTC AACATGGACA GCTCTGCATC CGGAAGCAAC
TTCCACGCAT CCGCCTCTCC TGTGAGTTGA TGTTTCAACT GTGTCTCCGT CAATCTAACT
GACCCATTTT GTCTTTGTAG TCCCTCGCCC TTCTCGTCCG ATCAGCGGCT GAAGAAGTAG
AATCAAGCTC AAGCCCTTCT AAATCAGTGT TCGACACCAG ATTTGACGCC GGCAATTGGG
AACAATTCAA CATGGAAAAG CTTGGGAACC ATGTTGGCCT GGGGGTTCCA CTGTCGGAAA
AGAAAGGGTC TGGGATCGGC GCTTTAGGGT AAGCTATGTT GGGGTGCTTC TTATATGACA
TGGGAAAACT CATGCCACCT AGTTCTGGAT CTGACTTTAC TCCCTTTCTT CAACGATACG
GTGTATGTTA ACTGAGACAG GGCAAATGGA ACCATGGCTG ACTTCAATGA CAGATTGCTT
CGAGTGAGCT TGGGCATAAG GGTGGTCCTA AAGACGCTGT GTATCATTAT CACAGCATTT
ACGATTCATT CACTTGGCAA AAGAAGTATG CCATCACCTA CATACTGCAT TGCGCATACT
GACGCAATGG ATTTAGGTTT GGAGATGTCG GTTTCCATCG ACATACTGAT GCAGCCAAGG
TCATTGGCTT ATTGCGTAAG TAACGGTCAG GTGGAAGTGC TGGTAGATAA GGACTGACGG
CTGTATTGTG AAAGTTTTGC GTCTGGCTGA TGGACTGATC TTACCTCTCA ATACCACCCA
GTACACCCGC GATCTCGAGT ACTATCTTGA AAAGTACGTC ACGTTGGACC TTGTGCAATC
GTCTGACCAT TTTCAGGGTC CAGGATGTCA GCAAAATTGA TCTGAGGACT TTGGAAATTG
ATTTTGAACC GCTTGCTGAT GCTATCGAAG CTGCCCAGAC TGCTTCAGGT GAACTGGACA
AGCAACGGCA TAAGGCACTA AAGAAGCTTC ATAAGTTGAT AGGAAAACCC GCGCATGGCA
AGTACGGTAT CTTCAAGACA ATGTTACGTG GATGTGGGTG GAGAGTTGAA GAGGGTAAAG
AAGTCGATTA CAGTTTGCAG ACCTGGGAGG AAGGTCCAAA AGAGGGAATC TCCTCCTTTC
CGCACCCTAG GCTTCCTTTC CCATTACCTA CTCCAGGAAG GATTCGTGAA ATTAAGAAGG
TCTTGAAAGA GATTAGGATC ATCAATAGGA AGCTGCAGAA CTTTGAGTCC GGTTTCCTTT
CACAAGATGG TTTAAAGGTT GGTGTGCCAA TTTGGCCTTT CTTATCACCA AACTGAGCCG
AAGTCTTTAC AGGATCGAGA ATGGTATAAG CATAAGGGAA CGGCCCCAGG TCTTTGGCTT
GGCTACGGTG CCACAACTGT AGGTGTTGAA TCAATGTGCT GTTGAGTTAT TGCTGAAGTC
TCACCATCAT AGTTTCCTGC TCTCACTGAA GGTCAGTACC AAAAGTCTTC CAACACAATA
ACCCTCTTAT TGATGCCTAT AGTTATAGCT ATTACTATTG ACCACTCCCC TAAGTTGGCG
CAGAAAGAAG TCCATGAGCT AGCGAAAATG ATAAACAAGA TTGCTAAGTA CCTTGTGGCT
TGAATGGGAT CAGCTGTTTT TTGCTCTTAA TATGTTAGCT ATATGTTACC GCCGAATAAA
TCACTTTACA GATAAATGAA GGCCTAGTAG CCTC
 
Protein sequence
MGKEDSQRSA NHPVTVTPHI SSTHDLIQLP LPNTISHNNA RSGASFAVRV TIGTLILLWT 
LFNIDVVTSP FRQNPSIDSL DDSSAAGYLR GIATKSMKWA LPAPNHQLHS TYGDLESTEI
GEEGIKMSLI PPKLAEKIFL DVPSNDSVAA ASKRYTGYAH PAGSGYDYAS ALTLKNEWEK
ELGLRVSGPQ EFIYDAGSPE SQARVKNGMD KLGVWIDTYY PVMNTPVYAA ATLLTDPPFH
AKLREDIVDG DSDSELRDEV PVFHGLSVSG DVRGNFVYVG YGRKKDFDLL KQRGVDIDGK
IVLAKYGGCF RGLKVKAAQE AGATGVIIFT DPGNDGEITE ENGYEVYPKG PARQLSKYPG
DPSTPGEPAY KNASRLEGGN QPSIPSIPMS YEDVIPFLKA LEGKGIHASD LGPDWVGGLG
YHGVDYYIGP SDVDLHLVNE VNTRVMPIWN TMAVIPGHIT DEVIILGNHR DAWVLGASDP
NSGTASQFEV IRGLGTLLRK GWKPLRTIML ASWDAEEYGL IGSTEWAEDF GDWLQTNAAA
YLNMDSSASG SNFHASASPS LALLVRSAAE EVESSSSPSK SVFDTRFDAG NWEQFNMEKL
GNHVGLGVPL SEKKGSGIGA LGSGSDFTPF LQRYGIASSE LGHKGGPKDA VYHYHSIYDS
FTWQKKFGDV GFHRHTDAAK VIGLLLLRLA DGLILPLNTT QYTRDLEYYL EKVQDVSKID
LRTLEIDFEP LADAIEAAQT ASGELDKQRH KALKKLHKLI GKPAHGKYGI FKTMLRGCGW
RVEEGKEVDY SLQTWEEGPK EGISSFPHPR LPFPLPTPGR IREIKKVLKE IRIINRKLQN
FESGFLSQDG LKDREWYKHK GTAPGLWLGY GATTFPALTE AITIDHSPKL AQKEVHELAK
MINKIAKYLV A