Gene CNI00800 details

Gene Information

Locus tag: CNI00800
Symbol:
ID: 3259442
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 192827
End bp: 195791
Gene Length: 2965 bp
Protein Length: 750 aa
Translation table:
GC content: 48%
IMG OID: 638258565
Product: peptidase, putative
Protein accession: XP_572743
Protein GI: 58271174
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
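
The gene length listed above is consistent with the start and end positions if both are treated as 1-based, inclusive coordinates. A minimal sketch of that arithmetic follows. The function name gene_length is hypothetical, and the inclusive-coordinate convention is an assumption inferred from the values in this record rather than something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
length = gene_length(192827, 195791)
print(length)  # 2965, matching the listed gene length

# A 750 aa protein needs 750 codons plus one stop codon.
coding_nt = (750 + 1) * 3
print(coding_nt)           # 2253
print(length - coding_nt)  # 712 bp not accounted for by coding sequence
```

The 712 bp difference is what one would expect for a spliced gene, where the genomic span also covers introns and any untranslated sequence included in the record.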


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCCCGTAACA CCATGAGAGC CACCGTACTC ACCCTCCTTG GCCTATCAGC CTCGGCATGG 
GCTACTCCAG CGCCATTCAC CGTAGAAGAC ATGCTTGCTG CCCCTAGGCC TTTCCCGGCT
ATTGCCAGTC CTGACAAGCA GCACGCCATC GCTGTCGTCG ACTACTGGGA ACCTAGGGAT
GACTCGTACG TTTTCAGGAG TTTGCCCATG AAATCAAGCT AAATTACAAT GTAGTATGAG
GAGAGAAGCA TACCTGGCGA CGCTCAACAG GCCCGAAGTG AAGCATCCCA TCTCCTTGTT
CAATACCACT CCTTCGGCTG CAGCCGATTT CTTCTGGCTT GACGATGTTA CCATCGCATA
CCTCGATGGA TCGAATCTTT TTTCTTACCC GGTTGAATAT GCATTCAGCC AATCCAATTT
TAAGCCTAAA CACAACCCAC CTCGCTCCCC CAGGCATCAA AAGATCCTCT CATTCCCTCA
CGGCGTCAAC CCTACTTCTC TTCAATATGA AGCAAGCACC AAAACCCTCG CGTTTACTGG
CCAAGTGTGG TCGGATGGCT CATTCTACCA GACTCGACAT CACGACAAGC TCTACGAAAA
GAAACGTGAC AGTGCCCAAG TATACGACGA CTTGATGGTC AGGCATTGGG ATACTTGGAG
AGTCAGCGGA AAGGTCTGGA CGCTAGGTGT TGTCAAGCTG ATCAACATCA ACGATGAATG
GGCAGAGCTT GATAATGATA TCAACAAGCA TCACAAACGT CGAGCTGAGT TTATTAACAT
CTTGAACGGT ACCGATTTGG TATCCCAGAC CGACCCTATC GATGCGGGTT CTTACTCTAT
CAGCTCTGAA CACATCGCTG TAGCCGTTAA GCCTCCTTAC CTCCAGACTG CCACACATAC
GAGAGAAGAC ATATATCTTT TCCCTCTTCC TTCCTCTTAC GACTCTGCGT CCATCCTTCC
CAAACACGTT ACTCCGCACG CCCATGGAGC TATCAGCGAA ATCAAGTTCT CACCTGATGG
GAAGAAGCTT TCATGGCTTG AGATGAAGAA GGATGGCTAC GAGAGTGATA GGCGGGTGGT
CGTTGTTTAT GACTTGATGA GTGGGAAGAG TGAAAGATGG ACCGAAGTTT GGGACAGGAG
CCCTAAAAGC ATCTCCGTAC GATAAATTCC CCTCAACTTG ACATCAAAGA TATGCTAACA
GAGCACTGGA TCATGTAGTG GGCGGTCGAC TCTCAATCCA TCTTCCTTTT GGCCGAGTTC
CGAGGACGCA CCCTCCCATA CCACCTCACT CACCCCAACC ACCTCCCGAC TCCCCTTCTC
TTCAACGGTA CAACCGTTTC CCTCACTCCA CTGAACGAGA CCGACATCCT CATCGCCCGT
CAATCATTCC GAACTCCCAC TGTGGAATGG ATATTGACTT TGCCCGACCC TGCCGAGGAT
GGAAATGCAG TTGGAGACGG AGACGGAGAC AAGATACCGG CTGTTGAGCC TTTGAGGCAA
CTCACTCGAT GGAATGAACA TTTCATCCGT GGGAGGTTGG ATGTTCAGAC TGGTGAAGAG
TTTTGGTTCA AGGGTGCTGA AGGCAAGGAT GTGATGGGAT GGGCTTTGAA GCCTCGTGGG
TGGAAGCCTG ACCAGAAGGC CAAGTATCCT CTCGGTGCGT CCTTTTAACG TCGTTTTTTT
TTTGCTTCTG GGAGAGGAAA TCTGATAAGA TATGTACAGC TTTCTTGATT CATGGTGGCC
CTCAATCAGC TTGGGACGAT TCTTGGTCAA CTCGATGGAA CCCTGCCCTG TTTGCCGCCC
AGGGTTACTT TGTCGTCGCC ATTAACCCTA CTGGTTCTAC CGGTTATGGA CAAGAATTTA
CCGATGCTAT CCAGGGCGAT TGGGGAGGAA GTAAGTTTTT TTCGCCAAAA AGCCAAGCTT
GCAAAAGATT TTTTTACTAA ATGCATATTA CAGAGCCTTT CAAGGACCTC CTCGCAGGTT
ACCACTACGT CCTAGAAAAC TACCCTGAAG TAAATCCAAC CTTGCCCCTC CCTGATGCTC
ACTTATTAAC ATTACGTAGA TCGACCCCGA ACGTACTGCC GGCCTTGGAG CTTCTTATGG
TGGCTACATG GTCAACTGGA TCAACGGGCA TAACGACCAC TTTGGTTTCA AAGCTTTAGT
ATGCCATGAC GGCGTGTTCG ACACGGTCAC TACCTTCTTC TCCACGGAAG AGATTTATTT
CCCCACCCAG TGAGCAGCAC TTTCTGGGCG ATAAAATATG GCTAATATCG TATGCTGGTA
GAGACTTTGC TGGTACACCT TGGACGAATA GGGCTACTTA TGAAAAGTGA GTGAGATCAG
TGATGCAAAG AAGATGAAGA AATTACTAAT GTGATATTAT AAGATGGAAC CCTGTGAACC
ACGTTATCGA GTGGAATACT CCGGAACTTG TTATCCAAGG CGGAAAGGGT ACGTGCAGAG
AGGCGTTGAT TTGTCCTTGA CTAATATGGG CTATGTAGAC TACCGTCTGG AGAACTCTCA
AGGTCTTGGT AAGTGAATAG CGCTATATAG TTCACCCTAG ATGCTGATGA ATGGAAATAG
GCGCTTTCAC CGCTCTGCAG CTGTGCGTTT AGACTATGGA TTCAAGATGC AAGACTCTTG
CACTGATATA CTGACGACCA TCTTAGCCAA GGAGTCCCTA GCCGATTCGT CTACTTCCCC
GACGAGAATC ACTGGATTCT CAAACCTCAC AACTCTATCA AGTGGCACGT AAGTCTACCC
ATACTCAAAA TGGTACTGGA TCAGTACTTA CCGGGTGATA TATTAGCACG AGGTGTTCCG
ATGGCTGGAG GAATGGATTG GCAAGCCCAC GGATGAGAGT GAGGCTTTCG TCGTGCAGCG
AGAGTGAAAG GAATAGATCA ACCGATTTTA TGCCTGAATA TTTAGCATAG TTATCATCAA
TACAAATAAT GAAAAGGGAA ACACA
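
The gene sequence is printed in space-separated blocks of ten bases, so it has to be stripped of whitespace before its length or GC content can be checked against the fields in this record. The sketch below shows one way to do that. The helper names clean_sequence and gc_content are hypothetical, and the GC content is assumed to be the plain fraction of G and C bases, which the record does not define explicitly.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq_text: str) -> float:
    """Fraction of G and C bases in a sequence (assumed definition of GC content)."""
    seq = clean_sequence(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "CCCCGTAACA CCATGAGAGC CACCGTACTC ACCCTCCTTG GCCTATCAGC CTCGGCATGG"
print(len(clean_sequence(first_line)))
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence text to the same functions would give values to compare with the listed gene length and GC content.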
 
Protein sequence
MRATVLTLLG LSASAWATPA PFTVEDMLAA PRPFPAIASP DKQHAIAVVD YWEPRDDSMR 
REAYLATLNR PEVKHPISLF NTTPSAAADF FWLDDVTIAY LDGSNLFSYP VEYAFSQSNF
KPKHNPPRSP RHQKILSFPH GVNPTSLQYE ASTKTLAFTG QVWSDGSFYQ TRHHDKLYEK
KRDSAQVYDD LMVRHWDTWR VSGKVWTLGV VKLININDEW AELDNDINKH HKRRAEFINI
LNGTDLVSQT DPIDAGSYSI SSEHIAVAVK PPYLQTATHT REDIYLFPLP SSYDSASILP
KHVTPHAHGA ISEIKFSPDG KKLSWLEMKK DGYESDRRVV VVYDLMSGKS ERWTEVWDRS
PKSISWAVDS QSIFLLAEFR GRTLPYHLTH PNHLPTPLLF NGTTVSLTPL NETDILIARQ
SFRTPTVEWI LTLPDPAEDG NAVGDGDGDK IPAVEPLRQL TRWNEHFIRG RLDVQTGEEF
WFKGAEGKDV MGWALKPRGW KPDQKAKYPL AFLIHGGPQS AWDDSWSTRW NPALFAAQGY
FVVAINPTGS TGYGQEFTDA IQGDWGGKPF KDLLAGYHYV LENYPEIDPE RTAGLGASYG
GYMVNWINGH NDHFGFKALV CHDGVFDTVT TFFSTEEIYF PTQDFAGTPW TNRATYEKWN
PVNHVIEWNT PELVIQGGKD YRLENSQGLG AFTALQLQGV PSRFVYFPDE NHWILKPHNS
IKWHHEVFRW LEEWIGKPTD ESEAFVVQRE