Gene CNK02680 details


Gene Information

Locus tag: CNK02680
Symbol:
ID: 3254574
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 778557
End bp: 781408
Gene Length: 2852 bp
Protein Length: 647 aa
Translation table:
GC content: 47%
IMG OID: 638253760
Product: cytoplasm protein, putative
Protein accession: XP_567867
Protein GI: 58260914
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
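
The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length). The helper names `span_length` and `min_coding_nt` are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_coding_nt(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


# Values copied from the record above.
gene_len = span_length(778557, 781408)
coding_nt = min_coding_nt(647)

# The record lists the gene as spliced, so the difference would be
# non-coding sequence (introns and any untranslated flanks) within the span.
print(gene_len, coding_nt, gene_len - coding_nt)
```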


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.980721
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TACCCTATAT TCATCGTTCA GAATATCGCC TATATTACCA AAAATCACCC TCAGACTCAC 
ACGGAAAGTC TGTTCCAAGA TGGCAATGTC GTCCAGCAAG CTTGCGGACT TGAGACAATT
GATGAAGGAG CAAGGCGTTG ATGCTTAGTG CGTACAATGG ATCTATCACC TTAACGGCAC
TGATAAGATG CAGTGTGGTA CCTTCGGAAG ACGCTCGTAA GGCATATACA CCTACATCTC
TAGCTGGTTA TGGATTGACT GAGCCTTTCT GATTGTTATT TCTTGTCAGA CGCTTCCGAA
TATCTGGCAC CATGTGACGC ACGACGGGCC TATATTACAG GATTCACCGG GTCTGCCGGC
TGTGCAGTTA TAACCCATGA CAAAGCGCTT TGCTGGACTG ATGGCAGATA CTGGCTTCAA
GCAGAGAAGC AACTCGGTGA AGGGTAAGTA GACCCTAAAT GCAGAGTGGT CACATACTGA
TCTGTTTCAG ATGGGCATTG ATGAAGAGTG GGCTGCCTGA AGTTCCTACT TGGGCCCAAT
GGCTTAGCAC AGTAGATATC AATCTTCAAT TATCTGAGAA TCCATCTAAC AACGTTATCA
GGAAGTTTCA CCCAATTCCT TGATCGGTAT TGATCCCACC GTCATTCCCT ACTCTGAAGC
ACTCTCACTC CTCTCTTCCC TTCCCTCATT GTCTCCTGCC CCAAGTGCAG CATCTCCTTC
AAGACTCATC GCTACTCCAA ACTTGATCGA TTCCCTTTGG GTGCCTCCTT CCCGTCCCCT
TCGACCTTCT CAACCCATAT TCCATCTGGC CGATAGGTAC ACTGGGGAAC CCGTCTCTTC
CAAGTTGAGG CGACTGAGGG ACAAGCTTAT AAGGATAGGG AGTCCCGGTA CAGTTGTAGC
ATCGCTTGAT GAGATCGCTT GGGTGTTCAA TTTGAGAGGA GCGGACATTC CTTATAACCC
TGTAAGCCTC GGTCGTGACA AATGAAATTT CAGTTGCTGA AAGAGGGGCT TAGGTATTCT
TTGCGTATAC CATCATCACT CCGGATGATT GTACCCTCTT TGTCTCGCCT TCCTCTCTCA
CCATTGAGGT TCGATCCTAT CTCCACTCCA ATGGAATAGC CGTTCTTGAC TATTCTCATG
TGTGGACTTC ACTTGAAGCT TGGAAGAAGA GGGTCAAGTT TGACCAAGAG AATAAAAGCA
GGGAGCAAAG AGATGGTGTG AAGCGGGCAA GGCTCGAGGA GGAGGCAAAG AAAGAGGAAG
AAGGAGAAAG GCTGAAGAAA ACAGACAAGA TCTTAATTGG AAACAAGACG AGTTGGGCTG
TTGCCAAAGC GGTTGGAGAG GTAAGGCATG TGGATACATA TGCGTCAAAC AAATCTAATT
ATTGAGCAGG ATAATGTGGA AGTACGACGA TCTCTAATTG AGGAGATGAA AGCCAAGAAA
AACGCGGTAT GTCTTTGGCT CTTTCTTAGT ATTCGATATT TGCTAACATG CATGGATTCT
TGCAGACTGA AATTGAAGGC TTTCGCCAAT GTCATATACG TGACGGGGCC GCCCTTGTGC
GATATCTTGC TTGGCTGGAA GAAGCGCTTG AGAATGGAGA AAGCTGGACG GAGTATGATG
CAGCGACCAA GCTTGAAGAT TTCCGCAAGT GAGTTTACCG TCCCTCATTT CAAAGTCTTG
TAAGCTGACT GTTCAATAGG GAAAACAAAC TTTTCATGGG ACTTTCATTT GAAACCATCT
CGTCTACTGG TGCAAATGCC GCCGTCATTC ATTACTCTCC GCCCGCAGAG GGGAGTAAGG
TGATTGAAAA AAAGCAAATG TACTTGTGTG ACTCTGGCGG TCAGTACTGC TGGTATTCAA
TGGAGACCAA CGAGCTGATT GACCTTTTAA ACATAGCCCA GTACTTGGAT GGGACCACAG
ACGTAACTCG AACACTTGTA GGTCCAAGCG CGTCGACGTC AATCGACGTC TTTCAGGGAA
GCTAATGCGT ATGTAGCACT TTGGCACACC CAACGAGGAC CAAAAGCGTG CATTCACCCG
AGTGGTGAGT TGCATCAAAT GATAGGTGCC GAATTAGTAG CTGATCCATG TACATTAGTT
ACAAGGACAC ATTTCCTTAG ATACTATCGT TTTCCCTCAG GGTACAACTG GTAAGTCAAT
TTCTCTTGTT GTGATGATCT GAGCTGATTA CTTGCTTAGG CTATATTCTG TAAGGGATTA
CAACACGATG ACCGGGAAGG TGCTGATGCC TGTTTAGAGA TGTACTCGCC CGTCGAGCTC
TTTGGAGTGA AGGACTGGAC TACCGGTATG TGCCTTTTTA CAAAGCATCG TTATGCAACT
TATGTATTTG AATCTTGACA TTAGCCATTC AACATCCCAC GGCATTGGTT CTTTCCTCAA
TGTCCACGAA GGCCCTCAAG GTATAGGCCA ACGACCGGCG TACAATGAAG TGCCTTTACA
AGAGGGTATG GTTATCTCGA ATGAACCCGG CTATTATAAA GATGGTGAAT GGGGGATTCG
AATCGAAGGG GTGGACGTCA TCGAGAGAAG GGAGACGAGG GAGAATTTCG GTGGTAAAGG
GTGGTTGGGA TTTGAAAGAA TCACCATGGT GAGTTATGCG AACTACTTGA TGTGCCAGCG
CTCATTTTCC CATCTTTTCA AGTGTCCTAT CCAGACAAAA CTTGTGGATT CTTCGCTGCT
CACCATCGAA GAGAAAGACT GGCTCAATGA ATATCACGCA GAAGTCCTCG CAAAACTAGC
GCCGGTGTTG AAAGAGATGG GAGACGAAAG AGCAGGTAAA TGGCTGGAAA GAGAGTGCCA
ACCTCTGTAA GAGGGGTTTT TTTTGGACGC GA
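
The sequence is displayed in space-separated groups of ten bases, so the grouping has to be stripped before the sequence is used. A minimal sketch of that cleanup follows, with a simple GC-fraction check that could be compared against the reported GC content. The function names are hypothetical, and the sketch assumes an unambiguous A/C/G/T alphabet.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Demonstration on the final row of the gene sequence block only;
# pass the full block to check the record's length and GC content fields.
last_row = "ACCTCTGTAA GAGGGGTTTT TTTTGGACGC GA"
cleaned = clean_sequence(last_row)
print(len(cleaned), round(gc_fraction(cleaned), 3))
```

Running `clean_sequence` on the whole block and taking `len()` of the result gives a direct check against the Gene Length field.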
 
Protein sequence
MAMSSSKLAD LRQLMKEQGV DAYVVPSEDA HASEYLAPCD ARRAYITGFT GSAGCAVITH 
DKALCWTDGR YWLQAEKQLG EGWALMKSGL PEVPTWAQWL STEVSPNSLI GIDPTVIPYS
EALSLLSSLP SLSPAPSAAS PSRLIATPNL IDSLWVPPSR PLRPSQPIFH LADRYTGEPV
SSKLRRLRDK LIRIGSPGTV VASLDEIAWV FNLRGADIPY NPVFFAYTII TPDDCTLFVS
PSSLTIEVRS YLHSNGIAVL DYSHVWTSLE AWKKRVKFDQ ENKSREQRDG VKRARLEEEA
KKEEEGERLK KTDKILIGNK TSWAVAKAVG EDNVEVRRSL IEEMKAKKNA TEIEGFRQCH
IRDGAALVRY LAWLEEALEN GESWTEYDAA TKLEDFRKEN KLFMGLSFET ISSTGANAAV
IHYSPPAEGS KVIEKKQMYL CDSGAQYLDG TTDVTRTLHF GTPNEDQKRA FTRVLQGHIS
LDTIVFPQGT TGYILDVLAR RALWSEGLDY RHSTSHGIGS FLNVHEGPQG IGQRPAYNEV
PLQEGMVISN EPGYYKDGEW GIRIEGVDVI ERRETRENFG GKGWLGFERI TMCPIQTKLV
DSSLLTIEEK DWLNEYHAEV LAKLAPVLKE MGDERAGKWL ERECQPL