Gene CNK03020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK03020 
Symbol 
ID3254716 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp881120 
End bp884554 
Gene Length3435 bp 
Protein Length797 aa 
Translation table 
GC content47% 
IMG OID638253793 
Productoligopeptide transporter, putative 
Protein accessionXP_567897 
Protein GI58260974 
COG category 
COG ID 
TIGRFAM ID[TIGR00727] small oligopeptide transporter, OPT family
[TIGR00728] oligopeptide transporters, OPT superfamily 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCGGCGGCCA AATCAAAGTC AACTAGTTTT CCATTAGTAA GATGTCGGCA TCCTACAACG 
GGATTGAGGC ACCTATCCCA CGGGATCAGG CAACCGATTT TCCAATCCAA ACATTCTACG
CAAGTACTGA GTCTCCAGAG CTCGATCAAA AGCAGGAGGA CAATGTGGAT CTAGAGAAGA
ATGAGTCTAC AGATAATGTT GAAGTCAAGG TCGAGTCCAT TAAAGACCCC GGAGTGGAGC
TTACACCCAA TGAGGCTTTT CGTTGGAATG TGGATGGTGA TCAGTCGCCT TGTGAATATC
TCTCATTGCC GGGCTTGGTT ATCAGCTAAT CCCTCTTCAG TCCCCGAAGT AGCAGCTTGC
GTGCCCAACA CAGACGATCC CAGTATCCCT TGCAACAGTG AGTGATCATT AACTGTTTTA
TGTTTTCTTA AATTGACCAC TACTAGCTGT GAGAGCATGG ATCCTTCTTA CAGTCTTTGT
CGTCCTTTTC GCTGGTGTCA ACCAGTTCTT CGGCTTGCGT TATGTATGTC TCCTCCGAGA
AATCCCAAGA CCTGGCTGAT ATAATCTGGT ATCCAGCCCT CTCTTACCAT TGTAGATTAC
TTCGCCTCCT ACCATGTTAA GTGGCAGCTT ACCCTTCTGG CAGGGTTATG TGGTCTGCCA
GTTGTTAGTA TTTCCTATAG GTCGAGCTTG GGAAAAACTT CCTAAATGGG TCGTTCCCCT
TGGGCCTTTC TCATTCTACC TCAACCCTGG AAAGTTTACT ATCAAAGAGC ATGCCCTTAT
CGTTATTGTA AGCCTTTATA CACCCCCCTT GTAGGCATCC AGCTGAGTGA AACCACTAGT
GTGTCAACTT GACAGCGAGT ACCGCCTATG CTATGGGTTC TCTTGTCGCT ATCATCTCTC
CCGTTTATTG GAACAGTGAT TTTGGAGCTG GTTTCTCCTT TGTATATTTG CTCACCACGC
AAGCTCTTGG GTAAGTCTCG CCATTCCCAG TCTCAAGCTA ATCACAGTAG TTTCGGCCTC
GCTGGTCTCG CTCGAAGGTG GCTGGTATAC CCCGCCGCCC TCATTTGGCC TTCATCCCTC
GCATCCACCG TACTTTTCCG AGCTCTTCAT GAGCCTCAGA GTCGAAGCCC GGCCAACGGG
TGGACTATCA CCAGATACCG CTTCTTCGTT TATCTGACTA TCGGCGCTTT CATTTGGTTC
TGGTTCCCTG ATTACATCTG GACTTCTTTA AGTACTTTTG CGTTCATCAC TTGGATTGTG
CCCCACAACC AGAAGGTCAA CACTATTTTT GGAGTGGGTA TTCATTGTGT TATTCGAAGA
GTGATGGGCT AATGAGCTGT ACAGATGAAC TCTGGTTTAG GTCTTCTGCC AATCAGTTTC
GACTGGACTC AAATCAACTA TGCTGGTTAT CCCCTTACCA CACCTTTCTA TATTACTTGC
AATGCTTTTG CTGTTGTCGT ATTCTTCTAT CTGTTCTTGT CACCCATCGT GAGTTGATCT
CAAGTCTCTA AGGCCGTAAG GCCGTAGGTA CTAATTAAGT ACAGCTTTAC TACAAGAACG
TCTGGCACAG TGCTTAGTAT GCTCCACAAT CGAAATGTGC AACGAACCAC ATGCTAACCT
TTCATCAGTC TCCCTCTTCT CTCCTCATCA ACTTTTGACA ACACCGGATC ATCCTACAAC
ATTTCTCGAG TCGTTGACGA GAAGCTCGAT TTTGTCCTTG CCAAATACCA AGAGTACTCC
CCCATGTACA TCTCCATGTC TTACTCCCTC ACTTACGGCC TCTCCTTTGC CGCTGTGACC
AGTATCGTCT TCTACACCTA CCTTTACAGC GGTAAAGAGA TCTGGGCTAA GTTCAAGGAT
GCCAAGCACG GTGGAGAGGA TATCCACAAG CGATTGATGA GTTCTTACAA GGAAGTTCCT
GATTGGTGGT ACGGCGTCCT CACCCTCGTT GTCCTTGGCC TTGGTATTTT CACTTGTAGA
TACTGGGATA CTCAGCTGTC TGTTTGGGGT TTCATTGTGG TTTGCTTTGG TATGGGGTTA
GTCTTGATTG TGCCCGAAGG TATCCTCGAG GGCACTACCA ATCAGCGAAG TGAGTCTTGC
TCGGTTTGCA GATGGAGTTG TGCTGATCAT CAACATAGTC TTCCTGAATA TCATTACCGA
GTTGATTGCG GGTTATGCTT GGCCTGGGAA ACCTATTGCC AACATGCTCG TCAAGTGTTA
CGGCTATAAC AGTGTCGTAT GTCTTCAGCC TTCTGGGAAC GTTTTCAGAA ACCAACGACT
GACGATGAAA CAGAAACATG GTATGGATTT CGCTCAAGAT CTCAAGCTCG GTCAGTACAT
GGTGAATTAT CATAAATAAC CTATCCCAAC TATCAACTGA CGGGACTGGT AGAAAATCCC
TCCCCGAACT CTTTTCTGGG CGCAGATCTA CTCTACCCTT TTGGCTACAA TGACTCAGAC
CGGAGGTAAG GCTCGCCTTT CGCCTTGGAT GTCCCATGCT GACGGCGGCC CAGTGCTTAG
ATGGATGATC GGCAACATCA AAGACCTCTG TCAACCTACA AATTCCGACC GATTCACTTG
TGCGGGCGCG AAGGTGGTGT ACAACGCGTC TCTTATCTGG GGTACCATTG GTCCTCAAAG
GATGTTCCAG GCGGGCCAGG TTTACAATGG TTTGATGTAC TTCTTCCTCA TTGGCGTAGG
TACTTTTTTT TTGGAATAAG ATATTGCTAA AAAGTGAATT GTTCAGCCTG TGGTTACTGT
GCTCGTCTAC CTTGTTTACC GACGATACCC TAGCAGCTGG GTCAAGTATA TTAACGTGCC
CGTCTTCTTC AATGCTGCAG GTAAATTTTC CTTCCCTACA TGCTTGACCT CAGATAATGA
CGAAGCCTTT TAGGCAATAT CCCTCCCGCC AACACTAGTA AGTCTGAGCT CGACCAAACT
CCATTCGCCC TTAGCTGACT GCTGTCATCA GCCCAATATT CTCTTTGGTT CATCTTTGGT
TTCATTTTCA ACTACCTCAT CCGAAGGCGG GCCTTTGCTT GGTGGAAGCG ATACAATTGT
AAGTTCAACT CCTTCCTTTT CTTTTTTGGC AACGCTTATG CAGTTTTCCT TCTCCAGACC
TAACCCAAGC TGCCATGGAT ACTGGTACGG CACTCGCCAC CATCATCATC TTCTTCGCTC
TTAGCTACAA TGGTGTCAAG TTGAATTGGT GGGGTAACAC TGTTGGATCC GACACGGATG
ATGCCAAAGG GACACCATGG TTGACTGTTC CAAGTGGAAG TTACTTTGGT AGGGGTCCAG
GAGAGTTCTA AGCGATTTGT TTCCTTCTTA TCAAATCCGT GAAAGGTGTG AGTTCTATTG
TGGGTATGGT GGGTGAATAC TGTAATAACA ATTGGTTATA GATAGAAGAT TCCTGTTTGA
CGCCAATAGA TTACT
 
Protein sequence
MSASYNGIEA PIPRDQATDF PIQTFYASTE SPELDQKQED NVDLEKNEST DNVEVKVESI 
KDPGVELTPN EAFRWNVDGD QSPFPEVAAC VPNTDDPSIP CNTVRAWILL TVFVVLFAGV
NQFFGLRYPS LTIGYVVCQL LVFPIGRAWE KLPKWVVPLG PFSFYLNPGK FTIKEHALIV
ICVNLTASTA YAMGSLVAII SPVYWNSDFG AGFSFVYLLT TQALGFGLAG LARRWLVYPA
ALIWPSSLAS TVLFRALHEP QSRSPANGWT ITRYRFFVYL TIGAFIWFWF PDYIWTSLST
FAFITWIVPH NQKVNTIFGM NSGLGLLPIS FDWTQINYAG YPLTTPFYIT CNAFAVVVFF
YLFLSPILYY KNVWHSAYLP LLSSSTFDNT GSSYNISRVV DEKLDFVLAK YQEYSPMYIS
MSYSLTYGLS FAAVTSIVFY TYLYSGKEIW AKFKDAKHGG EDIHKRLMSS YKEVPDWWYG
VLTLVVLGLG IFTCRYWDTQ LSVWGFIVVC FGMGLVLIVP EGILEGTTNQ RIFLNIITEL
IAGYAWPGKP IANMLVKCYG YNSVKHGMDF AQDLKLGQYM KIPPRTLFWA QIYSTLLATM
TQTGVLRWMI GNIKDLCQPT NSDRFTCAGA KVVYNASLIW GTIGPQRMFQ AGQVYNGLMY
FFLIGPVVTV LVYLVYRRYP SSWVKYINVP VFFNAAGNIP PANTTQYSLW FIFGFIFNYL
IRRRAFAWWK RYNYLTQAAM DTGTALATII IFFALSYNGV KLNWWGNTVG SDTDDAKGTP
WLTVPSGSYF GRGPGEF