Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK03020 |
Symbol | |
ID | 3254716 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | - |
Start bp | 881120 |
End bp | 884554 |
Gene Length | 3435 bp |
Protein Length | 797 aa |
Translation table | |
GC content | 47% |
IMG OID | 638253793 |
Product | oligopeptide transporter, putative |
Protein accession | XP_567897 |
Protein GI | 58260974 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00727] small oligopeptide transporter, OPT family [TIGR00728] oligopeptide transporters, OPT superfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGGCGGCCA AATCAAAGTC AACTAGTTTT CCATTAGTAA GATGTCGGCA TCCTACAACG GGATTGAGGC ACCTATCCCA CGGGATCAGG CAACCGATTT TCCAATCCAA ACATTCTACG CAAGTACTGA GTCTCCAGAG CTCGATCAAA AGCAGGAGGA CAATGTGGAT CTAGAGAAGA ATGAGTCTAC AGATAATGTT GAAGTCAAGG TCGAGTCCAT TAAAGACCCC GGAGTGGAGC TTACACCCAA TGAGGCTTTT CGTTGGAATG TGGATGGTGA TCAGTCGCCT TGTGAATATC TCTCATTGCC GGGCTTGGTT ATCAGCTAAT CCCTCTTCAG TCCCCGAAGT AGCAGCTTGC GTGCCCAACA CAGACGATCC CAGTATCCCT TGCAACAGTG AGTGATCATT AACTGTTTTA TGTTTTCTTA AATTGACCAC TACTAGCTGT GAGAGCATGG ATCCTTCTTA CAGTCTTTGT CGTCCTTTTC GCTGGTGTCA ACCAGTTCTT CGGCTTGCGT TATGTATGTC TCCTCCGAGA AATCCCAAGA CCTGGCTGAT ATAATCTGGT ATCCAGCCCT CTCTTACCAT TGTAGATTAC TTCGCCTCCT ACCATGTTAA GTGGCAGCTT ACCCTTCTGG CAGGGTTATG TGGTCTGCCA GTTGTTAGTA TTTCCTATAG GTCGAGCTTG GGAAAAACTT CCTAAATGGG TCGTTCCCCT TGGGCCTTTC TCATTCTACC TCAACCCTGG AAAGTTTACT ATCAAAGAGC ATGCCCTTAT CGTTATTGTA AGCCTTTATA CACCCCCCTT GTAGGCATCC AGCTGAGTGA AACCACTAGT GTGTCAACTT GACAGCGAGT ACCGCCTATG CTATGGGTTC TCTTGTCGCT ATCATCTCTC CCGTTTATTG GAACAGTGAT TTTGGAGCTG GTTTCTCCTT TGTATATTTG CTCACCACGC AAGCTCTTGG GTAAGTCTCG CCATTCCCAG TCTCAAGCTA ATCACAGTAG TTTCGGCCTC GCTGGTCTCG CTCGAAGGTG GCTGGTATAC CCCGCCGCCC TCATTTGGCC TTCATCCCTC GCATCCACCG TACTTTTCCG AGCTCTTCAT GAGCCTCAGA GTCGAAGCCC GGCCAACGGG TGGACTATCA CCAGATACCG CTTCTTCGTT TATCTGACTA TCGGCGCTTT CATTTGGTTC TGGTTCCCTG ATTACATCTG GACTTCTTTA AGTACTTTTG CGTTCATCAC TTGGATTGTG CCCCACAACC AGAAGGTCAA CACTATTTTT GGAGTGGGTA TTCATTGTGT TATTCGAAGA GTGATGGGCT AATGAGCTGT ACAGATGAAC TCTGGTTTAG GTCTTCTGCC AATCAGTTTC GACTGGACTC AAATCAACTA TGCTGGTTAT CCCCTTACCA CACCTTTCTA TATTACTTGC AATGCTTTTG CTGTTGTCGT ATTCTTCTAT CTGTTCTTGT CACCCATCGT GAGTTGATCT CAAGTCTCTA AGGCCGTAAG GCCGTAGGTA CTAATTAAGT ACAGCTTTAC TACAAGAACG TCTGGCACAG TGCTTAGTAT GCTCCACAAT CGAAATGTGC AACGAACCAC ATGCTAACCT TTCATCAGTC TCCCTCTTCT CTCCTCATCA ACTTTTGACA ACACCGGATC ATCCTACAAC ATTTCTCGAG TCGTTGACGA GAAGCTCGAT TTTGTCCTTG CCAAATACCA AGAGTACTCC CCCATGTACA TCTCCATGTC TTACTCCCTC ACTTACGGCC TCTCCTTTGC CGCTGTGACC AGTATCGTCT TCTACACCTA CCTTTACAGC GGTAAAGAGA TCTGGGCTAA GTTCAAGGAT GCCAAGCACG GTGGAGAGGA TATCCACAAG CGATTGATGA GTTCTTACAA GGAAGTTCCT GATTGGTGGT ACGGCGTCCT CACCCTCGTT GTCCTTGGCC TTGGTATTTT CACTTGTAGA TACTGGGATA CTCAGCTGTC TGTTTGGGGT TTCATTGTGG TTTGCTTTGG TATGGGGTTA GTCTTGATTG TGCCCGAAGG TATCCTCGAG GGCACTACCA ATCAGCGAAG TGAGTCTTGC TCGGTTTGCA GATGGAGTTG TGCTGATCAT CAACATAGTC TTCCTGAATA TCATTACCGA GTTGATTGCG GGTTATGCTT GGCCTGGGAA ACCTATTGCC AACATGCTCG TCAAGTGTTA CGGCTATAAC AGTGTCGTAT GTCTTCAGCC TTCTGGGAAC GTTTTCAGAA ACCAACGACT GACGATGAAA CAGAAACATG GTATGGATTT CGCTCAAGAT CTCAAGCTCG GTCAGTACAT GGTGAATTAT CATAAATAAC CTATCCCAAC TATCAACTGA CGGGACTGGT AGAAAATCCC TCCCCGAACT CTTTTCTGGG CGCAGATCTA CTCTACCCTT TTGGCTACAA TGACTCAGAC CGGAGGTAAG GCTCGCCTTT CGCCTTGGAT GTCCCATGCT GACGGCGGCC CAGTGCTTAG ATGGATGATC GGCAACATCA AAGACCTCTG TCAACCTACA AATTCCGACC GATTCACTTG TGCGGGCGCG AAGGTGGTGT ACAACGCGTC TCTTATCTGG GGTACCATTG GTCCTCAAAG GATGTTCCAG GCGGGCCAGG TTTACAATGG TTTGATGTAC TTCTTCCTCA TTGGCGTAGG TACTTTTTTT TTGGAATAAG ATATTGCTAA AAAGTGAATT GTTCAGCCTG TGGTTACTGT GCTCGTCTAC CTTGTTTACC GACGATACCC TAGCAGCTGG GTCAAGTATA TTAACGTGCC CGTCTTCTTC AATGCTGCAG GTAAATTTTC CTTCCCTACA TGCTTGACCT CAGATAATGA CGAAGCCTTT TAGGCAATAT CCCTCCCGCC AACACTAGTA AGTCTGAGCT CGACCAAACT CCATTCGCCC TTAGCTGACT GCTGTCATCA GCCCAATATT CTCTTTGGTT CATCTTTGGT TTCATTTTCA ACTACCTCAT CCGAAGGCGG GCCTTTGCTT GGTGGAAGCG ATACAATTGT AAGTTCAACT CCTTCCTTTT CTTTTTTGGC AACGCTTATG CAGTTTTCCT TCTCCAGACC TAACCCAAGC TGCCATGGAT ACTGGTACGG CACTCGCCAC CATCATCATC TTCTTCGCTC TTAGCTACAA TGGTGTCAAG TTGAATTGGT GGGGTAACAC TGTTGGATCC GACACGGATG ATGCCAAAGG GACACCATGG TTGACTGTTC CAAGTGGAAG TTACTTTGGT AGGGGTCCAG GAGAGTTCTA AGCGATTTGT TTCCTTCTTA TCAAATCCGT GAAAGGTGTG AGTTCTATTG TGGGTATGGT GGGTGAATAC TGTAATAACA ATTGGTTATA GATAGAAGAT TCCTGTTTGA CGCCAATAGA TTACT
|
Protein sequence | MSASYNGIEA PIPRDQATDF PIQTFYASTE SPELDQKQED NVDLEKNEST DNVEVKVESI KDPGVELTPN EAFRWNVDGD QSPFPEVAAC VPNTDDPSIP CNTVRAWILL TVFVVLFAGV NQFFGLRYPS LTIGYVVCQL LVFPIGRAWE KLPKWVVPLG PFSFYLNPGK FTIKEHALIV ICVNLTASTA YAMGSLVAII SPVYWNSDFG AGFSFVYLLT TQALGFGLAG LARRWLVYPA ALIWPSSLAS TVLFRALHEP QSRSPANGWT ITRYRFFVYL TIGAFIWFWF PDYIWTSLST FAFITWIVPH NQKVNTIFGM NSGLGLLPIS FDWTQINYAG YPLTTPFYIT CNAFAVVVFF YLFLSPILYY KNVWHSAYLP LLSSSTFDNT GSSYNISRVV DEKLDFVLAK YQEYSPMYIS MSYSLTYGLS FAAVTSIVFY TYLYSGKEIW AKFKDAKHGG EDIHKRLMSS YKEVPDWWYG VLTLVVLGLG IFTCRYWDTQ LSVWGFIVVC FGMGLVLIVP EGILEGTTNQ RIFLNIITEL IAGYAWPGKP IANMLVKCYG YNSVKHGMDF AQDLKLGQYM KIPPRTLFWA QIYSTLLATM TQTGVLRWMI GNIKDLCQPT NSDRFTCAGA KVVYNASLIW GTIGPQRMFQ AGQVYNGLMY FFLIGPVVTV LVYLVYRRYP SSWVKYINVP VFFNAAGNIP PANTTQYSLW FIFGFIFNYL IRRRAFAWWK RYNYLTQAAM DTGTALATII IFFALSYNGV KLNWWGNTVG SDTDDAKGTP WLTVPSGSYF GRGPGEF
|
| |