Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF01880 |
Symbol | |
ID | 3258474 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | - |
Start bp | 549159 |
End bp | 552330 |
Gene Length | 3172 bp |
Protein Length | 751 aa |
Translation table | |
GC content | 47% |
IMG OID | 638257313 |
Product | hypothetical protein |
Protein accession | XP_571528 |
Protein GI | 58268744 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00727] small oligopeptide transporter, OPT family [TIGR00728] oligopeptide transporters, OPT superfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.314778 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATCA ACCCCTTAGA GGCCGAGGCT TACGAGCTAC CACCGCTATC TCAGGCTGCT ATTTCACCTG CAGAACAATA TTTTGAAGAC CTTGTAGCGG AGGAAGAGAG CGCCAAGTTG CTGGATACTG AAGGACAGCT CAAACAAGAT GTCGAGGACG AACTGGTAGA TGATTCTGCT CTGTACATGC GGCGAACGCA AGATGAGGAA GTGATTGGGA GAGGTAGCAA GGTCGAGGAA CTGATTGCTA GGGTAACTTC TGTGCAACTC CTCAATGCAT CACTGAGCTT ATTAAGTCTG TAGACTGTTC CGTCACATGA TGACCCGACC CTTCCGACTC TCACCTTGCG CGTAATCTTC CTCGGCTCGT CATTTTGTAT ATTGGGAGCT TGTGCTTCCC AAATATTTTA CTTCAAGTCC AACGCTCCTT CCTTCTCTTC ATATTTTGTG GTTTTAGCTA CATATCCCTT GGGCCATTTA CTGGCAAATG AGAGGCTTGT GCCGAGAGGA AATAAGATAT TTGGATGGGA GATAAACCCA GGGAGATTTA GCATCAAAGA GGCCATTCTG ATCAGGTATG TGGTAGTTTA TACTTCCTTG AGGCTTAACT TTGTAGCTTA CTAGAGCGCA TCTCAGCGTT CTCTCCTCCT CAGGGGCGTC TTCAGCCTAC GCGGCCGATA TCTTGGCTAT CATGGACCTA TATTTTGACA CCCCGCTGTC GCCGTTACCC TCCATTCTAC TCCTTCTGAC AACTCAGTGC ATAGGGTTCG GCCTCGCCGG TAAGGAGAAA TCTGACATGA TATCTATAAC TGACTTTCCC TCTAGGAATG TTACAAAACC TACTAGTCAG TCCGCCTGCC ATGTACTGGC CCTCAACCCT CGTCACTGTT CAGCTCTTCA CCACCCTGTA TTCGACAACA TCATCCACTC TATCACTCAC CCAGCAACAA CTTACCTTTA CGAGACTTCG CGTCTTCTTA GTCATCTTTC TTGCCATATT TCTCTATCAA TTTCTCCCGT TCTTATTCTT CCCGACATTG ACCAGCGTGT CACTGCTGTG CCTTGTCAAC AACCGTTCAT GGTGGATGCG GACGCTGGGT AGTGCTTATC AAGGGCTGGG AATAATGGAT TTTAGCTTTG ACTGGAGTAG TATCGGCTCG TCTGGGCCAT TATATACCCC CTACTGGGCC CTCGGGAATT ATTTTGGAGG TTTGGTAGGA ATGCTGTGGA TTGTAAGTCG TCATTTCTAA CTTGAAAGGA CTATTGCGTT TAACATTTGG TAGATATTGC CTTTGCTCTT GATGCTCAAC TTTTGGGATG CTAGAAGTTT TCCTTCTCCA GTGTCAGCTG GACTTTTTAA CGCAACATTC CAGAAATTTG ATGTAGCTTC CATCCTCAAA CCTGATTTGT CGCTGGACGA AGCTGCATGG GAGGAGAAGA AACCTTTACT CCTCACCCCT TATTTGTGAG TTTGGCATCT TCAAAATCGA AACAAGTGGA ATTAAAGTAG AATTAGTGCT CTGACTTACG GGTTATCTTT TGCTGCCCTG TCAAGTGTTT TGGTGCATGT CTGGCTCTGG CATAGAGACG AGATCAAGCA AGGCAAGTGA AATCGTGATG TTGCAGCGAA CAGTCAAACT AACAATTTGA AAAGCTCTTT CAAGCAGATT ACAGCTCAAT GATGTTCACA AGTAGGGTTT TGAAAACTAT GCCTCCTCTA CTATTCTGAC ACCACCAACT ATAGTAAACT TATGCGATCA TACCTACCAG TGCCTTCGGT ATGGTATGTG GGTCTTTTGG CTGTTAATTT TGGTGCTGCA GGCAAGTGTC TTGCCATCTT TCCGTATCAC GATAATATTG ATTGAGAACA AAGTCATTCT GGTCAAAACC ACTTCATTGC AGATGCCAAT CTGGGCTCTG GTGTTGGCAA TGGCCATCGC CACTGTAAGT GCTCGGGAGC TAATGTCAAA ACAATGAGGC GCTCATCGAG ATACCACAGA TATTTCTGGT TCCGTAGGCA CTTCTGAATG ATTGGTCCAG AATACAGCTT CTAATATGTT TGAGACAGTG TCGGTATCAT AGCGGCTGTC AGCAATACAC AAATTGGACT TGTGCGTCAC CTTTTCAGAC ATAGAGCTCC TGCTGATGTA CATGTGTTAC AGAATGTCTT GACTGAATTG TAAATTTCCG CAATCATCAA CCGTCCAACC ACTGTTCTAA CAATTTCAGC GTGGCCGGCA TCCTCATGCC CGGTAAGCCA ATAGGCAAGT GAGTGATGAT GTTTATTTTC CTTGAATGCT CTTTTCACTG CTGATCCATA CAGTGTCACA TTCAAATGTT ATGGATGTAA GCTATCACTT TTAGCATATT GGTGGTTGAA ACATTGACAG TCATATAGAT ATGGCCATGT CACAAGCCCT TGCTCTCACT GCTGACCTGA AACTGGGCTG GTATACGTCC ATTCCCCCTC GAGAAATGTT CACGTGCCAA ATCATTGGAA CTGTTCTGGG TGCTCTCACG AACTGTGCGT TTTAGCCACC TTTCAGCCAT CCGGTGCAAA GTAATGCTGA TGTTCTGCAG ATGCCACCCT TGTATCCGTC ATGGCAGCAA AAAGACCATA TCTCAACGGC TCCCTGACTG ATCCTACCGG GCAATGGACT GGACGCGCCC CCAGTATATT TTACTCTGCG AGTATTATCT GGGGAGCCGT TGCTCCTGCT CGTTTCTTCA GCGGTGGCTA CGAAGTCCTT TATCTCGGTT TTTTAGTGGG GGCACTTGTG CCCGTTGGTT TTTGGTTGGC ACATAAGAAA TGGCCGGGGT ACAAGTTGAA TAAGGTGGTA TTTCCTATCA TTTGTAGTGG TGCGACTGTG GTCCCTCAAT AGTGAGTGGG ACAGATTGTT GAGCAACGGT GAACTCATTC ACGCTCAGTC CGAGCAACAT CATCCTCACT TCTCTCTTGA CCGCGGTTCT GGTCAACTCG TGGTTTGCGA AGCGCCATCC CAAGTTGCAC AGGCAGTACG TATATGTAGC TTCATCAGCT CTGGACGCTG GGACATCAAT AACGGCGCTG GCGATTTATG TGCTCTTCGG AGGTGTGTTT TGGAGTTGGA ATGGGTGGGA GTGGTGGGGT AATTCAGGGG TGGACTCTGA GCACTGCGTT CCTGGAAGCT GA
|
Protein sequence | MPINPLEAEA YELPPLSQAA ISPAEQYFED LVAEEESAKL LDTEGQLKQD VEDELVDDSA LYMRRTQDEE VIGRGSKVEE LIARTVPSHD DPTLPTLTLR VIFLGSSFCI LGACASQIFY FKSNAPSFSS YFVVLATYPL GHLLANERLV PRGNKIFGWE INPGRFSIKE AILISVLSSS GASSAYAADI LAIMDLYFDT PLSPLPSILL LLTTQCIGFG LAGMLQNLLV SPPAMYWPST LVTVQLFTTL YSTTSSTLSL TQQQLTFTRL RVFLVIFLAI FLYQFLPFLF FPTLTSVSLL CLVNNRSWWM RTLGSAYQGL GIMDFSFDWS SIGSSGPLYT PYWALGNYFG GLVGMLWIIL PLLLMLNFWD ARSFPSPVSA GLFNATFQKF DVASILKPDL SLDEAAWEEK KPLLLTPYFA LTYGLSFAAL SSVLVHVWLW HRDEIKQGNK LMRSYLPVPS VWYVGLLAVN FGAAVILVKT TSLQMPIWAL VLAMAIATVS ARDVGIIAAV SNTQIGLPIG NVTFKCYGYM AMSQALALTA DLKLGWYTSI PPREMFTCQI IGTVLGALTN YATLVSVMAA KRPYLNGSLT DPTGQWTGRA PSIFYSASII WGAVAPARFF SGGYEVLYLG FLVGALVPVG FWLAHKKWPG YKLNKVVFPI ICSGATVVPQ YPSNIILTSL LTAVLVNSWF AKRHPKLHRQ YVYVASSALD AGTSITALAI YVLFGGVFWS WNGWEWWGNS GVDSEHCVPG S
|
| |