Gene CNF01880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF01880 
Symbol 
ID3258474 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp549159 
End bp552330 
Gene Length3172 bp 
Protein Length751 aa 
Translation table 
GC content47% 
IMG OID638257313 
Producthypothetical protein 
Protein accessionXP_571528 
Protein GI58268744 
COG category 
COG ID 
TIGRFAM ID[TIGR00727] small oligopeptide transporter, OPT family
[TIGR00728] oligopeptide transporters, OPT superfamily 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.314778 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAATCA ACCCCTTAGA GGCCGAGGCT TACGAGCTAC CACCGCTATC TCAGGCTGCT 
ATTTCACCTG CAGAACAATA TTTTGAAGAC CTTGTAGCGG AGGAAGAGAG CGCCAAGTTG
CTGGATACTG AAGGACAGCT CAAACAAGAT GTCGAGGACG AACTGGTAGA TGATTCTGCT
CTGTACATGC GGCGAACGCA AGATGAGGAA GTGATTGGGA GAGGTAGCAA GGTCGAGGAA
CTGATTGCTA GGGTAACTTC TGTGCAACTC CTCAATGCAT CACTGAGCTT ATTAAGTCTG
TAGACTGTTC CGTCACATGA TGACCCGACC CTTCCGACTC TCACCTTGCG CGTAATCTTC
CTCGGCTCGT CATTTTGTAT ATTGGGAGCT TGTGCTTCCC AAATATTTTA CTTCAAGTCC
AACGCTCCTT CCTTCTCTTC ATATTTTGTG GTTTTAGCTA CATATCCCTT GGGCCATTTA
CTGGCAAATG AGAGGCTTGT GCCGAGAGGA AATAAGATAT TTGGATGGGA GATAAACCCA
GGGAGATTTA GCATCAAAGA GGCCATTCTG ATCAGGTATG TGGTAGTTTA TACTTCCTTG
AGGCTTAACT TTGTAGCTTA CTAGAGCGCA TCTCAGCGTT CTCTCCTCCT CAGGGGCGTC
TTCAGCCTAC GCGGCCGATA TCTTGGCTAT CATGGACCTA TATTTTGACA CCCCGCTGTC
GCCGTTACCC TCCATTCTAC TCCTTCTGAC AACTCAGTGC ATAGGGTTCG GCCTCGCCGG
TAAGGAGAAA TCTGACATGA TATCTATAAC TGACTTTCCC TCTAGGAATG TTACAAAACC
TACTAGTCAG TCCGCCTGCC ATGTACTGGC CCTCAACCCT CGTCACTGTT CAGCTCTTCA
CCACCCTGTA TTCGACAACA TCATCCACTC TATCACTCAC CCAGCAACAA CTTACCTTTA
CGAGACTTCG CGTCTTCTTA GTCATCTTTC TTGCCATATT TCTCTATCAA TTTCTCCCGT
TCTTATTCTT CCCGACATTG ACCAGCGTGT CACTGCTGTG CCTTGTCAAC AACCGTTCAT
GGTGGATGCG GACGCTGGGT AGTGCTTATC AAGGGCTGGG AATAATGGAT TTTAGCTTTG
ACTGGAGTAG TATCGGCTCG TCTGGGCCAT TATATACCCC CTACTGGGCC CTCGGGAATT
ATTTTGGAGG TTTGGTAGGA ATGCTGTGGA TTGTAAGTCG TCATTTCTAA CTTGAAAGGA
CTATTGCGTT TAACATTTGG TAGATATTGC CTTTGCTCTT GATGCTCAAC TTTTGGGATG
CTAGAAGTTT TCCTTCTCCA GTGTCAGCTG GACTTTTTAA CGCAACATTC CAGAAATTTG
ATGTAGCTTC CATCCTCAAA CCTGATTTGT CGCTGGACGA AGCTGCATGG GAGGAGAAGA
AACCTTTACT CCTCACCCCT TATTTGTGAG TTTGGCATCT TCAAAATCGA AACAAGTGGA
ATTAAAGTAG AATTAGTGCT CTGACTTACG GGTTATCTTT TGCTGCCCTG TCAAGTGTTT
TGGTGCATGT CTGGCTCTGG CATAGAGACG AGATCAAGCA AGGCAAGTGA AATCGTGATG
TTGCAGCGAA CAGTCAAACT AACAATTTGA AAAGCTCTTT CAAGCAGATT ACAGCTCAAT
GATGTTCACA AGTAGGGTTT TGAAAACTAT GCCTCCTCTA CTATTCTGAC ACCACCAACT
ATAGTAAACT TATGCGATCA TACCTACCAG TGCCTTCGGT ATGGTATGTG GGTCTTTTGG
CTGTTAATTT TGGTGCTGCA GGCAAGTGTC TTGCCATCTT TCCGTATCAC GATAATATTG
ATTGAGAACA AAGTCATTCT GGTCAAAACC ACTTCATTGC AGATGCCAAT CTGGGCTCTG
GTGTTGGCAA TGGCCATCGC CACTGTAAGT GCTCGGGAGC TAATGTCAAA ACAATGAGGC
GCTCATCGAG ATACCACAGA TATTTCTGGT TCCGTAGGCA CTTCTGAATG ATTGGTCCAG
AATACAGCTT CTAATATGTT TGAGACAGTG TCGGTATCAT AGCGGCTGTC AGCAATACAC
AAATTGGACT TGTGCGTCAC CTTTTCAGAC ATAGAGCTCC TGCTGATGTA CATGTGTTAC
AGAATGTCTT GACTGAATTG TAAATTTCCG CAATCATCAA CCGTCCAACC ACTGTTCTAA
CAATTTCAGC GTGGCCGGCA TCCTCATGCC CGGTAAGCCA ATAGGCAAGT GAGTGATGAT
GTTTATTTTC CTTGAATGCT CTTTTCACTG CTGATCCATA CAGTGTCACA TTCAAATGTT
ATGGATGTAA GCTATCACTT TTAGCATATT GGTGGTTGAA ACATTGACAG TCATATAGAT
ATGGCCATGT CACAAGCCCT TGCTCTCACT GCTGACCTGA AACTGGGCTG GTATACGTCC
ATTCCCCCTC GAGAAATGTT CACGTGCCAA ATCATTGGAA CTGTTCTGGG TGCTCTCACG
AACTGTGCGT TTTAGCCACC TTTCAGCCAT CCGGTGCAAA GTAATGCTGA TGTTCTGCAG
ATGCCACCCT TGTATCCGTC ATGGCAGCAA AAAGACCATA TCTCAACGGC TCCCTGACTG
ATCCTACCGG GCAATGGACT GGACGCGCCC CCAGTATATT TTACTCTGCG AGTATTATCT
GGGGAGCCGT TGCTCCTGCT CGTTTCTTCA GCGGTGGCTA CGAAGTCCTT TATCTCGGTT
TTTTAGTGGG GGCACTTGTG CCCGTTGGTT TTTGGTTGGC ACATAAGAAA TGGCCGGGGT
ACAAGTTGAA TAAGGTGGTA TTTCCTATCA TTTGTAGTGG TGCGACTGTG GTCCCTCAAT
AGTGAGTGGG ACAGATTGTT GAGCAACGGT GAACTCATTC ACGCTCAGTC CGAGCAACAT
CATCCTCACT TCTCTCTTGA CCGCGGTTCT GGTCAACTCG TGGTTTGCGA AGCGCCATCC
CAAGTTGCAC AGGCAGTACG TATATGTAGC TTCATCAGCT CTGGACGCTG GGACATCAAT
AACGGCGCTG GCGATTTATG TGCTCTTCGG AGGTGTGTTT TGGAGTTGGA ATGGGTGGGA
GTGGTGGGGT AATTCAGGGG TGGACTCTGA GCACTGCGTT CCTGGAAGCT GA
 
Protein sequence
MPINPLEAEA YELPPLSQAA ISPAEQYFED LVAEEESAKL LDTEGQLKQD VEDELVDDSA 
LYMRRTQDEE VIGRGSKVEE LIARTVPSHD DPTLPTLTLR VIFLGSSFCI LGACASQIFY
FKSNAPSFSS YFVVLATYPL GHLLANERLV PRGNKIFGWE INPGRFSIKE AILISVLSSS
GASSAYAADI LAIMDLYFDT PLSPLPSILL LLTTQCIGFG LAGMLQNLLV SPPAMYWPST
LVTVQLFTTL YSTTSSTLSL TQQQLTFTRL RVFLVIFLAI FLYQFLPFLF FPTLTSVSLL
CLVNNRSWWM RTLGSAYQGL GIMDFSFDWS SIGSSGPLYT PYWALGNYFG GLVGMLWIIL
PLLLMLNFWD ARSFPSPVSA GLFNATFQKF DVASILKPDL SLDEAAWEEK KPLLLTPYFA
LTYGLSFAAL SSVLVHVWLW HRDEIKQGNK LMRSYLPVPS VWYVGLLAVN FGAAVILVKT
TSLQMPIWAL VLAMAIATVS ARDVGIIAAV SNTQIGLPIG NVTFKCYGYM AMSQALALTA
DLKLGWYTSI PPREMFTCQI IGTVLGALTN YATLVSVMAA KRPYLNGSLT DPTGQWTGRA
PSIFYSASII WGAVAPARFF SGGYEVLYLG FLVGALVPVG FWLAHKKWPG YKLNKVVFPI
ICSGATVVPQ YPSNIILTSL LTAVLVNSWF AKRHPKLHRQ YVYVASSALD AGTSITALAI
YVLFGGVFWS WNGWEWWGNS GVDSEHCVPG S