Gene CNK02980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK02980 
Symbol 
ID3254717 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp873302 
End bp875448 
Gene Length2147 bp 
Protein Length598 aa 
Translation table 
GC content47% 
IMG OID638253789 
Producthypothetical protein 
Protein accessionXP_567893 
Protein GI58260966 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.157268 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAGC AGCTCTTCTT CCTCTCATCA ACAGTTCTCT ATCTCACCAT CCCGGGCACT 
CCATTCCGTC TTACTTGGTT CAAGAAGACA AAGCAATTTG ATACACCAAC AATGAGCGAA
CTCCCTCGAA CAAAGACGAA CGATACCGTC TGCACTCTCA CATCCCCTAT TTCTGGTGCT
CCATCACCCG AGAAAACAGC AGTTGATGTA GACGCCATTT GCAAAGAAAA GAAACTTGAC
AGCAAATCTA TCGATGAACT ATCACCTCCT TCGACTAACA CTGAAGTACA CCTCGAAGCT
CAAACTCAGG AGGAAAGTAG TTTGACCAAG TTAAGTCCAA GCCGCAAGTG GTTCTTACTA
CTGGTTTTCA GTGTTGCTCA GGTAGGCACA GTGGTTCGCT GCATTGTATC TATATTTGCT
GACAATGCTA CACCTGCAAA GTATCTCGAT GTCTGTTCCG TATCTGCCCT TTTTGTCCTT
ACCGATGCGA TTCAAAAAGA CCTTGGCATC CAGTACGAGG CTTCCTCTTG GATCATTGTA
AGTTGACCAC ACAGTACTAT CAGTCGGTCT CAAACTGACC ACTTCATCAC AGACTAGCTA
CTCCGTCACA TTTGCCTCAT TTTTATTATT CTGGGGACGT GTAGCGGACC TATACTCGGC
AAAAGCAGTC TTCGCATACG GCTTCCTTGG ACTCGGTGCT ACCAACTTAG TCATCTCTTT
TATGCCTAAT CAGTTTGCCT ATTTCATCTT CCGTGCTTTG AGCGGAATTG CTGGGGCTGC
TACTGTAAGT TTCAGTTCCC ATTTAATAAG TAGTGATGGT TGATCAGATA TTGCAGATTC
CCTCGGCTTT CAGGTTGATC CTCGCTCTTT TTGAGCCGAA AGAGCTCAAC ATTGCTCTTA
CCATTTTCGG TCTCAGCGGT GCCATCGCCA ACGTCACAGG CCTTGTTATT GCTGGCTTCT
TTGGATTCAT CACTGCTAAT GACCAACAAG CCGGCTGGAG ATGGTTCTTC CGAGTAGGTC
TCAAATCCAT CGACTGGAGA TTTGCTCAAC TGACAATCGT TTCCTTTTTT TCGCAGATGA
TGGCTATCGT AATTGTACCT TTCGGCGTTT CCGCCCTTAC TCTCATCCCT CAATCCGCTG
GGAAACTGGT TGAACAACTT TCACCCCGAG ATAAGCTCAA AAGGCTCGAC ATTGTCGGCT
GCTTCATGAT GCTTGCCTCC ATCATCTTGC TTATACTGGG TATCACTCTC GGTGCTTCTT
ATGGCTGGAA AAAACCAGGC TTCTTGGTTC CATTTTTGCT TAGCTGGCCC ATCTTTATTG
CTTTCTTTAT CTACGAAGCG AGATTACCTG AGAGCTACGC TTTGATCCCT CCATCATTCT
GGAAAATTCC AAACATGACC TTGTTGATTG TCTTTGCGCT CGGTATTTAC CCTTGGTGGT
GTGTAAGTCA ATTCTTTGTT GCTACCGATA CGGCTGAGAG CTGATGTCTG GAATGCAGGT
GAGCCAACTG CCCCTGATCG AACGATTCAT CGACTACTTC AACGAACCTG CCATCATCGC
CGCCCTCCGC GTCTTCCCTC AAGGTGCTTC AGGTCTCTTC GTGGCATTCT TTGTCCCTCG
ACTTCTTCAA AAGGTCGGCA GTGCCCGAAT ACCTCTTGCC GGCGGTATGT TCGTCGGTGC
TGCCATGTAT CTACTCATTA TTTTCAATGA TGGCAAACTG GGGAGCGATT ATTGGAGATG
GCTTTTCCCC GCTTTCATCA TTGGAAGTGG TGCTGCTATG ATAAGCTTCT TATCTACAAA
GTATGTTTAT CTCCTTTGAT GGCTTCTTCA TAATCCTTTT CGCAGATGAC TGACTCTAGC
ACAGTATCAC AGTTATGACC TCCGTTCCCC CCGAAATTTC TGGTGTTGCG GGTGCGATGC
TCCAAGTCTC TCTCCAAGTC GGCGCTGCCA TTAGCCTCAC TGTGCAGGCT GGGCTTCTGA
CCCTCAATCC CGGAGGAATG ACCAACTATG CCAATGTGCA AGCATCCTTA TGGTTCCAAT
TCGGGTGGCT TGTGCTGAAT GCTTTGATTA TTATTGTTTT CTTTAGACAG AACAAAATGC
CAAAGATATC AGAGGAAGAG GGAGCTGGGG CGGCGTTTGG AGCATAA
 
Protein sequence
MKEQLFFLSS TVLYLTIPGT PFRLTWFKKT KQFDTPTMSE LPRTKTNDTV CTLTSPISGA 
PSPEKTAVDV DAICKEKKLD SKSIDELSPP STNTEVHLEA QTQEESSLTK LSPSRKWFLL
LVFSVAQYLD VCSVSALFVL TDAIQKDLGI QYEASSWIIT SYSVTFASFL LFWGRVADLY
SAKAVFAYGF LGLGATNLVI SFMPNQFAYF IFRALSGIAG AATIPSAFRL ILALFEPKEL
NIALTIFGLS GAIANVTGLV IAGFFGFITA NDQQAGWRWF FRMMAIVIVP FGVSALTLIP
QSAGKLVEQL SPRDKLKRLD IVGCFMMLAS IILLILGITL GASYGWKKPG FLVPFLLSWP
IFIAFFIYEA RLPESYALIP PSFWKIPNMT LLIVFALGIY PWWCVSQLPL IERFIDYFNE
PAIIAALRVF PQGASGLFVA FFVPRLLQKV GSARIPLAGG MFVGAAMYLL IIFNDGKLGS
DYWRWLFPAF IIGSGAAMIS FLSTNITVMT SVPPEISGVA GAMLQVSLQV GAAISLTVQA
GLLTLNPGGM TNYANVQASL WFQFGWLVLN ALIIIVFFRQ NKMPKISEEE GAGAAFGA