Gene CNK02920 details


Gene Information

Locus tag: CNK02920
Symbol:
ID: 3254567
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 857511
End bp: 859161
Gene Length: 1651 bp
Protein Length: 362 aa
Translation table:
GC content: 47%
IMG OID: 638253783
Product: hypothetical protein
Protein accession: XP_567887
Protein GI: 58260954
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
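
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span runs from start codon to stop codon with no untranslated regions. Under those assumptions, the gap between the gene length and the coding length would correspond to intronic sequence, which fits the "Is gene spliced: Yes" field.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Coding bases for a protein of protein_aa residues, plus one stop codon (assumption)."""
    return (protein_aa + 1) * 3


gene_bp = span_length(857511, 859161)   # coordinates from this record
cds_bp = coding_length(362)             # protein length from this record
print(gene_bp)           # 1651
print(cds_bp)            # 1089
print(gene_bp - cds_bp)  # 562
```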


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCGTCA ACCCATTTTG GGCTATGGTC CTCAGCAGAT TCTTGCAGGG AGCTTCAAGT 
ACTGTTGTGT GGTCTGGTGA GTCCATGATC CACCGAAATG AACATTGTTA ACTGATGTAT
CTAATGTATT CAGTCGGATT CGCTTTAATG TAAGCGTACA TGGCCAAGGA AATTTTAAGG
CTCTGGCTGA CCTGAATGCA GATGTGAAAA TGTGGATGAG GAGCATATTG GCCGTCAAGT
CGGGTTTGCC ATGGCAGGAG TGTCCATCGG CACAACCGTT GTACGTGTTC TCAATTAATA
TGCATGCTTG AAAAGCCATA TACACGAGCT AACACGTCAA CCCACCGTAG GCTCCTCCCA
TTGGCGGTGT ACTTTACTCC AAACTAGGCT GGCATGCTCC CTTCATCTTT TGCATCATCA
TCTGTTTCAT CGATCTGATT ATGCGCCTCT TCGTCCTTGA GCGTACCGAC CTCCGCAAAT
GGGAAGAAAG GCGCCTCAAT CTTGCCCCTG GAAGTCTTCA ACCCAAAGTA GTAAATGGTG
AAGTCATCAT GCCGGCCCAG GCGGAAACTT CACCTTTTAT CCATTTGACG ACAGCAGAGA
AGGCAAGGCT GTCGGGAGTG GAGTTATCTC CTTGGCAGGT GCTTGTGGCA TTGGCTAGTT
CGCCAAGGGG CATGACTTCG TTCATACAGA TGTTTGCGTA CGGGACGATC ATCGGTGCTT
TAGAGCCTAC GTAAGTGGTT CATGGTTATC AAACGATGAA GATTGGATAG CTAATATATG
AATCAGGTTG ACACTTCATG TACAAAGCCT CTGGGGGAAA GACTCCGACT TTGTTGGCCT
CATTTACTGT ACGTCTCCTC CCCTTGTTGA TCCCCTGTAT CCGTGCTCAT TCGTAACCAT
CAGTGGCCGC TGCTGCTCCA ACATTCTTCT GCGGCCCAAT TGTCGGTGCT CTCGCCGACA
AATATGGCGC TGAATGGCTC ATGCTGCCGG CTATGGTACT CACACTTCCA TGGCTACCTC
TTTTGCTCCT GAAAAAGAGT TTGAGTGCAT TTATTGTCTT CTTCGCCTTC TCCGGTATGT
ATTCGCCTTT GCTAATTGTA ATATGGTAAC CCATATGTTG ACCTCAAAAT TACTGACTTT
CATTGGGTCA AATGGTAGAT ATCTTCCCCA ATTGTGCGAT GGCGCCGACA GGCCTGGAGG
TGACGATGGT TGCGCGAAAC ATTGACGGTG TCAGTGAAAT TCGTAGGTTG TTCGGCCATG
TATCGCATCT GGTCAGGTAA TTTAATTGAT GTGTTCCATA GATCAATTCG CTGCTATGAA
CATCGCTTTC GGTAAGTTTC GATCCCCTAT TCAACCAAAA ATGGCTACTA AACAATCTTA
ACTTGCATCT CTTTCCCATC CACCTTTATT AGCTATATCT AGCGCTATTG GAACCATAGT
CGGCGGCCAG ATGTACGATC ACGTACCCAA CGGATGGGCA GCTACGATCT GGTTCTGCTT
CGGTATGGCG GTGGTCGTCA TCCCTTTCCT GTTCTTTTTC GCTGGAAACA GGTCTCTGTA
CCAGCGGTTA TTGCATATCC GTAAGAAAAA GGGGGAGGAT GTAGAGATGG AGGAGGCCAA
GGGAATTTCA AAAAGAGACT ATACCGGTTG A
 
Protein sequence
MLVNPFWAMV LSRFLQGASS TVVWSVGFAL ICENVDEEHI GRQVGFAMAG VSIGTTVAPP 
IGGVLYSKLG WHAPFIFCII ICFIDLIMRL FVLERTDLRK WEERRLNLAP GSLQPKVVNG
EVIMPAQAET SPFIHLTTAE KARLSGVELS PWQVLVALAS SPRGMTSFIQ MFAYGTIIGA
LEPTLWGKDS DFVGLIYLAA AAPTFFCGPI VGALADKYGA EWLMLPAMVL TLPWLPLLLL
KKSLSAFIVF FAFSDIFPNC AMAPTGLEVT MVARNIDGVS EIHQFAAMNI AFVGGQMYDH
VPNGWAATIW FCFGMAVVVI PFLFFFAGNR SLYQRLLHIR KKKGEDVEME EAKGISKRDY
TG
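
The sequences in this record are printed in blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that; the helper names are hypothetical and are not part of the source database. The sample input is the first line of the gene sequence shown in this record.

```python
def clean_sequence(block: str) -> str:
    """Join a blocked sequence listing into one uppercase string without whitespace."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the bases of seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record, used only as sample input.
sample = "ATGCTCGTCA ACCCATTTTG GGCTATGGTC CTCAGCAGAT TCTTGCAGGG AGCTTCAAGT"
seq = clean_sequence(sample)
print(len(seq))                       # 60
print(round(100 * gc_fraction(seq)))  # GC percentage of the sample line only
```

Applied to the full gene and protein listings, the same helpers give lengths that can be compared with the Gene Length and Protein Length fields of the record.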