Gene CNK00540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK00540 
Symbol 
ID3254488 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp178627 
End bp180642 
Gene Length2016 bp 
Protein Length535 aa 
Translation table 
GC content46% 
IMG OID638253542 
Producthypothetical protein 
Protein accessionXP_567617 
Protein GI58260414 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCATTC CAGCAGAGGC ACAAGTAGCT TCTCCACCCA TTTTCACTGA AGACGAGGAA 
GTCGTCTTGG AAGAGGAGGA TGTCGAACAA CCTGATCTGC GTCGAATCGC CACACATCTA
CATGACCCAG CCTCCACAGC TACGTTGAGC GATGATCAAG CAGGCACAAC TGCCGGACAG
ACTGTCTTAT CGCATGATCT GGAGAAGGGG GAGGGTCGTA TGGTAGTGGA TTTCGCAGAA
GGGCATTATG AAGACCCCAA GGAATGGTCG AAAGGAAAGA AATGGTAAGT TTGACACTTC
ATCTCCTATG CAAGTTGCTG ATTGCAAGTA GGTTTGTCAC CATTGCAACC TCTATACTTT
GTCTCACAGT CGCTCTTGGT TCCGCTATGC CCACTGGTGA TTTACCTGGA GCTGCTGAAA
CTCTCCACGT TTCCAATGAA GCTATCTACC TTACTATTGC CCTTTTCGTC GTTGGTTTCG
GTGTCGGTCC TCTTCTGTTC GCTCCATGTA AGTTCCTTCT TGACAGGTAA CGGATAGTAG
CATTGACCTT TCACCAGTAT CTGAAGTCAT TGGACGGAAG ACAGTCTACT GCATCAGTAT
TTTCTTTTAT TTCATCTTCA CCCTCCCGTC ATGTCTCGCG CCCAATATCG CCACAATGTT
GGCTGGTCGT ATGGTGAGTC ATCGACCTGC AATCCCTCCA GGCCTTGCTG ATGAAGATTT
TTAGATCGCC GGTATCGCCT CTTCGGCTCC CATGACCAAT GTGGGAGGTA CCATTGCTGA
TATCTGGTCG GTTGAGGAAC GTGGTATTCC TATGGCTCTT TTCAGTGGTA TGATTTTGTG
AGTTAAACGA AGCGGTCGAG AGAGACCCCC GCTGATGCCT TTTTTAGCAT GGGACCTTGT
CTTGGACCAT TGTTTGGTGG TTGGATCGCT TACAAGACCG GACAATGGCG ATGGATTTAC
TGGGTTTTGT TCATTTTTGT CGGAGTCGTC TTCCTCTTCA CGCTCGTTAT GCCTGAAACT
CTCGCCCCTG TCCTCCTACG ACGGAAAGCC AAGAAACTAA ACAAGGAGAA CCACGTTGAC
TCCTATGTTT CGAAACATGA TCTCCACCAC GTTCCCCTTT CCACCACTCT GAAAACTGCC
ATGATTCGAC CATTCATTCT CATGTTCATG GAACCCATTA TCTTGTTCAT GAGTTTTTAC
TTATCTTTCG TCTACGCTCT GCTCTATGCC ACTTTCTTCG CCTTCCCAAT TGCTTTCGAA
GAAATTAGAG GGTGGAATAT GGGTACCACT GGCGTTAGTT TCGTATCTAT CATCGTAAGT
TGTCTTTTTC CTGTCATTGT AACATTTGTG CTGATTTCAG CCAGATCGGT ATTGCAGCTG
CCTTGCTCTG TATGCCCTTT CAAGAAAGAA TCTACAAAAA GGCTTGTCGA AATGGTCAAG
TCCCTGAAGC GAGATTGTAC CCCATGTTAC TTGGTTGTGT GTAAGTACTC CTGTATGGGT
GCTGTTCAAT CTCTAATGTA GCCATAGCAT CCTCCCAATT GCTCTTTTCA TCTTAGCTTT
CACATCGTAC CCTGGAATCC ACTGGATTGG ACCTTGTGTC GCTGGTGTGC TTTTCGGATT
TTCAATGGTT ATCATTTATA TCTCTGCCAA CAGTGTGAGT TTCCATTATT GTATGTCTTT
TTTTTCCACA CTTTTACTAA CGTCAAACAG TATATTGTTG ATTCCTATGC TTCTTTCGCT
GCGTCAGCCA TTGCTGCCAA GACACTGATG AGGTCTCTCA TCGGAGCCTC AGTTCCTCTT
TGGATCACTC AGTTATTTGT GAGTTGTTAA TGACATTTCA CAGGCTGGAA TCTAACTAAC
TCTTTCTTCA ATCAGCACAA CCTTGGGTTC CAATATGCTG GTCTCTTTTT AGCACTCATA
TCTTGTGTTA TTATTCCCAT TCCTTGGGTC TTCTTCCTCA AGGGTGCAGC TGTCAGAAAG
CGATCAAAGA GAGCCGAGAA GTCTGGTACC AATTAA
 
Protein sequence
MIIPAEAQVA SPPIFTEDEE VVLEEEDVEQ PDLRRIATHL HDPASTATLS DDQAGTTAGQ 
TVLSHDLEKG EGRMVVDFAE GHYEDPKEWS KGKKWFVTIA TSILCLTVAL GSAMPTGDLP
GAAETLHVSN EAIYLTIALF VVGFGVGPLL FAPLSEVIGR KTVYCISIFF YFIFTLPSCL
APNIATMLAG RMIAGIASSA PMTNVGGTIA DIWSVEERGI PMALFSGMIF MGPCLGPLFG
GWIAYKTGQW RWIYWVLFIF VGVVFLFTLV MPETLAPVLL RRKAKKLNKE NHVDSYVSKH
DLHHVPLSTT LKTAMIRPFI LMFMEPIILF MSFYLSFVYA LLYATFFAFP IAFEEIRGWN
MGTTGVSFVS IIIGIAAALL CMPFQERIYK KACRNGQVPE ARLYPMLLGC VILPIALFIL
AFTSYPGIHW IGPCVAGVLF GFSMVIIYIS ANSYIVDSYA SFAASAIAAK TLMRSLIGAS
VPLWITQLFH NLGFQYAGLF LALISCVIIP IPWVFFLKGA AVRKRSKRAE KSGTN