Gene CNK02660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK02660 
Symbol 
ID3254453 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp770147 
End bp772380 
Gene Length2234 bp 
Protein Length532 aa 
Translation table 
GC content47% 
IMG OID638253758 
Producthypothetical protein 
Protein accessionXP_567865 
Protein GI58260910 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.11556 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAGCCACTCA ATTCTTAAGC ACCACTACTC CACTCGCATA TAACAAGCCC AATCACTCAC 
TCTCTCCCCT TTACACATAT TATTCTAGCC AATATGGCCG CAGTTGACGA CAAAGTCAGC
ATCGAGCATG TCGAGTCGGG AGATCACTAC ACCAAACACC ATGTCACCCA GGAGGAGGTC
AAGCATGGTG ACAATGCCCT GAAATACATT GGTGATGAAC GAGTAGAAGT CACACAAGAG
GATGTGAGTA ACGAATTTTC CCTCTAGTGG TTCTGAGTTT CTTGTTTTGA AACAAAGAAA
AAAAAGTTTA CGGGATTTTC CGAACCTTAA AATGCGCAGG TGCTGACTAC TTTTTGTTGC
AGGATGCGCG AATTAGGAGG AAAACCGACA AGTACATTTT GTCTTTACTA GCATGGGTGT
ACTTCTTGCA AATCCTAGAC AAGACTGTAA GCGCTTTTTG CTCGAGACAT GTATTATTAT
TATTTCTAAT TTTGCCACCT ATAGGTATTG GGTTATGCCA ACACTTTCGG TCTCTCCGAG
GACACAAATC TCGTTAACAA CCAATACTCT CTCCTTGGTT CCATCAATGC CATTGTCCAG
TTGGCTTGGC AGCCATTCTC GTCCTACTTA ATCGTCAAAG TTCCTGCCAG ATATCTCATG
CCTGCCATGG TTTTCGGCTG GGGTGCCGCT CAAGCATGCA TGGCCGCGGC TCACAAGTAA
GTAGACTTTC CTTTTACCTA ATGATCCGTG ACGCGCAGTT ATCTGTGTAT CCTGATAATA
AAGCATGTGC TGATAACGGG ACTTAGTTTT GGCGGTTTAA TGGCGTCGCG AGCCATTTTG
GGTCTGTTTG AAGCTGGTTG TCTTCCGCTC TTTTCTCTCC TTACCTCTCA GTGGTACCGT
CGATCTGAGC AACCTGTCCG AGTGGCCGTC TGGTACTCGA CTAATGGTCT TGCCACTATC
GTGGCCGCTC TTCTTTCCTT CGGTCTCAGC CACGTCGACT CCCCTCACAT CAAGTCCTGG
CAACTCATTT TCATTGTCTG TGGGATTATC ACCTGTGTTA CAGCTCCAAT CGTCTACATG
TTCGTCGATG CCGATGTCGC TTCGGCTCGT TTCCTTACGG AAGAAGACAA GGCCAAAGGC
ATTGAGCGAC TCCGTGCGAA CCAAACAGGT ACCGGTTCCA ACGAGTTCAA AGTATCTCAC
GTCTGGGAAC TCTTCTGCGA TCCCAAGTCC TATCTTTTCT TGGCGATTTC TCTTCTCCTC
AATGTTGGCG CCTCAGTGAC TAACATCTTT GGGCCGACGC TCATCAAGGG CTTTGGATTC
AACAGCAGGA TCACCTCTCT GCTCAATATG CCATTCGGAT TCCTTCAGTT CGTCGCGATT
CTGGCTGGGT GTTACTGCGC GTACAAGTTC AAGCTCAAGT CAGCTGTCCT CGCCGTTTTT
ATCATCCCTG TTATCATTGG TCTCGCTCTC TTGTATGTCG AGAATGCCGC AGCTGTTTTG
AAGCAAGCTC CTGCTCTTGT CGGATATTAT CTCCTTGCCT TCCTTTTCGG CGCCAATCCA
ATCATCGTAT CTTGGATCGT TGCCAACACT GGTGGTCAAA CAAAAAAGGC TCTCCTTATG
AGTGTATACA ACGCCGGATC TTCCGCTGGT AATATCATTG GTCCTTTGTG AGTCGGCGCA
TTCAGCGTGT TTCATGGATT ACTCGAAAGC TGACTGTTTG GCACAGGCTC TTCCAAGACA
AGGACAAGCC TCACTATCTT CCTGGTATCA AAGCCACTCT CGGTATCTTC TGTGCCTTGA
TGGCGTGTGT CGGTATCACT GCGGCTCTTC TTTTCGCTCT TAACAAACAG AGGCAGAGAC
AACGTGTTGC TGTCGGCAAA CCTCAATACA TCAAGGATAC TTCTATGAGC AACAAGTACG
AAGCCTATGG TGCTGATGAC GTGGACGGAA GGCTCGGTCA GAATGGTATG TCTCCGTTTT
TTTATGTTTT AACCTGCTGT GCATCGCTAA CATTACTTAT ACAGCCTTGC TTGACTTGAC
CGACTTTAAA AACGACGAGT TCGTGTATGT GTATTAGGAG TCAAAGCTTG TTGCGCCTGG
GTGGAATCCA CCGGCTATTG GCATAATAAG GTGGTATATG ATCATGTAGG GGTCGTTTAG
AGCATTTTTC GATCATCAAA CATATCTAGT GTAATCAGTA TATTGATACG TGTACAACTA
CAATGCAAGA ATCA
 
Protein sequence
MAAVDDKVSI EHVESGDHYT KHHVTQEEVK HGDNALKYIG DERVEVTQED DARIRRKTDK 
YILSLLAWVY FLQILDKTVL GYANTFGLSE DTNLVNNQYS LLGSINAIVQ LAWQPFSSYL
IVKVPARYLM PAMVFGWGAA QACMAAAHNF GGLMASRAIL GLFEAGCLPL FSLLTSQWYR
RSEQPVRVAV WYSTNGLATI VAALLSFGLS HVDSPHIKSW QLIFIVCGII TCVTAPIVYM
FVDADVASAR FLTEEDKAKG IERLRANQTG TGSNEFKVSH VWELFCDPKS YLFLAISLLL
NVGASVTNIF GPTLIKGFGF NSRITSLLNM PFGFLQFVAI LAGCYCAYKF KLKSAVLAVF
IIPVIIGLAL LYVENAAAVL KQAPALVGYY LLAFLFGANP IIVSWIVANT GGQTKKALLM
SVYNAGSSAG NIIGPLLFQD KDKPHYLPGI KATLGIFCAL MACVGITAAL LFALNKQRQR
QRVAVGKPQY IKDTSMSNKY EAYGADDVDG RLGQNALLDL TDFKNDEFVY VY