Gene CNH00110 details

Gene Information

Locus tag: CNH00110
Symbol:
ID: 3259310
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006693
Strand:
Start bp: 1151780
End bp: 1153983
Gene Length: 2204 bp
Protein Length: 537 aa
Translation table:
GC content: 49%
IMG OID: 638258474
Product: hypothetical protein
Protein accession: XP_572205
Protein GI: 58270098
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
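
The Gene Length field can be reproduced from the Start bp and End bp fields, assuming both coordinates are 1-based and inclusive (an assumption, though it is consistent with 1153983 - 1151780 + 1 = 2204). A minimal sketch; `gene_length` is a hypothetical helper name, not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the Start bp and End bp fields of this record.
print(gene_length(1151780, 1153983))  # 2204
```

Because the gene is spliced, this span is longer than the 3 x 537 = 1611 bp needed to encode the protein. The difference is presumably intron and flanking non-coding sequence.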


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCCGAGTTAT CTACCGCTAA GTCTATTAAG CCCTCAAAAC TATCCACCCT TTAGATCACC 
AACAAACGCC ACCATGTCCG CCTTGATGGA CGAAAAATAC AGTATCGAGC ATACCGAAAA
TGGGGAGGTC TTAAACAAAG ATGATGGCCA CCACCATCTG ACCCAGCAGG AGATGAAGCA
TGGTGACAAT GCCCTCAAGT ACATTGGAGA AGAGCGAATT GAGGTTACGG TTGAAGATGT
GAGTTACTGT TTGTCTCCAG CAATTGTTTT CAGGCGACTG ACACCTCTTG TAGGATAAAC
GAATCAGGAA ATTGACAGAC AAGTACATCC TGTCACTACT CACATGGGTC TACTTTTTGC
AGATCCTTGA CAAGACTGTA TGTCAGCGTT ACATACAATC AATGAATTAC TCTAACCAAA
ACGTAGATCC TCGGTTATGC CAACACATTC GGAATGTCCA CTGATACCCA CCTTGTCAAC
AACCAATACT CTCTCCTGGG CTCCGTCAAC GCCATTGTAC AACTAGCTTG GCAACCGTTT
TCCTCTTATC TTATCGTTAA GGTCCCCGCT CGTTACCTCA TGCCTGCAAT GGTCTTTGGC
TGGGGCGCTG CTCAAGCTTG CATGGCCGCC GCTCACAAGT AAGTCTTTCC CATCCCTTTC
CACTGTGGTG ACATACCACT CCTTTGGCCA CGTGCTGATA AAGCCTTTTG TAGCTTTGGA
GGTTTAATGG TTTCTCGAGC CCTCCTTGGT CTCTTCGAAG CTGGTTGCCT TCCTCTCTTC
TCTCTTCTCA CTTCCCAATG GTACCGTCGA TCGGAACAGC CCGTTCGTGT CGCCGTATGG
TACTCCACCA ACGGTCTCGC CACCGTTGTC GCCGCTCTCC TCTCTTTTGG TCTTAGTCAC
GTCTCCTCCC CCCACATCAA GGTCTGGCAG CTCATCTTCA TCATCTGCGG TGGTATTACT
TGCCTGACCG CTCCTGTCAT CTACTTCTTC ATTGACGCCG ACGTTCCCTC TGCCCGATTC
CTTTCTGAAG AAGACAAGGC GAAAGGTATT GAGCGACTGA GGGCTAACCA GACCGGTACC
GGTACCAACG AATTTAAACT CTCCCACGTA TGGGAACTCT TTTACGACGT CAAGTCTTAC
CTCTTCTTGG CCCTCGCATT GCTCTTGAAT GTGGGCGCAT CAGTTACTAC AATCTTTGGT
CCGACTCTCA TCAAGGGCTT TGGATTCAAC AGCCGAATCA CCTCACTGCT CAACATGCCA
TTCGGTTTCC TGCAGTTCCT CGCTATTCTT GCAGGCTGTT TCGCCGCTTA CAAATTCAAG
ATCAAGTCTG CCGTTCTCGC CTCATTTGTC ATCCCCGTCA TCGTCGGTCT TGTCTTGCTT
TACGTCGAGA ATTCTGCTGC TGTGCTCAAG CAAGCTCCTG CTCTTGTCGG TTACTACCTC
CTCGCCTTCC TCTATGGCGC CAACCCCATC ATCGTGTCAT GGATCGTCGC CAACACTGGT
GGACAGACCA AGAAAGCGTT GCTCATGAGT GTCTACAACG CCGGTTCTGC TGCTGGTAAC
ATCATCGGCC CTTTGTAAGT CCCTCTTCCG CCTACTCTTA TCCCAACTTC GCTGACCCGT
CCTTCTTCAC GTAGGCTCTT CCAAGACAAA GACAAGCCCC ACTACCTTCC CGGTATCAAA
GCCACCCTCG GTATCTTCTG CGCTCTCATC GCCTGTATTG GCTTCACAGC CGCCTTCCTC
TTCTTCCTTA ACAAGCAGAG ACAGCGACAG CGTGTGGCCG TGGGCAAGCC TCAGTTTATC
AAGGATACTT CGATGAGCAC CAAGTATGAG GCTTATGGTG GTGATGATGT CGAAGGCAGA
CTCGGTCAGA ATGGTGAGTT TTCCCTCTTT CTCCCTCCCT TGAACAGATT GTTGACGAGT
CATATCTTAG CTTTACTCGA TTTGACCGAC TTCAAGAACG ACGAGTTTGT TTATGTCTAC
TAGTGCATCA AAACGCCCGG ACCCAGTTTT CGTCTTTGTC TCCTGCTTGT TGTTTTTTTT
TTTTTCTCAC TTCGCATTTC GACCCTTTTC GGGCCTCGAT TCATAGGATT GCTAGTCCAA
AAGGACAGTT GTATTACTTC ATATTAGGGA CTTAGTTGAC AAATAAAGTT GGAAATTATA
TCTCACATAC CATCTATGTA AATAGTCATC AATCAATGCA ACTA
 
Protein sequence
MSALMDEKYS IEHTENGEVL NKDDGHHHLT QQEMKHGDNA LKYIGEERIE VTVEDDKRIR 
KLTDKYILSL LTWVYFLQIL DKTILGYANT FGMSTDTHLV NNQYSLLGSV NAIVQLAWQP
FSSYLIVKVP ARYLMPAMVF GWGAAQACMA AAHNFGGLMV SRALLGLFEA GCLPLFSLLT
SQWYRRSEQP VRVAVWYSTN GLATVVAALL SFGLSHVSSP HIKVWQLIFI ICGGITCLTA
PVIYFFIDAD VPSARFLSEE DKAKGIERLR ANQTGTGTNE FKLSHVWELF YDVKSYLFLA
LALLLNVGAS VTTIFGPTLI KGFGFNSRIT SLLNMPFGFL QFLAILAGCF AAYKFKIKSA
VLASFVIPVI VGLVLLYVEN SAAVLKQAPA LVGYYLLAFL YGANPIIVSW IVANTGGQTK
KALLMSVYNA GSAAGNIIGP LLFQDKDKPH YLPGIKATLG IFCALIACIG FTAAFLFFLN
KQRQRQRVAV GKPQFIKDTS MSTKYEAYGG DDVEGRLGQN ALLDLTDFKN DEFVYVY
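
Both sequences are listed in blocks of 10 residues, 60 per line. To check them against the length and GC fields, the blocks first have to be collapsed into one string. A minimal sketch of that step; the function names are hypothetical, and the short string in the example is toy input laid out like the listing, not the full record:

```python
def collapse_blocks(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an ungapped DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Toy input in the same blocked layout (not the complete gene sequence).
toy = """CCCGAGTTAT CTACCGCTAA
AACAAACGCC"""

seq = collapse_blocks(toy)
print(len(seq))          # 30
print(gc_fraction(seq))  # 0.5
```

Applied to the complete Gene sequence and Protein sequence listings, the collapsed lengths should match the Gene Length and Protein Length fields, and the GC fraction should round to the reported GC content, assuming the listings are complete.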