Gene CNL04910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL04910 
Symbol 
ID3254875 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp381027 
End bp383532 
Gene Length2506 bp 
Protein Length526 aa 
Translation table 
GC content47% 
IMG OID638253963 
Productconserved hypothetical protein 
Protein accessionXP_568033 
Protein GI58261246 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.413844 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGATC GTCCACCTGA TCAAACTGGT CCATCGAAAA ACTCTGAGAA ACAGGCGGTG 
AAGGGTCTCA ATGCGATTGA CATTGAGCAT CTACCAGTCG ATAATGATCC AAGGAAGTGG
AGTGATAGGA AAAAGTGGAC AGTATTGATG ATCATGACTT TTGCGATGGT GAGTACCTGT
AAGCAAGTCC CATGGCGAAG AACATAGGGT ACTGACTGGG TGGTCGTCCT CTAGTTGGGC
CCAATGATGG CTCCCAGTAT TTACAACCCG GTAATCAGCG ATGTTCGCGA AGACCTGCAC
GCCACAGAAA GTGAACTCGG TTTATCGATC TCTCTTTACA TCCTGTGAGC TTCGATCAAT
CTTTGCGGCC GATGCCCAGT ATCAAACCGA GAACGCTAAT GCACCTGTCA GATTTCAAGG
GTGCACTCCC GTCTTGTGGG CAGCTATTGC GGAAGTAAGT ATCATCTATG GCGATCATTA
TTAAGAATGC TTACTCACAT CTGGCCATAG CTCAATGGAC GCAAGGTATG TCTTGTATAG
AAACCGGTGG AGCACTCGCT AACGTATCGT TGCAGACTGT CTATCTGGTG TCATACACAG
TAAGCCTTAC ATCAAAACGT ATCACCAGTT GCCAGACTAA ATGATGATTG TAGCTCTATG
TTATCGCTCT CGTGGTTGCT TCTCGGGCCA ATTCAATGCC CCTTCTGATT GTAATGCGTA
TCTTACAGTC CACAGGCAGC GGACCAACAG TTTCTTTGGG AGCCGGCAGT CTGGCCGATA
TGTACGAGAC GCATGAGCGA GGGGCAAAGG TGAGTTTCAT CCTTTAAATT AGCCATTTGC
TCATTTACTG TATGTCTGCA GTTGGGCCTG TTCTACGGAG TGCCGATGAT AGCTCCTGCC
GTAGCACCAC TCATCGGTGG AGCTCTAGGC CAAGTACGGT TTCATCTTTT GGCTATGTTG
ATGTTTGCTA AAGTATAACA GTCGTTTGGC TGGCGAAGTG TTTTCTACTT CCTTGCTGTG
TATGCTTTCA TCATGTTGTG CTGCTTCATG GTGTTTCCGG ACTCTTGGCG ACGTCAGGTG
ATTTCTTTCT GCTCATTTGG AGTAAAAATA ATGTCAACTG ATTTCCACAA TGTTAGCGTT
CGAGAGTTTA CCAAAGAGCG CTTACCAAAG CAATCGAAAG AGCAGAAAAT CGGGATGCGA
AAGATTTAAA GCGGAAGGCC AAGCTCGCAA AACAGTCGCA AGTGTTAGAC ACCATACCAG
CCACGCCAGA TGCCACTCCC GGCAACTCTC CTGGTAATAG CAGAAGGCCT AGTGCCGAGG
CTGGAGGGGC CGCGGTGACG GATACTACAC TGGTTGATGT GACCTTGAGT AATGACCAGA
AGCAAACGGT AGGCAGATCG GGCAGGATAA AGGTCAAAAT GATGAAATGG TTGCCGTTTG
GGGCAAAGGA AGAAAAGATA CAGGCCGAGA ACGAAGTGGA ATTCAAACCC ACCTTCAGGG
ATCTCAACCC TTTGCCATCG ATGGCTCTTA TTCTCAAACA GCCCACCAAT TTACTCATTC
TTACCTCATC GGGTGAGCGC CCTTCTCTTT TCATAAGTAC TCAGTCTATT AACATGCCGG
CAAATGTCAG CCTTGAGTTT CTCTGCTCAA TATACCATCG TCTACACAGC ATCAATTACC
CTTGGTAAAG CTCCTTACAG TTATGGATCG CTCAAGATAG GTCTTGTGGT CTTGGCGTTT
GGAATAGGCA ATATTCTAGC TAGTATGATT GGAGGGAAAT ATTCTGACAT GGTGTTGAAA
AAACTGAAAA AGAAGAACGG AGGTGTGGGC AATCCCGAGG TACGCGAATA TGTTTATATT
GCGGAAATGA CAAGCTGAAG CACTTGGACA GATGAGGCTG AGATCCACAG TGTTGGCGAT
GCCAATCCTC GTCGCTAGTT TCCTTGCCTA TGCTTGGACT GCGGAGGAGA AAGTGCACAT
CGCAGCCTTG GTGGTTTGCC TGTTCTTTGC TGGTTTCTCT CTCTTGCAAG TACAGTTTCT
AGTGTAGCGC ACGACTCTTC TGACATGCGG CTTTGTTATT CAGGTGGATC TATAGCAGTA
CTCTGGCGTA CGTGGTGGAC GCCAATCCTG TACGTGTTAC ACTTTTTTCA TCCCTGCGTC
AAGTCAGTTC TGATGCTGAT GATTGTCTAG GGTGCTTCCA GCTCTGCAGT CTCTTGTAAT
TCAATGTTTA GGGGTATCTG CGCTTGCGTT ATGTCACAAG TCGCTACTCC GATACAGAAT
GGTATAGGAG ACGGGGGGTT ATATACCTTG TTTGCTGGGA TCTTGGCATT CGCCTGTGCT
TGCAACTTAT TGCTTATGGG TGCGTTTCCG CCCTTGCTTT TTGATACATA AACAGAAAGC
TGACACTTGT ATACCTAATA CAGTGAAAGG TGAGCAATGG AGGTCTCCCG AACATCGTTG
GCCCTGGCAA AAGAAGCGAG AAGAAGGAAA CGATGAGAAG GAATGA
 
Protein sequence
MIDRPPDQTG PSKNSEKQAV KGLNAIDIEH LPVDNDPRKW SDRKKWTVLM IMTFAMLGPM 
MAPSIYNPVI SDVREDLHAT ESELGLSISL YILFQGCTPV LWAAIAETVY LVSYTLYVIA
LVVASRANSM PLLIVMRILQ STGSGPTVSL GAGSLADMYE THERGAKLGL FYGVPMIAPA
VAPLIGGALG QSFGWRSVFY FLAVYAFIML CCFMRSRVYQ RALTKAIERA ENRDAKDLKR
KAKLAKQSQV LDTIPATPDA TPGNSPGNSR RPSAEAGGAA VTDTTLVDVT LSNDQKQTVG
RSGRIKVKMM KWLPFGAKEE KIQAENEVEF KPTFRDLNPL PSMALILKQP TNLLILTSSA
SITLGKAPYS YGSLKIGLVV LAFGIGNILA SMIGGKYSDM VLKKLKKKNG GVGNPEMRLR
STVLAMPILV ASFLAYAWTA EEKGASSSAV SCNSMFRGIC ACVMSQVATP IQNGIGDGGL
YTLFAGILAF ACACNLLLMV KGEQWRSPEH RWPWQKKREE GNDEKE