Gene CNF04090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF04090 
Symbol 
ID3258310 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp1184097 
End bp1186861 
Gene Length2765 bp 
Protein Length601 aa 
Translation table 
GC content47% 
IMG OID638257527 
ProductMSF transporter, putative 
Protein accessionXP_571381 
Protein GI58268450 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0723621 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGACAGTATA CCTCCAGTCC AGGTACATGA GCATATACAT CCGGTTCTAT TCCCCCAGCC 
CTCCGAAGAA AGAACTATAG ATCGACGAAA TCAATTCGGA ACTCGGGGAT CGCACGCCAT
TCAATACCGG GCCATTCTTC CGTCGTAGTA TTGTACCGGC TCAAGAACTA TTCACCAGCT
CCAGCTGAAG AACTCGTCTC TTCGTCTACA ACGCTTTCAC CACCTCCTTC CACTGCCAAT
AATATGTCTC AGAAGACATC ATCTAGCATT ACTATCCACA GCTCAGAAGG CTCAACTGCC
GCCAATACAC CGATGGAACT AGCCGATCCT CTTAAGACGG GGTCGAAAAT GACTCCGGAT
TTCGAAAAAG AGCAATCATC ACCATCGTCC CAACCATCGA GCCCCTCCAC ACCACAATCA
ACCGTCTCAA ATCGTCTCGT ACCTTTATAC AGATCTGCAA CAGATAGACG TTCGCGTGCC
CACTCCAATT CTTTGTCAAG ACGCACAAGT GGGCAGACGG CAGAACTTCA AAATGAGCTT
AGAAGACATG TGTCATTACA TGGCGTTGCC ACGAATTGTG GGAGTGAGAG TGTCATGGAG
GCTAGAAAGA GGCTAGATAT GGGAGGAGAG AAAGGGCATG AAGAAGTAGT GGTTATTGAT
TGGTTACCCA ACGATCCTGA TGTAAGTATC GATCAAGCTG CATGGCATTT ATACTCATTG
ACTAATACTA TTTGTCCTGG TCAGAATCCA GTCAACTACT CCTCTCCAAG AAAATACATC
ATTCTCACTG CTGCGAATAT TGCTGGTTTC ATCGCTGCTT CCAACCTTGC CTCCTGCGCT
GTCCTCGGCA CTTGGGGTGT GCCATACTAC GAAATATCAA GAGAGGTCTG GGTCCTGTCG
ATCACATTAC CGATGATTGC ACTTGCTGTT GCTCCTTTAA TATTGGCACC ATTAAGTGAA
AGTGTAAGTT CAAATTTCTC TCCCAGAGCC TGGCGTAAAG ATTTACAGGA CGTGTAGCTT
GGTCGAAATA TGGTGTACCA AGTCACATCC GTCATGTGAG TTCAGTATCT TTCTGAATCG
GTTCAGCTTT TACTTACTTG TGCTGCAGCA CGGCGGTGCT CTTTGTGCCT CAAATATGGA
ACAACAAGAA CGTTGGCGGT TTCCTTGTAT CACGGTTTTT CGTGCGTTCT ACTTCATCTT
TCCGCGCTAG CAATAAATGT ACGTGTATGC TGATATAAAC CACTGTAGCA AGGTATTGGG
ATGTCTGTGT CCAATTCTAT GGTAGGCGGT ACTGTCGCAG ACTTGTTTTC CCCAACCGAC
CGAGGGTTCC CCATGTCCCT CTTTACTCTA TCAATCTTTT GCGGGCAGGT TAGTTGTGCA
ATGAGATTTC GTCTCCTAGG TTTACTAACA TCAGACTAGG GTCTTGGCGT GTGCTTTATC
GGATGGTCAG GGCAGGGACT TTCTCTTCAA TGGGCATACG GGGTAAGTCT AAGTCCATGC
TTTCAATTCT TCTCATCCCT GGCCTTTTTA GCTCACTAAC GAATGATATT CAGGTCCAAG
CTATTATTGC CACCGCTTCC ATCATCTTCA ACATCTTTTT CATGCGTGAG ACACGAGCCG
ATGTTCTATT ATCTTGGCGT GCGAAGAAGA TGACGAAAGA GACGGGCATC AAGCACATCG
CTGCAGCTGA CTTGGAGAAG ACGGATATGC TTACTTTAAT TAAAGTGTCT CTCATCAGGC
CTTTGCGTGA GTATTGATCC GTCGGACTTT GAGACTATAG AGATTGTAGG AACAGTGCTA
ACGGTGTGGC AGAGTATTTG GTGACAGAGC CCATCGTCTC TGCCCTTTCT GCCTGGATCG
GTTTTGCTTG GGCATGTATT TTCTTGAGTC AGAGTTCTAT TCTCCTGGTC TTTGAGTCTT
ACGGATTCAA CGCCGCCCAA GCTGGCAGTT TTCAAGCGTA AGTATAATAT TCCGCTAGTT
TTCGCATTAT CCGCGCACTG ACATTATACT CTTAGGTCCA TGGCCATTGG AGCTGTCCTT
GGTTTGATCT CGCAGAACCA TCAAGAATAC CTCTATCGCC GCTCCTGCGC CAAACACAAC
GGCAAAGCAC CTCCCGAAGT CCGTCTCTAC TGGGCGGCAT ACGGCGGTCT TCTGTTTCCT
TTTGCGCTGT ACGTATACGC ATGGACAGGA CAGGCAGGTG TCGTGCATTG GGCAGTTCCT
GGCGTGGCGT TAGTGTTCAT GAATTGGGGA GTGTTTGCGA TGTACAGCGG TGTTTTGTGA
GTTTTGGCTT TTATTTGTGC ATTTTTAGGG ACGCTTTGTG CTGACGGAAG TTGGATGGCG
AAAAGCACAT ACTTGGCAGA TGCGTACGAG ATATATTCTT CATCAGCTCA GGCTGCTCAG
AGCTTTTGTC GAAACATGGC CTCAGGTATC TTCCCGCTTT TTGCTCATCA ATTGGTGCGT
GCTTTCTAAA TAGATATGTT GAACCTACGA TGTGCTGACG GAAATCGTGG CCGTAGTATG
TGAACCTTGG CTATCCTGAA GCTTCAACTC TCGTTGCGAG TATAGCACTC TTTCTATCCG
CTGCTCCCAT ATTACTAGTG TTTTACGGGA AAAAATTGAG AGCACAAAGT AAAGTTACAA
GTCAGTTGTT GAAAGACGAA TGATGGTGAT CCTATATGGA CAAATAGTAG TAACACTGCC
ATCATCATGG TCGATGGCTT GTGTAATTCA CATCTTGTCA GCGCATGTTG CATAAATTCT
TACAG
 
Protein sequence
MSQKTSSSIT IHSSEGSTAA NTPMELADPL KTGSKMTPDF EKEQSSPSSQ PSSPSTPQST 
VSNRLVPLYR SATDRRSRAH SNSLSRRTSG QTAELQNELR RHVSLHGVAT NCGSESVMEA
RKRLDMGGEK GHEEVVVIDW LPNDPDNPVN YSSPRKYIIL TAANIAGFIA ASNLASCAVL
GTWGVPYYEI SREVWVLSIT LPMIALAVAP LILAPLSESL GRNMVYQVTS VITAVLFVPQ
IWNNKNVGGF LVSRFFQGIG MSVSNSMVGG TVADLFSPTD RGFPMSLFTL SIFCGQGLGV
CFIGWSGQGL SLQWAYGVQA IIATASIIFN IFFMRETRAD VLLSWRAKKM TKETGIKHIA
AADLEKTDML TLIKVSLIRP LQYLVTEPIV SALSAWIGFA WACIFLSQSS ILLVFESYGF
NAAQAGSFQA SMAIGAVLGL ISQNHQEYLY RRSCAKHNGK APPEVRLYWA AYGGLLFPFA
LYVYAWTGQA GVVHWAVPGV ALVFMNWGVF AMYSGVFTYL ADAYEIYSSS AQAAQSFCRN
MASGIFPLFA HQLYVNLGYP EASTLVASIA LFLSAAPILL VFYGKKLRAQ SKVTSQLLKD
E