Gene CNF04800 details

Gene Information

Locus tag: CNF04800
Symbol:
ID: 3258491
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 1396889
End bp: 1398957
Gene Length: 2069 bp
Protein Length: 447 aa
Translation table:
GC content: 46%
IMG OID: 638257598
Product: monocarboxylic acid transporter, putative
Protein accession: XP_571619
Protein GI: 58268926
COG category:
COG ID:
TIGRFAM ID:
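
The Gene Length value can be cross-checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the numbers in this record; `gene_length` is a hypothetical helper name, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(1396889, 1398957))  # 2069
```

Because the gene is spliced, this genomic span is longer than the coding sequence implied by the 447 aa protein.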


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATCGACGGAA TCACAGAAGC TCATTTGAAA TGTCATCCGA TATTGAGCTT TTCCAACTTC 
CTGACAGACG ACCATCACAA TCAAAAATCG AGCACTCTCT CTCTTATGAT CAAGAAGAAG
AAGTAGATGA ATCCGTCACT CACCATGCCT TGCCTCCAGT AGACGGTGGT CGGAAAGCAT
GGTCATTCCT CGCTGGTGCG ACTGTCGTTG AGATGCTCGT CTGGGGATTT CCATACTCTA
TCGGTATCCT GCATGTGTAC TGGACAAATA CTCTTTTCAA AGGATATGGA GAGTCTATGG
TGGCTCTTGC TGCTACATTG CAGACCGGTC TGCTCTATAT GAGCTGCGCC GTGTTCGGAC
CGTAAGCTGG AACTCCTTTT TGGAACGACC AACTCGAGTA TGTTGACTTG CCTGTAGCAT
TTTCACAAAA TGGCCAAAGT GGCAGAAGAC TTTCCAATAC ATTGGCTTAT TTGCTGCGTC
GTTATCCTTT ATTGGTAGTG CTTTCGCGAG CAAAGTAAGC CATGTCCCTT ATTTTGAAAA
ACAGTACTGA CCATCATAAT AGCCATGGCA TCTTATCGTC ACCGTCGGCT GTATTTACGC
TTTCGCGGGA GCCCTCTATT TGCCCTGTTG TACTCTCCTC TTTGAGTGGT TTGTTGCCAA
GCGGTAAGGT CATTTCAGGT TCATCGGGGT TCGAACGAAA GTTGATTCCG CATTAGCGGT
CTCGCCAACG GTGCAATGTT TGCAGGAGTA TGTTCCATTT ATAATTTTTT TGACTTGAGG
TGCCGTCCTC CAGCTAACCA TGAAACACAG ACTGGTGTGG GAGGCGTGTT GTATCCTTAC
ATCATGAGCG GTCTCTTGAA CCATTTTGGC TATAAAACAG CCATGATTTC CATGGGTGTT
GGCTACGCCA TTTTGGGAAC TATTTCACTC ATCCCAGTCA ATCGACGAGT CCCTGTCTCT
CGACATGACT TTGTTGGACC TGGAAGAAAG AAGCCCATAA ATTTGACTTT CTTGAGGAGT
ACGCCGGGCA TCATTGGCCC TCTCATCATC CTGCTTGTCA GCTTGGGTAA TTTTATCCCT
ACACTTTGGC TTCCTTGTAC GTTTTTATAC CTTTTTGGGT ATTTAAGCTG ACCATCAACA
GCTTATGCGG ACGACCTTAA GATGAGCCGT ATAGATGGTA TAGCCTTAAT CGCAATCCTG
AATGCCGCCT CGGTTCCCGG TAACACGCTG CTTGGCTACT TTTCCGATTA CTCCCTACGC
GCCGTCATCA TCGTTTCGTG CGTCGGAAGC GCTCTCGGTT GTGCGTTCTT GTGGGGATTT
GGGACGAACC CAGGAGTCTT GATTGTTTTT GCCATCGTCT ACGGATTGTT GGGGACCAGT
TTCCAGGCAT TGTGGTCAAA TATGATTGGG GTCATAAGCA GTGAGCCGCT TGAGCGACGA
CTTTCTTTCG CAGCTAATGG TTAATTTTGT AGGGGATGAC CCGATAGCTC CTTCAATAGT
ATTTTCCATT TTCGCCTTCA TGAGAGGTAT CGGTAACATT ACTTCTGGTG AGTTTTGATA
TTGCCTTGGT TGATGAAAAT GGTCCAACAG GAGCTGATGG TGTCTACAGG ACCCATTTCA
GGCGCTCTCT TGAAATACAA CACATTTCCA AATGGTGCGG GCGCTTACGG TTTCCACAAC
TATGTGAGTA GTTTCGTCAT TTTATCGGGC ATCTCTGCAT GTAATTGACA TAATTCGCAG
GGCGCCTTGT TACTGTACAC AGCCATCACA ATCTTTTCAG GGGGTGCCGC AGGGATATTG
TTCAAGTGAT GATAGGCGAT GATAATAAAA GAGAACGACC AGACTGCATC CACTAAATAC
GACTCTTCAA CATGCTGGCA ATATCTGTCT GTATTAAACA GCGCATAGTA CCTCCATGTT
TTTTCTTACA TATACATAAT ACAACATTTC CTGTTCAATA GCACAGAAAT GTAAGTTCTC
CGTCCAATAT TTTTAATGCA CGCTACATAA CAGTACCAAG GAGACTCCAG GTATATCAAG
ATTGTTCGAT GTAAACAAAC AGCTGGGGG
 
Protein sequence
MSSDIELFQL PDRRPSQSKI EHSLSYDQEE EVDESVTHHA LPPVDGGRKA WSFLAGATVV 
EMLVWGFPYS IGILHVYWTN TLFKGYGESM VALAATLQTG LLYMSCAVFG PIFTKWPKWQ
KTFQYIGLFA ASLSFIGSAF ASKPWHLIVT VGCIYAFAGA LYLPCCTLLF EWFVAKRGLA
NGAMFAGTGV GGVLYPYIMS GLLNHFGYKT AMISMGVGYA ILGTISLIPV NRRVPVSRHD
FVGPGRKKPI NLTFLRSTPG IIGPLIILLV SLGNFIPTLW LPSYADDLKM SRIDGIALIA
ILNAASVPGN TLLGYFSDYS LRAVIIVSCV GSALGCAFLW GFGTNPGVLI VFAIVYGLLG
TSFQALWSNM IGVISRDDPI APSIVFSIFA FMRGIGNITS GPISGALLKY NTFPNGAGAY
GFHNYGALLL YTAITIFSGG AAGILFK
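
The sequences are printed in space-separated groups of ten, six groups per full line. A minimal sketch for counting residues in such a block follows; `sequence_length` is a hypothetical helper, and the input here is only the final protein line from this record.

```python
def sequence_length(block: str) -> int:
    """Count residues in a block written as whitespace-separated groups."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return len("".join(block.split()))


last_protein_line = "GFHNYGALLL YTAITIFSGG AAGILFK"
print(sequence_length(last_protein_line))  # 27
```

Assuming each of the seven preceding protein lines holds six groups of ten, the total is 7 × 60 + 27 = 447, which matches the Protein Length field.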