Gene CNL04310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL04310 
Symbol 
ID3254789 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp198033 
End bp200345 
Gene Length2313 bp 
Protein Length591 aa 
Translation table 
GC content48% 
IMG OID638253902 
Productmonosaccharide transporter, putative 
Protein accessionXP_567982 
Protein GI58261144 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTTCA AAAGAAATTC TACTGACACA CCAAAGCCGC CAAAGCATGC TCCAGATAGG 
GATACTATCC CAAGCCTAGA GGACCTAGGT GTTCAGCTCC ATCCCGATAC TGCTCGTTAT
GTGGCAGACA GTGAGGCATT GGCCCATGCA ATTGGCGGCA ACGGTCAGTA ATGTTTTTCC
CCTACATTAC CTACCTGCCT TGTACTCACA CGTCTGCAGG CCTCAAAGAT GTATTCAGCA
GTGGACTTGT TTTGATTGCC TCCTTTGCCA CTTGTATGGG TGGCCTGCTC TTTGGTTTCG
ACCAGGGTAT TCTTACCATC GTCCTTACCA TGAAACAATT CCTTGGGCAA TTCCCAGATA
TCGATCCCGA CGTCAGCTCT TCCGCGGCCT TCCACAAAGG GTAGATGTCG CTTTTGAAAT
TTATAAGGGT ACAAAGACCT GTACTGACTG TCACGCTTCC CAGACTCATG ACTGCCCTTT
TGGAACTTGG CGCTTTGATC GGAGCCCTTC AAGCGGGTTT CGTTGCAGAC AAGTATTCTC
GTAAATCGGC TATCGGTTAG TAATCAAAAC CACGTTGCAG ACAAGCGCCA TTAGCTGACT
GAAGCTGGAA AAAAAAGGTC TCGGTTCTGT GTGGTTCGTC ATTGGAGCTA TTCTGCAGAC
CTCATCCTTC TCTTATGCTC AACTGGTTGT CGGCCGATTA GTAGGCGGTC TTGGTGTCGG
TCTTCTCTCT GCTGTGGCCC CAATGTATAT TAGTGAGATT GCACCACCAA ACATCCGGGG
CGCATTGCTT GCCATGGAGG CTACTACCAT TAACGGTGGT ATCGTCATCA TGTTCTACAT
TGTGAGTGTT ATCTCTACTA GCTACCCAAA CCTTTTACTT ACTTTTGTAT AGACTTATGG
CTCTCGACAC ATCCCCGGTG ACTGGAGTTT CCGACTTCCT TTCCTCGTTC AGATTGCCCC
CTGTATCCTC TTGATATTCG GTCTCTGGAA ACTCCCATAT TCACCGCGAT GGCTTGCTCA
GGCAGGTCGC GACGAAGATG CCCTCCACGC TCTCGTACGT CTTCGAGGAT ATCCTGCCAC
CGACCCTCGA CTTCAGGCCG AGTGGATCAG CATCAGGGCG GAGGCCATCC AGAACCGAGA
GGTTATTGTC AAGAGCCATC CTTCTCTTCA AGGCGAGGAC TTTATGTCCG AATTCAAACT
CGAGATTGCT TCTTGGATCG ACATGTTCAA ACCGAAGCTT ATCAGGCGAA CTATCATCGG
TCCCACCTTG ATGATGTTCC AGCAGTTTTC TGGTATCAGT GCTGTAAGTT TTCCTCAGAT
CATGCTTGAA CGTTGAATAT TGATTCAACA ATCCATCAGC TCGTCTACTA CTCCCCTACT
CTCTTTGAGC AGCTCGGCCT CGACTATACA ATGCAGCTTG ACATGAGCGG TGTCCTTAAC
ATTATTCAGT TTGTCGCCAC GGGGCTTGCC TTTTTCATTC TTGACCGAGT AGGACGAAAA
CCTCCTCTCC TTTTCGGATC TGTGGCCACC ACTATTTGCC ATGTAATTGT GGCGGTCATT
ATGGCTAAAT TCAGTCACGA TTGGGTCCGG TACAACAAGG AGGCATGGGT GGCTGTGGCC
TTCATCTTCA TCTACATCTT CTCTTATGGT GTGGGTTGGT CTCCGGTACC TTGGGCCATG
CGTAAGTTCC TTTTGAGCTC AAGCAACATG GAAGGCCAGC GCTAATAACA GCATCGTAGC
TGCCGAGGTT CACACGTCGA GTCGCCGAGC CAAGGGTGTC GCCATTACAA CCGTCTTCAA
CTGGTTGGGT AACTTTATCA TTGGTCTCAT CACACCACCT ATGCTTGAAA ACATTAAGTA
CGGCACTTTC ATTTTCTTCG CCACCTTTTG CTTCCTTTCT GGTCTCTATG TCTGGTTCTT
CTGCCCTGAG CCCATGTAAG ATTCGAGTGT CCGAAATCCG TGTCTCGAAA TTGAATGCTG
ACGACTCTGG ACAGGGGTAA AACGCTCGAG CAGATGGATC AAATCTTCCA CTCCAACACT
GCTCATGAAG ACAACCTAGC TAAATCGGAT ATTCAGGCGG CCATCTTGGG CATTTCTATT
GCTGGACCAT TATCCGGTTC TGCCATTGCT GGAAAGGTCG GCGACAAGGA TATTCAGCAG
GAGTGGATAG AGAACGCTTG ATCTCAGAGA TGCTCAAATT GTTTTATGTT GGTATTATTA
TTTGTAGCTC AGAATAAGGA ATCAGAAGAA TCTGAAAGGA GCGTGTTTAG TCATAGAGGA
TACATACGTA TGGCTCGTGT TTACATTATT AGT
 
Protein sequence
MFFKRNSTDT PKPPKHAPDR DTIPSLEDLG VQLHPDTARY VADSEALAHA IGGNGLKDVF 
SSGLVLIASF ATCMGGLLFG FDQGILTIVL TMKQFLGQFP DIDPDVSSSA AFHKGLMTAL
LELGALIGAL QAGFVADKYS RKSAIGLGSV WFVIGAILQT SSFSYAQLVV GRLVGGLGVG
LLSAVAPMYI SEIAPPNIRG ALLAMEATTI NGGIVIMFYI TYGSRHIPGD WSFRLPFLVQ
IAPCILLIFG LWKLPYSPRW LAQAGRDEDA LHALVRLRGY PATDPRLQAE WISIRAEAIQ
NREVIVKSHP SLQGEDFMSE FKLEIASWID MFKPKLIRRT IIGPTLMMFQ QFSGISALVY
YSPTLFEQLG LDYTMQLDMS GVLNIIQFVA TGLAFFILDR VGRKPPLLFG SVATTICHVI
VAVIMAKFSH DWVRYNKEAW VAVAFIFIYI FSYGVGWSPV PWAMPAEVHT SSRRAKGVAI
TTVFNWLGNF IIGLITPPML ENIKYGTFIF FATFCFLSGL YVWFFCPEPM GKTLEQMDQI
FHSNTAHEDN LAKSDIQAAI LGISIAGPLS GSAIAGKVGD KDIQQEWIEN A