Gene CNM00050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNM00050 
Symbol 
ID3255110 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006682 
Strand
Start bp12166 
End bp15364 
Gene Length3199 bp 
Protein Length801 aa 
Translation table 
GC content49% 
IMG OID638254165 
Productamino acid transporter, putative 
Protein accessionXP_568372 
Protein GI58261924 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCATTTCCTG CTCATTACAA TTCACACCCC GCCCATAGCC GCTCCGCCAA AGTACTATGT 
CCCCACCTCA AGGAAAGGGT GTACCGGTTC CTAACCTTGG CCTGAAGCAC GAGTCAAAGA
AGAAGGGACA TGTCATGCAG AGCTCATTGG AACTTGTTTT CAGTTGTACG TATCGAAATT
TGTTCTCGGT ACGCGGCGAT ATGCTAATAA ATAGCGAATA GATTCTCGGT CACAGCAAAT
GCATTATGGA TCACGATCAG CTCCCTCTTT CCTAGACGAC AGTTCATTTC GAGGCGATCA
ATTTGATGTT GATGACTACG ATGAGGATCA AAATTACAAC GATATCTACG ACGGGATAGA
GAGAACTGGA GAGGAGCAGT TTGAAGGTAG AGATTTGAGT GGCAGATTGG AAGATCTGAA
AGTGGCTGGC GCTATTTCTG AAGCAAATGA AGATGGTAGC ACGCAAGAAG AGCTTGGACA
AGACTCGCAC ATTACGAGAC GATCATCCGA CGAACCGGCG TTCTTCCACT CCGACTCTGT
GCCTATATCA CCCATTTCCC AACTTCAACC GTCGGTGAAC CAAGGCGCCA AAGTAAACAC
ACTCCTTCCA GATGGTATCG GGCCGATTCA AGGGCTCTTT TTGGCTGCTA TGCCATCTCC
CCGACTTGCA CCCGTTCTTC CACATCCCGG ATCGCTCGAC ACTTCATATT GTACAGAGGC
GGGACCATCA TCTAATCCAC GTACATCCAA GCGATGGAAC AGTCATAGTA GGGGTGATTC
GAACCCGATG GAGATTAGAG AATCTGCTGC TCTAGTATCT TCATCCCCAC AATACCATGG
CAAAGGCAAG AGTGCAGAGA ATCGGCCTTT GCTGGATGGT CGGCCGACAG AAGGGTATGC
TTCGATACAG TCGCAAAAGG AGATAAGACG AAAGTCAAAT AGAAGAAAGC GAAGCGAAGA
TGGCCAGAGC ACAGAAGGAC AAACCGTAAG TCTGATCCTC TCTATCTCTC TCTCCCACAC
TAAAACCCGC TTGTTCTATC TCCAGCTGTT CAACGCGACG GCAGTACTAG TTGGTATAGG
TCTCCTCTCC CTTCCACTTG CTTTCGCATA TGCCGGATGG ATAGGAGGTA CGATCATGCT
TCTCGGCTTC GGGTGGCTGA CTTGCTATAC GTATATTTTG TTCTCATCTA CTAACTTTAC
CGGAACACTA ATCATGAGGC TCGTCATAGG GCGAAGCTAC TGGCGAGACT GATCAGAGCC
GACGGAAGAA TGATGGGATA TACAGACATT GGTCTGCGAG CTTTCGGCAG TTGGGCGGGT
GCGGGTATAA ATCTATTGTA AGTGGCGGTC ATATCAATCA CTGAATGATT ATGCTGATTA
CAGGGTAGAT TCTGCATGGA GCTCTTTGCA TTAGGGTACA TTTCATCGTC TCGTCTTGTA
ATGATGGTAT ACTGACAACA GGCTCTAGTG TTGCCCTAGT TCTTCTTTTC GGAGACACAC
TCAACGTTCT CTACCCTTCA ATACCATCAA ATGTCTGGAA GCTCGTTGGT TTCTTCATGT
ACGCTCCACC GAGTTTTCAC CAGTGTTCAC ACCCTTGGTT TGCTAACGCC CTTTTGATTG
ACAGTATCGT GCCCACTGTT CTGCTTCCTC TCCGTCTTCT GTCACTCCCT TCTCTCCTTT
CCTCAATCTC CTCCTTTTTC CTCATCATAG TTCTTCTCGT CGACGGTTTT CTGCCTTCCC
CTGAGCCCTC ATCGGCTTCG ACCGGCTCTC TCCTCCACCC CTCACCAACC AGCCTTTCTC
CTGAATGGTC CCGTGGTAAC TGGTTGGGCG GAATAGGGCT TATTCTAGCC GGTTTTGGAG
GCCACGCAGT GATGCCCAGT TTGGCGAGGG ACATGAAGCG GCCAGAGAAA TTTGATGGGA
TCGTTAATTG GGCATTTGTA AGTTTCGCTA TTGTCTCTTC CTATCCTCTC TCAAACTCGG
TGTGACCATA TAACAAACTT TTTTTCGTTT TCTGACTCGG GATTTGACAC AGGCGATAGC
AACTGGCATA TCTTTCACAG CAGGTGCCGC TGGGTATCTT ATGTTCGGTG AGACCGTATC
TGACGAAGTA TGTCCCTCCA GCTCTCATTT TTTCCAGGAT GGTTCCCTAG GCTGATGAGA
TGTTGCAAAA TTACCTAGGT TACGAAAGAC CTGATGCGAG AGAAGTATCA TTACCCTCGA
ATACTCAATA TTGTGGCTCT ATGGATGATT GTCATCAACC CCCTTACAAA ATTTGGGCTT
TCCTCTCGTC CTGTGAGTGT TAATTTATTT TATGTTCGCC TTTTCTTGAG AGCTAGGCTC
CATTAGTATG GGATCAAACG CTAATGGATA GCTCTCTCTT GCAGCTCAAT CTGACAATCG
AAGGAATACT TAGGATATCC CCTTCTCCAC CACCGTCCCT CTTCTCTCCC TTTGATGGTG
GACTCGAATC AGCGCTCGGG TCGGGTGCAC CAGAGACAAG CCAAGTTAAT TTTCCCAAAC
GTCGCGATCG GTCATCTTTC TCTACTCGCA ACGATCCCCG CACATCCCGT CGACCGTCTG
CCCTCACCTC TCCATCCAAC TCTCGCCCCT TTCCTCAGTC CCAATCTCAG ATAGTCGCCT
TCGAGCAATA TGTCTCTAAA GAACGCAAGA AGAGATGGCT GCGAATGGTG TCGCGAGTGG
TCATTACCGC TCTCTGTGTG GGTGTCGCTG TAGTCTTGCC GGGTTTCGGA CGTGTGATGG
CCTTTTTGGG AAGTTTCTCC GCATTTATGA TTTGTATCAT CTTGCCTGTA CGTCCTCATC
TGCGTTTTCA CCATCCTCCC TCCGGAACTC AAAACTAACT TTTGTTTTTA AAAAGCTCCT
ATTCTACATC CGCCTGTCCC CCACTCTCCT TCCAACTTGT CCCCCGTCGC CCCATTCGCT
CACATTCCCA CCGACTGCCC GCTCGCAACC AACGTCAAAA TGGGCCAGCG AGAAATTTAC
GAATGCGATA CACTGGGTAT TGGTAATCGC CAGTACGGCG CTGATGATAG CCGGGACTAT
ATGGGCGTTC TTGCCAGGAA GTGGGCATGA TGAGTTGGAA ACGTAGCGGT TTCGAGGGCA
TAGAAGGGCT GTAACAGTTG TAGCAAGTGT TTCGTTTTGA TAGTTGTATT TTTCAAATCA
GGTCTCGCAT TTTACTTTG
 
Protein sequence
MSPPQGKGVP VPNLGLKHES KKKGHVMQSS LELVFSYSRS QQMHYGSRSA PSFLDDSSFR 
GDQFDVDDYD EDQNYNDIYD GIERTGEEQF EGRDLSGRLE DLKVAGAISE ANEDGSTQEE
LGQDSHITRR SSDEPAFFHS DSVPISPISQ LQPSVNQGAK VNTLLPDGIG PIQGLFLAAM
PSPRLAPVLP HPGSLDTSYC TEAGPSSNPR TSKRWNSHSR GDSNPMEIRE SAALVSSSPQ
YHGKGKSAEN RPLLDGRPTE GYASIQSQKE IRRKSNRRKR SEDGQSTEGQ TLFNATAVLV
GIGLLSLPLA FAYAGWIGGT IMLLGFGWLT CYTAKLLARL IRADGRMMGY TDIGLRAFGS
WAGAGINLLV HFIVSSCNDG ILTTGSSVAL VLLFGDTLNV LYPSIPSNVW KLVGFFIIVP
TVLLPLRLLS LPSLLSSISS FFLIIVLLVD GFLPSPEPSS ASTGSLLHPS PTSLSPEWSR
GNWLGGIGLI LAGFGGHAVM PSLARDMKRP EKFDGIVNWA FAIATGISFT AGAAGYLMFG
ETVSDEVTKD LMREKYHYPR ILNIVALWMI VINPLTKFGL SSRPLNLTIE GILRISPSPP
PSLFSPFDGG LESALGSGAP ETSQVNFPKR RDRSSFSTRN DPRTSRRPSA LTSPSNSRPF
PQSQSQIVAF EQYVSKERKK RWLRMVSRVV ITALCVGVAV VLPGFGRVMA FLGSFSAFMI
CIILPLLFYI RLSPTLLPTC PPSPHSLTFP PTARSQPTSK WASEKFTNAI HWVLVIASTA
LMIAGTIWAF LPGSGHDELE T