Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNM00050 |
Symbol | |
ID | 3255110 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006682 |
Strand | + |
Start bp | 12166 |
End bp | 15364 |
Gene Length | 3199 bp |
Protein Length | 801 aa |
Translation table | |
GC content | 49% |
IMG OID | 638254165 |
Product | amino acid transporter, putative |
Protein accession | XP_568372 |
Protein GI | 58261924 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCATTTCCTG CTCATTACAA TTCACACCCC GCCCATAGCC GCTCCGCCAA AGTACTATGT CCCCACCTCA AGGAAAGGGT GTACCGGTTC CTAACCTTGG CCTGAAGCAC GAGTCAAAGA AGAAGGGACA TGTCATGCAG AGCTCATTGG AACTTGTTTT CAGTTGTACG TATCGAAATT TGTTCTCGGT ACGCGGCGAT ATGCTAATAA ATAGCGAATA GATTCTCGGT CACAGCAAAT GCATTATGGA TCACGATCAG CTCCCTCTTT CCTAGACGAC AGTTCATTTC GAGGCGATCA ATTTGATGTT GATGACTACG ATGAGGATCA AAATTACAAC GATATCTACG ACGGGATAGA GAGAACTGGA GAGGAGCAGT TTGAAGGTAG AGATTTGAGT GGCAGATTGG AAGATCTGAA AGTGGCTGGC GCTATTTCTG AAGCAAATGA AGATGGTAGC ACGCAAGAAG AGCTTGGACA AGACTCGCAC ATTACGAGAC GATCATCCGA CGAACCGGCG TTCTTCCACT CCGACTCTGT GCCTATATCA CCCATTTCCC AACTTCAACC GTCGGTGAAC CAAGGCGCCA AAGTAAACAC ACTCCTTCCA GATGGTATCG GGCCGATTCA AGGGCTCTTT TTGGCTGCTA TGCCATCTCC CCGACTTGCA CCCGTTCTTC CACATCCCGG ATCGCTCGAC ACTTCATATT GTACAGAGGC GGGACCATCA TCTAATCCAC GTACATCCAA GCGATGGAAC AGTCATAGTA GGGGTGATTC GAACCCGATG GAGATTAGAG AATCTGCTGC TCTAGTATCT TCATCCCCAC AATACCATGG CAAAGGCAAG AGTGCAGAGA ATCGGCCTTT GCTGGATGGT CGGCCGACAG AAGGGTATGC TTCGATACAG TCGCAAAAGG AGATAAGACG AAAGTCAAAT AGAAGAAAGC GAAGCGAAGA TGGCCAGAGC ACAGAAGGAC AAACCGTAAG TCTGATCCTC TCTATCTCTC TCTCCCACAC TAAAACCCGC TTGTTCTATC TCCAGCTGTT CAACGCGACG GCAGTACTAG TTGGTATAGG TCTCCTCTCC CTTCCACTTG CTTTCGCATA TGCCGGATGG ATAGGAGGTA CGATCATGCT TCTCGGCTTC GGGTGGCTGA CTTGCTATAC GTATATTTTG TTCTCATCTA CTAACTTTAC CGGAACACTA ATCATGAGGC TCGTCATAGG GCGAAGCTAC TGGCGAGACT GATCAGAGCC GACGGAAGAA TGATGGGATA TACAGACATT GGTCTGCGAG CTTTCGGCAG TTGGGCGGGT GCGGGTATAA ATCTATTGTA AGTGGCGGTC ATATCAATCA CTGAATGATT ATGCTGATTA CAGGGTAGAT TCTGCATGGA GCTCTTTGCA TTAGGGTACA TTTCATCGTC TCGTCTTGTA ATGATGGTAT ACTGACAACA GGCTCTAGTG TTGCCCTAGT TCTTCTTTTC GGAGACACAC TCAACGTTCT CTACCCTTCA ATACCATCAA ATGTCTGGAA GCTCGTTGGT TTCTTCATGT ACGCTCCACC GAGTTTTCAC CAGTGTTCAC ACCCTTGGTT TGCTAACGCC CTTTTGATTG ACAGTATCGT GCCCACTGTT CTGCTTCCTC TCCGTCTTCT GTCACTCCCT TCTCTCCTTT CCTCAATCTC CTCCTTTTTC CTCATCATAG TTCTTCTCGT CGACGGTTTT CTGCCTTCCC CTGAGCCCTC ATCGGCTTCG ACCGGCTCTC TCCTCCACCC CTCACCAACC AGCCTTTCTC CTGAATGGTC CCGTGGTAAC TGGTTGGGCG GAATAGGGCT TATTCTAGCC GGTTTTGGAG GCCACGCAGT GATGCCCAGT TTGGCGAGGG ACATGAAGCG GCCAGAGAAA TTTGATGGGA TCGTTAATTG GGCATTTGTA AGTTTCGCTA TTGTCTCTTC CTATCCTCTC TCAAACTCGG TGTGACCATA TAACAAACTT TTTTTCGTTT TCTGACTCGG GATTTGACAC AGGCGATAGC AACTGGCATA TCTTTCACAG CAGGTGCCGC TGGGTATCTT ATGTTCGGTG AGACCGTATC TGACGAAGTA TGTCCCTCCA GCTCTCATTT TTTCCAGGAT GGTTCCCTAG GCTGATGAGA TGTTGCAAAA TTACCTAGGT TACGAAAGAC CTGATGCGAG AGAAGTATCA TTACCCTCGA ATACTCAATA TTGTGGCTCT ATGGATGATT GTCATCAACC CCCTTACAAA ATTTGGGCTT TCCTCTCGTC CTGTGAGTGT TAATTTATTT TATGTTCGCC TTTTCTTGAG AGCTAGGCTC CATTAGTATG GGATCAAACG CTAATGGATA GCTCTCTCTT GCAGCTCAAT CTGACAATCG AAGGAATACT TAGGATATCC CCTTCTCCAC CACCGTCCCT CTTCTCTCCC TTTGATGGTG GACTCGAATC AGCGCTCGGG TCGGGTGCAC CAGAGACAAG CCAAGTTAAT TTTCCCAAAC GTCGCGATCG GTCATCTTTC TCTACTCGCA ACGATCCCCG CACATCCCGT CGACCGTCTG CCCTCACCTC TCCATCCAAC TCTCGCCCCT TTCCTCAGTC CCAATCTCAG ATAGTCGCCT TCGAGCAATA TGTCTCTAAA GAACGCAAGA AGAGATGGCT GCGAATGGTG TCGCGAGTGG TCATTACCGC TCTCTGTGTG GGTGTCGCTG TAGTCTTGCC GGGTTTCGGA CGTGTGATGG CCTTTTTGGG AAGTTTCTCC GCATTTATGA TTTGTATCAT CTTGCCTGTA CGTCCTCATC TGCGTTTTCA CCATCCTCCC TCCGGAACTC AAAACTAACT TTTGTTTTTA AAAAGCTCCT ATTCTACATC CGCCTGTCCC CCACTCTCCT TCCAACTTGT CCCCCGTCGC CCCATTCGCT CACATTCCCA CCGACTGCCC GCTCGCAACC AACGTCAAAA TGGGCCAGCG AGAAATTTAC GAATGCGATA CACTGGGTAT TGGTAATCGC CAGTACGGCG CTGATGATAG CCGGGACTAT ATGGGCGTTC TTGCCAGGAA GTGGGCATGA TGAGTTGGAA ACGTAGCGGT TTCGAGGGCA TAGAAGGGCT GTAACAGTTG TAGCAAGTGT TTCGTTTTGA TAGTTGTATT TTTCAAATCA GGTCTCGCAT TTTACTTTG
|
Protein sequence | MSPPQGKGVP VPNLGLKHES KKKGHVMQSS LELVFSYSRS QQMHYGSRSA PSFLDDSSFR GDQFDVDDYD EDQNYNDIYD GIERTGEEQF EGRDLSGRLE DLKVAGAISE ANEDGSTQEE LGQDSHITRR SSDEPAFFHS DSVPISPISQ LQPSVNQGAK VNTLLPDGIG PIQGLFLAAM PSPRLAPVLP HPGSLDTSYC TEAGPSSNPR TSKRWNSHSR GDSNPMEIRE SAALVSSSPQ YHGKGKSAEN RPLLDGRPTE GYASIQSQKE IRRKSNRRKR SEDGQSTEGQ TLFNATAVLV GIGLLSLPLA FAYAGWIGGT IMLLGFGWLT CYTAKLLARL IRADGRMMGY TDIGLRAFGS WAGAGINLLV HFIVSSCNDG ILTTGSSVAL VLLFGDTLNV LYPSIPSNVW KLVGFFIIVP TVLLPLRLLS LPSLLSSISS FFLIIVLLVD GFLPSPEPSS ASTGSLLHPS PTSLSPEWSR GNWLGGIGLI LAGFGGHAVM PSLARDMKRP EKFDGIVNWA FAIATGISFT AGAAGYLMFG ETVSDEVTKD LMREKYHYPR ILNIVALWMI VINPLTKFGL SSRPLNLTIE GILRISPSPP PSLFSPFDGG LESALGSGAP ETSQVNFPKR RDRSSFSTRN DPRTSRRPSA LTSPSNSRPF PQSQSQIVAF EQYVSKERKK RWLRMVSRVV ITALCVGVAV VLPGFGRVMA FLGSFSAFMI CIILPLLFYI RLSPTLLPTC PPSPHSLTFP PTARSQPTSK WASEKFTNAI HWVLVIASTA LMIAGTIWAF LPGSGHDELE T
|
| |