Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CND06100 |
Symbol | |
ID | 3256943 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | - |
Start bp | 1679606 |
End bp | 1682720 |
Gene Length | 3115 bp |
Protein Length | 819 aa |
Translation table | |
GC content | 48% |
IMG OID | 638256550 |
Product | conserved hypothetical protein |
Protein accession | XP_570525 |
Protein GI | 58266738 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0680473 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TAATCCTCCT CGAAGTGTAA ATAAGTCACT ATACACCGGA CATGCCTCCC AAACAACCGC CAAAAAAGTC ATCCACGCCA TCCATGAGTG GTGCTGAACC CAACATTAAC AGCAATGGCA ATACCGTCAA GCCCCGCTCG ATTTCCATCT CCCGTAACTC TCCGAGTACG AACTCCGGTG GTACACCCCC TGTGCCGAAT ATCCCTTCTC GAGCTTCCAT GTCTTCAACT GGGCCCCCTA GGTCATTTGC GGGATCATTC AGACCTGCGT CTTCGGTGGT TGGGATGGAG AATTCTAAAT CAAGGAGACA GAGTATATCA AAGGAAAAAA TTATCTCAAA GGACTCGAGT GATAGTTTGA TCGGGAATAA GACTATCCCC CAAACACAAA CCACGAGCTC AGCATTGACA GCTGCCCTCG AGTCCCCTCA ACCTCAAGGC TTATCTCACC AAGGCTCAGG CTCCTTATCA CGCCCACCAA TTTCTCGCGA TGACTCTGGT ACTTCCGCTC TTCGCAATGG CGCCGCTAGT ACTTCTGCTC CCAAAGATGG TGATAAGGCG GGCGAGTCAT CCTTCTCCAA TCTTGCAGAT GTGCCAGATG AGGAAAAGGC GCGTGTTTTA AGAAGACACC TCGTAAGCGC CGATGAGCGA GGCGCCTCTT CGCCCACCCC TGGCGGACAG TCGCCAGCTG GATCTGGAAT CAATATGGGA GGTACACCGA CAAGGGCCGA GTTTGACGAC ATTCTTGGGG AGAGCGGAGT TAGTGGATAT GGAAGTATTG ATGCCCACCC TAAGGATGAT TCTGATCAGT TCCCTATCCC TTATGACGCG CCCGGTGGAG ATGTTACGTA GGTTTTTTTT TTCTTCCCAT ACAATCAATC GCTGATCATA TCCTTAGACA TGATCTTTAC AAATGGCAAC ACGACCACCG CACACGCCCC GTCCGCTCCG CCTCTTTTTC ACACGTCCCA GTCGACCGTT CGACCATCCT TGATCCACAC TTGGCGCATA TCAAAGAGCC TGGAGGTTTC CGACGGAACT TTGTGGTGAA CCGTGCGCAA GAGCAGGGGT TGGAGGCGCC TGACATGGTC AGGAATGTAG TGGACTTCTT GTTTCTATAT GGCCACTTTG TATGTCCTCG CCATTGCCAA ATAAATGAAG AACAAAGCTG ATCTTTTCGA TAGGCGGGTG AAGATTTGAA CGAAGACGAA GACGTGCTGG AAGAAGACGA AGAATCCTAT CCCGAGGACG CCTCGTCTTC TTCTGCTTAT GCTCGCCGAC CATTCCCTGC TGGCGGAGAA GAAGCCGGAC AAATCGCTCG TGGAGAGAGG GCACCATTGC TGGGTTCAAC CAAACGATCG TTGAGTAGGC ATCGTAGGAC AAAGTCTGGG CATAATCAGG GAACGGCGAG TGTCACTCAG GCTGTGCTCA TGGTGAATCA ATTTTCTATC TTCCATGAGC TCTACGAATG GTGACTGACA TCCTTGAAGC TTTTAAAAGG TTTTGTCGGT ACCGGTATTC TGTTCATGGG CAAAGCCTTT TTCAATGGTG GTATCCTTTT CTCGTCCATC GTCATGCTTG CTATTGCCGG TATCTCCCTC TGGTCATTTT TGCTTCTCGT TCAGGCGTAC ATGAAGGTCC CTGGATCATT TGGTGATATC GGCGGAGAGT TGTACGGAAA TAACATGAGA CTGATCATTT TGACTTCAAT TACTGTATCC CAAATCGGTT TTGTTGCTGG TGAGGTCCCC ATGCTTTTTT GATCGAAGGC CTTTACTAAC AGTGTCTCTT AGCCTACTCC ATTTTCATTG CTGAGAACCT TCAGGCATTC ATAATGGCGG TTAGCAACTG TCGAACCTTT ATCCCTGTCA AATACCTCAT CTTTGCCCAA CTTATCGTCT TTATGCCACT TTCTATGATC AGAAACTTGG CCAAGCTTTC GGGTACTGCA TTGATCGCGG ATGCATTTAT CCTTATTGGT AGTAAGTTTG TCTGTCCGAG CTCCAAGTGA AGCAAAGCTG ATGGCAAGTA GTCATTTACA TTGGCGGTAA TGAGATCTCG GTTCTGTCCA AGAATGGGGT TGCGGACGTT GCGCTTTTTA ACAAGCAGAG CTTCCCTTTA TTGATTGGTA CCGCTGTGTT TGCATTCGAG GGTATTGGCC TGTACGTTTT GCCCCCCATG GTCATTACAT CGAAACTGAC AATCGTACAG TGTCATTCCT ATCACTGAAT CCATGCGTGA ACCTCAAAAA TTCCCTCGTG TCTTATCAGG TGTCATGTTC TGCGTCGCCA TCCTCTTTGC TGGTTCTGGT GTCATGTCCT ACGCTGCATA CGGGAGTGAC ATCCAGACTG TCGTCATTGT CAACTTGCCT CAAGATGACA AGTTTGTTCA GGCCGTTCAA TTTTTGTGTA TGTCCATTCC TTATTCTAGT TTGACCAATC ACGGTATTGA CTTGTTCCAT AGATTCTGTT GCCATCCTCC TCTCTTCCCC CCTCCAACTC TTCCCAGCCG TGCGTATCAT GGAGAACGGT CTCTTTTCCA AATCAGGCAA GCACAACCCC TCGGTCAAGT GGCAAAAGAA TGTGTTCAGG TCCTGTACAG TTATCTTCTG TTCTTTGCTG TCTTGGGCCG GGTCTAATGA ATTGGACAAG TTTGTGGCGT TGATTGGCAG TTTCGCTTGG TAGGTCTAGG ATTTTGCTTC TCTTGACTTG CAATCGTTGA CTGACTCGAA TAACAGTATC CCCTTGTGTT TCATTTATCC ACCCATGCTC CATTTGAAAG CTTGCGCTCG CACTCCTAAA GCACGGATTA TGGATTGGAC GCTCATCGTC TTTGGTACCA TTGTGGGTGC CTTCACGACG GTGCAGACTT TAAGGAGTTT GTTCATTCCC AGTGCGGAAG GACCAAAATT CGGCGGGTGT GAATAAAAGA GTAAAGCTAC ATGGTAGGCA GGGGCCATCA GAATACGAGA AGGAGCATCT AATCAGGGGT GTAGTGTCAT TGGTTCCAGT AATTATCATA CTTGCATCTT TGTGAGATAC TTGCCGAGTT TTTCGACGCA ATTTTTTACA TATTGTGACC CCAATCTCGC ATATCATTCG TCCAAGTTTC AGTTT
|
Protein sequence | MPPKQPPKKS STPSMSGAEP NINSNGNTVK PRSISISRNS PSTNSGGTPP VPNIPSRASM SSTGPPRSFA GSFRPASSVV GMENSKSRRQ SISKEKIISK DSSDSLIGNK TIPQTQTTSS ALTAALESPQ PQGLSHQGSG SLSRPPISRD DSGTSALRNG AASTSAPKDG DKAGESSFSN LADVPDEEKA RVLRRHLVSA DERGASSPTP GGQSPAGSGI NMGGTPTRAE FDDILGESGV SGYGSIDAHP KDDSDQFPIP YDAPGGDVTH DLYKWQHDHR TRPVRSASFS HVPVDRSTIL DPHLAHIKEP GGFRRNFVVN RAQEQGLEAP DMVRNVVDFL FLYGHFAGED LNEDEDVLEE DEESYPEDAS SSSAYARRPF PAGGEEAGQI ARGERAPLLG STKRSLSRHR RTKSGHNQGT ASVTQAVLML LKGFVGTGIL FMGKAFFNGG ILFSSIVMLA IAGISLWSFL LLVQAYMKVP GSFGDIGGEL YGNNMRLIIL TSITVSQIGF VAAYSIFIAE NLQAFIMAVS NCRTFIPVKY LIFAQLIVFM PLSMIRNLAK LSGTALIADA FILIGIIYIG GNEISVLSKN GVADVALFNK QSFPLLIGTA VFAFEGIGLV IPITESMREP QKFPRVLSGV MFCVAILFAG SGVMSYAAYG SDIQTVVIVN LPQDDKFVQA VQFLYSVAIL LSSPLQLFPA VRIMENGLFS KSGKHNPSVK WQKNVFRSCT VIFCSLLSWA GSNELDKFVA LIGSFACIPL CFIYPPMLHL KACARTPKAR IMDWTLIVFG TIVGAFTTVQ TLRSLFIPSA EGPKFGGCE
|
| |