Gene Information |
Locus tag | CND00470 |
Symbol | |
ID | 3257102 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | + |
Start bp | 138162 |
End bp | 141413 |
Gene Length | 3252 bp |
Protein Length | 770 aa |
Translation table | |
GC content | 47% |
IMG OID | 638255984 |
Product | expressed protein |
Protein accession | XP_570362 |
Protein GI | 58266412 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
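The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed Gene Length) and that the coding region ends with a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
# Values copied from the Gene Information table.
START_BP = 138162
END_BP = 141413
GENE_LENGTH_BP = 3252
PROTEIN_LENGTH_AA = 770


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumed)."""
    return (protein_aa + 1) * 3


span = span_length(START_BP, END_BP)
coding = coding_length(PROTEIN_LENGTH_AA)
# Bases in the gene span that do not encode protein. For a spliced gene
# these would be introns plus any untranslated region inside the span.
noncoding = span - coding

print(span, coding, noncoding)
```

Under these assumptions the span matches the listed Gene Length of 3252 bp, and the difference between the span and the coding length is consistent with the gene being flagged as spliced.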
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.966337 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGAAGTCAA GGTCAGGTTT GACAGGCATA GGCCTGCTTC GTCTGTTCAA GAAGTCTGTA TAGCTCCGCA AGATCTTAGG TGCTACCGAC TAACAGGGCC GGGCGAAAAC AACGATAAAC AAGTCAGGTA ATTCAATCGA TCTAATCATC GAACGAACCT CAGCAGCTCT TGCCCCTTTT TAGTCGTATG CTTCATTTGC ATGTCTTCAT CAATTGAACA CAAACACCTC TATCATATTA TCTCTGACCC CCAGTCGCTC TTTACCCTGC ATTATCAAAT TGAGGAAAAG GCTGTTTCGG CCGCCTCAAC TCTGACATAC GATCACTTTA ATTAAGACTT CGAATATTGC TTTACCAAGT GAGTCATCCT ATTCACAGTA AAAAAAACCA GGATAACAGG AATCAGGGCT GTATCTTCTA TTGATAATTA TCATTATAAA ATAAATACTG GGTCTTCGAT TTCATAGTCC GTAAGTTGCT TGTTCTTCGT CGTGAACGAT TGGCCGCTCA CAGTCCTTTA AAGTGTCTTC GACTATTCTC ATGAGTCCAT CATTTGCTGG CTATCTCCAG CCATTAGGAT CGAATGGATA TAACGCTCCT GTGCCATCAC CCCCTCAAGT TACTGGTCAT TTTAATTCAG CGTCTTCAGA GGATAAGGAG AATATTCCAT CGCAACCCAC CTTTCGCGCG AGAGATTGGC AGGAAAAGCG AAATAACCTT CAAGTTCGAT CATTTCCTCA ATCGTATGAC CTCAACAGTT CAATTATTGA CTTCATTACA ACACAGGAAG CCGGTTTTGC CACTTTGAAA ATTGAAACTT TGGTGCCCTC AAATATTGGA AGGAAGAAAA AGGACATAGC AAATGTTAAG AGGAAGCCTG TATCATCCGT CTATGAGGAA GGAAAGAATG GTGGCGTTAC CAGCGTTAGC TCAAGTAATC CCTTGAATGT AAGTCTGGTA TCCTTGTAAA AACCGAAATA TGCCAATTAT AATCAGGCGG CCACCTTTTC TTCGGCCATC AATTCCTCTA CCCCATCTTT ATTGACTGTT GCTCCTCAGA ATCAATTGGA AATGATTCAG TTAACTGATT CTGCCGTATT TCCTCGCCAA GTGGTGCCAA GTTATTCGCC CATTAACAAT GATATGCCTT TTGTAGCTCC TCTTCCTGAT TCATGCTCTG AAACGACACT CACCTCCATT CTAAATGCGT GTGTATTCTC ATCTGACGAC GAGTTTTCTG AAGAAAATAT CCCGGCCAGG GATGCATTGC CTGGAGGCTT GGGTTCTGGG TTGGTAAATG AGTCAATCCA ATCTTTGGAC GCTATGATTG AACAGCAGTA CCAAGAATCC GGAGTGACTC GAGAGTCCCA CAACTCGTTC GACACCGAGG CCAATAACTC TGACCGAAGC AAGGAGCTGG ATTCCCCTAA GGAGAAGGCG AAAAGGAATA AACATCATTC CAAAATCTGC TCAAACCGGT CATACAATTC TTCTATTCTC GCAAGAAGAA GCAGTCTCAA GTCATCTATG TATACTGGAA TCGAGTTCAA CCAGGAAGCC GCTTCGCCAA CCAACGAAAA CACTGAAGGA AATGGGTGTT ATACGACCGA AAGTGACACT TGCTCGAATA ACAAGACGGT CTTGTTTGCT ACTCCAGCCT ATCAATTTGA CTATTTTCAG TGGTCCCGAA AACAGCCACT AAAGATGAAT GAAATTCTTA AAGGCGATGA AACGGTGACA GACCATCAGT CATATCCAAC TACTAGTGAC GATGACGTCT ATTACGCTGC TTTAACAAGT 
GAGAAGAGGT CCGCGATGAT CCATGAGATT ACGACTCAAC TAGCTACCAA GCTAACTTCA GATGGCGCCA ACACCAACGT TAACATGGCG GAACCTTTTA ACCCTCCCTG CCATTCGTCT ATCCAAGCTC TCAACCCCGA AAATTCCTCT CCTATGTCTA GCTTCTACGC AAGTAACCAT CATACCGCCT CAGCTGCCAC TGTTCCTCCG TCTCACCACC AGAGCGCTAT CCGCAGAAAT TTTGACGGAC GTAGTCTAGC TGTGAGGTAC GCACAGCGAC ATCCTTTCTT GCTTGAAGGC GATAACGTAC CTCACGATGA CGATGACTCT TCGAGTGCTG ATATCCGACC CTCCGCCAAC TCGAATAGCG ATGTGGAGAA TGGGAGGCCA CACAAGAGGA CTAAGGAGCG ATGGTTGTCG GTATGTCTTG TGATGGGGTT TGTCATGCCC TTGGCTTGGG TTGTGGGCGG ATGGTTCATA TCGGGAGTTG ATAAAGAGAC GGAAAAACAG CTGGTTGGAA GAGCGGCTTT GGAGCTTGAT GACGACCCGA GTTCCCAGCA AAGCGATTTA AATGGTCCTC ATACGGATGA TGTCCCCCGA AACAATCGCA ACAGTCACAA CAATGAGAGC ATTGATTCCC AAGCGACTCG ATTGGGCCCA TCTGACGAAC CTGAATTGGA TTTATGGACA CCCCGGATTA ACTTGACAAC CAGTGCTGCC AATGTGACGG CGTCCTTTCC GGCTCAATAT CGACCGTTTA CGTCGATGCC AAACCTTTTT ATGCCTAGCC CTCCCACGGC CATCACTCGG AACTCAATCT CAAGCCCGAA TCTTCTTGGC CTGTACGAAC CCCCCCAACC ACCCATGCCC GTCTCGGAAA GCGGCACGTT GGAATCCAGT ATTTTGCCGG TGCCTGTCCA CTCCTTTACG TCCAAAGATA TGTCGGCCAC TCCTTCTCCT ACTCCCACCC TAGTCAATCT ATCTCCCACC AATTTCACTC GTCACTGGAA CCCTAATTAC GGGCATCGAC CTACTCCATA CCCCGTTACC TTATCTGGGT CCAACTTGAC ATGTGTCCCT GCCGGTCCGG GACACCCTGT GCTCCCCTCT CACTATCTTT ATCATCTCTC AGTGAACGAC CCTGCATCGT CAAAATTGGC GCCTCCGGCA CCAACTGGAA GGCCCAGTTT TGTATCAAAC CCCGCCTGGC TTTTGGACCG TTGTGCCAGC ATGCGCACCG AATATCCTAA TGGCAAAAAC ACTGCAAGAT TCTGGAAAAA TTCTCTAGCG CGAAAGAAAA TGAGAGCTTG GCTGCCCGGA GCTCATCCTC ATCCGCTGGT CAGAGCGAAT CGATGGATGA TGGCGTTTGC CTTTCTGTTG GGCTGCCTTA CTGCTATTTG CTTTATGCTG GTGGGAATCA AATCGAAGAT GAATCAGGCG AGGGTGGGTT AG
|
Protein sequence | MSPSFAGYLQ PLGSNGYNAP VPSPPQVTGH FNSASSEDKE NIPSQPTFRA RDWQEKRNNL QEAGFATLKI ETLVPSNIGR KKKDIANVKR KPVSSVYEEG KNGGVTSVSS SNPLNAATFS SAINSSTPSL LTQYQESGVT RESHNSFDTE ANNSDRSKEL DSPKEKAKRN KHHSKICSNR SYNSSILARR SSLKSSMYTG IEFNQEAASP TNENTEGNGC YTTESDTCSN NKTVLFATPA YQFDYFQWSR KQPLKMNEIL KGDETVTDHQ SYPTTSDDDV YYAALTSEKR SAMIHEITTQ LATKLTSDGA NTNVNMAEPF NPPCHSSIQA LNPENSSPMS SFYASNHHTA SAATVPPSHH QSAIRRNFDG RSLAVRYAQR HPFLLEGDNV PHDDDDSSSA DIRPSANSNS DVENGRPHKR TKERWLSVCL VMGFVMPLAW VVGGWFISGV DKETEKQLVG RAALELDDDP SSQQSDLNGP HTDDVPRNNR NSHNNESIDS QATRLGPSDE PELDLWTPRI NLTTSAANVT ASFPAQYRPF TSMPNLFMPS PPTAITRNSI SSPNLLGLYE PPQPPMPVSE SGTLESSILP VPVHSFTSKD MSATPSPTPT LVNLSPTNFT RHWNPNYGHR PTPYPVTLSG SNLTCVPAGP GHPVLPSHYL YHLSVNDPAS SKLAPPAPTG RPSFVSNPAW LLDRCASMRT EYPNGKNTAR FWKNSLARKK MRAWLPGAHP HPLVRANRWM MAFAFLLGCL TAICFMLVGI KSKMNQARVG
|
| |
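The sequences above are displayed in space-separated groups of 10 characters, so the spaces must be removed before any analysis. The sketch below joins the first three groups of the coding region (starting at the ATG that encodes the first methionine) and translates them to compare against the first protein group. The codon table is deliberately partial and covers only the codons in this fragment. It uses standard genetic code assignments, which is an assumption because the Translation table field is empty. The helper names are illustrative.

```python
# First three 10-base groups of the coding region and the first
# 10-residue protein group, copied from the Sequence table.
GROUPED_DNA = "ATGAGTCCAT CATTTGCTGG CTATCTCCAG"
GROUPED_PROTEIN = "MSPSFAGYLQ"

# Partial codon table (standard genetic code, assumed): only the codons
# that occur in the fragment above are listed.
CODONS = {
    "ATG": "M", "AGT": "S", "CCA": "P", "TCA": "S", "TTT": "F",
    "GCT": "A", "GGC": "G", "TAT": "Y", "CTC": "L", "CAG": "Q",
}


def ungroup(seq: str) -> str:
    """Remove the display spacing (and any line breaks) from a sequence."""
    return "".join(seq.split())


def translate(dna: str) -> str:
    """Translate complete codons in reading frame 0 with the partial table."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


dna = ungroup(GROUPED_DNA)
peptide = translate(dna)
print(len(dna), peptide, peptide == ungroup(GROUPED_PROTEIN))
```

A full analysis would need a complete codon table and the exon boundaries, neither of which is given in this record.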