Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI00550 |
Symbol | |
ID | 3259631 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | + |
Start bp | 128881 |
End bp | 132628 |
Gene Length | 3748 bp |
Protein Length | 898 aa |
Translation table | |
GC content | 47% |
IMG OID | 638258539 |
Product | expressed protein |
Protein accession | XP_572778 |
Protein GI | 58271244 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.187013 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGATACTCTG TGATTATTTG CCATCCACAA GGAAGCCAAA GAGGTCGTCC ATCTTAGTGC GTAGAGAAGA ATGCCTTCTA AACGCACCAG CTCGGGTGTG TCTTCGCCCA CTGCCAAATC GTCTAGCACC CGTAAATCCA AATCTACACC GAAGAAGACG TTGCCAAGGC CGATTACATC ACCTCCTCCA CCTCCTCATC ACCATGTTCC GGCACCAGCA CCTATGCACT TACCAGTGCA TAAGGCCGGA AGACTGAGCT GGTCGTTACT TGGTTTTATC TTAGTTGTCT TGCCGTTCTG GTTCTCCAGA TTACATTATG GATTGCCTGA ACCTCTTCCA CCATAGTAAG CCTGCCACTT AGTCGAACTT CATCATTTAT CTCTAATAAT TGGGTTGTTT AGTGATGCAG ATGGACGTCC ACAACCATCT GAGGAGATTG TACTTTCCCA CGTTCAAGCT TTAGAAAATA TCGGTTACAG AACCGTTGGT ACCCACGAAG CTCTTGCTGG CGAACAATAC GTCCTCAACC AAGTGCTGGA GCTAGTGGAG AAGTGCAATG CAGGCGGAAT ATTGAACTGC GAATGGTGGC ATCAAAAGGG CTCGGGTTTC CATGCGTGAG CTGCTCTCCA CTTACCTTGA GAGCGAAGCT GAAAGAGGCC CAGGTTCGAA ATCATCGATC ACGAAGTGTT GAAAGGCTAT GGGGGCATTT CTAACATCAT TCTTCGAATT GCTGCTTTTC ATCCGCCTTC ATATAATATT TCACAGCCGA AGGTAGAGAA GGACGCGATA TTATTAGGTT CCCATATCGA CTCCACTATG CCTTCTCCTG GCGCTTCAGA GTAAGCCGCA TTCCAAGAGA AGTTCTGCCA TATATACTGA TGCGAGTGTA GTGACGGTAT TGGAGTTGGA GTCATGTTAG ATACAGCTCG AATTCTGGTG GAAAGGAATG AAGCCTTTGA TGGTGCCATC ATCTTCAGTG GGTTGCTTCA TAAATTCGAT TTCATGGGGA ATCTCGCTCA CTAACATGTG TCAATAGTGT GGAATGGCGG AGAGGGTAAG TTTGGTCCCT TGAGTGCCGA CTAACGGCAT GTTAGAGACC TTGCAGGACG GCTCTCATCT TTATTCGACT GAACACAGTA CAGCACCTAC AGTAAAAGCA ATGATCAATC TTGAAGGTAT CTTGATCCAT TCGTTCGCGC TTCTAGCACC ATGGCTGACC TTCTGCTAGC CGCCGGATCT ACTGGAGGCG CTCTTCTATT CCAAGCCACT AGCAAAGAGA TGATTGAGGC TTATGTCCAT GCCCCTTTGT AGGTGATCAA GTTGTACATG ACCACATGGC TGACATAAAA TAGCCCCCGG GGAACGGTTA TCGCTGCTGA TGTCTTTGCC TCTGGTATCC TTATGTCAGA GTACGTGTTT TCGGCTGAAC AAGATGAGCA CATAACTAAA GTGTCGCTCA GCACCGACTT TGGGCAGTTT GAAAAGTACT TGGGTGTCTC TGGACTGGAT GTAAGTTCTA CTGTAATATG ATGATCAGCC ATGTAGCTCA CATATAATAG ATGGCCATTG TTGGGCATAG CTACTTCTAT CATACTCATA GGTAAGTCTC GATGATTACG ACAAATAGCT GACAACACCT AGGGATACGA TCAAGCACCT CGAAAAAGGT ACAGCCCAGC ATTTTACCTC CAATATTCAA GCCATTGTCG ACCATCTTCT CTCCCCTTCA TCTCCCCTCC TCTCCCCTGC GCCATTCTCA CCTCCTCATG TTGTCTACTT CTCCCTTTTC GACCGAGTCT TCTTCCATTT CCCAATGTCC AGAGCCGATG GATGGTACGT CAGCATCGCT GCAGTGGCGA CTGCTTTTGC TTTCAGGCAT TTGAGCAACA AGAAGGCAAA AGCAATTGTT GTGGCTGCTG TCGGTACACC TCTAGGTATT CTCGGTGGAT TGGTCGGAGC CAATGCTTTT GCAGCCGTAC TCTCTGCTAC CGATAACGGC CTACTCTGGT TCCCTCACGA ACACCTCCCG CTTCTGCTTT ACGTACCAGT CTCATACATC TCACTCTTCT CAATTCATCT GATGCTTACC CACTTCCTTT CCCCTGTTGA GCGTACGCAA CTCGAAGTGA CACACTATTA CATCCAACTC TTACTCTCCT CTTGGTACAT GTTGCTCTTG CAAAGCTTCA GAGTCAGGTC GGCGTATCTC TATGCGATGA TCACCGCTTT GCTTTTGGTT GGTGCGGTAG GTAACGAGCT GGGTCGAATG GGACGAAGGG GATTGTGGGA GGGAATGTCA TTCAAAATGA CCTATTTAGT GCCTTCCGCA TGTCTCATGG CGCTGGCCGT GGAAGCCGTT ACTACTGTAG GCAAAAGTAT TAGCAATACC ATCATGAATT CAGACACTGA CACAAACTAC AGGCGTTGGA CATCTTCACG CCTCTTGCAG GTCGTATGGG TAAAGAAGCC CCAGCCGAAC ACATCGTGGC CTCTCTCTCT GTCATTTGCG GCTTTGTCTT CTTCCCCACA GTTCTTCCCC TGTTCCACCG CGTATCTCGC ATGACTCAGA GAAAAGTCGT CCTTGGCTTG GTTCTGAGTG TCCTTGGTAC CGTTGTTGCG ATGGTGGGAC CTTGGTACTT CCCTTACGAC GAAATGCATC CGAAACGAGT TGGAGTTATA TACAATTACA ATGTGAGTAC TAGTATGATG AAAACACGCT TGGGAGCTGA CGAAATGTGC AGCATACATC TGACAAGCAT GTTGCGCATC TTGCGTTCAT GGACCGAGGG CCTGTGGCCG ATATCGTACC CTCCCTCTAC TCCCGCTATG GGACACCCGA CCTGCCTCTC GAACACACCT CTCTTACAGA TTACGATTCT GATTGGGATG TGCTTTACCC TGTCTCTACC TTCTTAGATA CATACAGATT CGATTTGCCC GTAAGCGAAG AGACTAAGAA ATTTACTTGG CCAGAGATGA AATGGGGAGT GAAGGATACC AAATGGGAAA ATGGGGTTAG AAAGATGGTT CTGACATTCA ACTTTGTTCG TCTACCATTT CATTTCAGAT ACATACCGCT GATTCGATCC ACACAGACTG GGCTTGTTTG GCCAACTTTG GCATTTGAAG CATCTGTATT GGATTGGTCA TTCCACTTTG ACCCTCCTCC CAAGAAGATG CAGCACCATA TCAAGATTGC AACTTCTGTT GATGAACCAG TGGTGAATTT GAGGCTCGAT ATTAGAGCAG ATGAAGGAGA GAGGTTGAGA ATTCATTGGA GTGCTTTTGG TGAGACTTTC ACAAATCTCG ACATGTCATA TGACATTATA CTGACAAAGT GTTCGTTCTT CTCTCATAAG ATATCAACCA GATGGTCCCT GGCACAGCGG CACGAGATGG CCCAGATATG CCCGCTTCGA AGATGTTGCT CGATCTCGCA AGCTGGTCAA GGGAGAAATA CAATGACGAT CTGGACTTGG TCATGAGCGG CGTCGTCTGC GGTGTCATTG AGGTGTAACT TTGTGATGTG CAATGGTCTT CCTTTTCATG GGGGTTATGT CTGGAGGGGT GTAGCATATG TTAGTGGAGT ATACAAAGAA TAATTGATTA GGCTAAGATG CTTTGTGATC TTTAAATACT TTTCTTTCCA CCGGTGTGGT CGATCTGGCA TATTTTAGTG TGCCAATACT GGTTCTGTAC GGACTTCATT TCTAGGTCTA CGTATGCGTG AACAATACTA GCCGCGTT
|
Protein sequence | MPSKRTSSGV SSPTAKSSST RKSKSTPKKT LPRPITSPPP PPHHHVPAPA PMHLPVHKAG RLSWSLLGFI LVVLPFWFSR LHYGLPEPLP PYDADGRPQP SEEIVLSHVQ ALENIGYRTV GTHEALAGEQ YVLNQVLELV EKCNAGGILN CEWWHQKGSG FHAFEIIDHE VLKGYGGISN IILRIAAFHP PSYNISQPKV EKDAILLGSH IDSTMPSPGA SDDGIGVGVM LDTARILVER NEAFDGAIIF MWNGGEETLQ DGSHLYSTEH STAPTVKAMI NLEAAGSTGG ALLFQATSKE MIEAYVHAPF PRGTVIAADV FASGILMSDT DFGQFEKYLG VSGLDLTTPR DTIKHLEKGT AQHFTSNIQA IVDHLLSPSS PLLSPAPFSP PHVVYFSLFD RVFFHFPMSR ADGWYVSIAA VATAFAFRHL SNKKAKAIVV AAVGTPLGIL GGLVGANAFA AVLSATDNGL LWFPHEHLPL LLYVPVSYIS LFSIHLMLTH FLSPVERTQL EVTHYYIQLL LSSWYMLLLQ SFRVRSAYLY AMITALLLVG AVGNELGRMG RRGLWEGMSF KMTYLVPSAC LMALAVEAVT TALDIFTPLA GRMGKEAPAE HIVASLSVIC GFVFFPTVLP LFHRVSRMTQ RKVVLGLVLS VLGTVVAMVG PWYFPYDEMH PKRVGVIYNY NHTSDKHVAH LAFMDRGPVA DIVPSLYSRY GTPDLPLEHT SLTDYDSDWD VLYPVSTFLD TYRFDLPVSE ETKKFTWPEM KWGVKDTKWE NGVRKMVLTF NFTGLVWPTL AFEASVLDWS FHFDPPPKKM QHHIKIATSV DEPVVNLRLD IRADEGERLR IHWSAFDINQ MVPGTAARDG PDMPASKMLL DLASWSREKY NDDLDLVMSG VVCGVIEV
|
| |