Gene CNI00550 details

Gene Information

Locus tag: CNI00550
Symbol:
ID: 3259631
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 128881
End bp: 132628
Gene Length: 3748 bp
Protein Length: 898 aa
Translation table:
GC content: 47%
IMG OID: 638258539
Product: expressed protein
Protein accession: XP_572778
Protein GI: 58271244
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
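
The Gene Length value follows from the Start bp and End bp fields if both coordinates are counted as part of the gene. The sketch below shows that arithmetic; the function name `inclusive_length` is hypothetical and is not part of any database API.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
gene_length = inclusive_length(128881, 132628)
print(gene_length)  # 3748
```

An 898 aa protein needs at most 899 x 3 = 2697 bp of coding sequence (counting a stop codon), so the remaining bases of the 3748 bp span are presumably introns and untranslated flanking sequence, which fits the record marking the gene as spliced.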


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.187013
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AGATACTCTG TGATTATTTG CCATCCACAA GGAAGCCAAA GAGGTCGTCC ATCTTAGTGC 
GTAGAGAAGA ATGCCTTCTA AACGCACCAG CTCGGGTGTG TCTTCGCCCA CTGCCAAATC
GTCTAGCACC CGTAAATCCA AATCTACACC GAAGAAGACG TTGCCAAGGC CGATTACATC
ACCTCCTCCA CCTCCTCATC ACCATGTTCC GGCACCAGCA CCTATGCACT TACCAGTGCA
TAAGGCCGGA AGACTGAGCT GGTCGTTACT TGGTTTTATC TTAGTTGTCT TGCCGTTCTG
GTTCTCCAGA TTACATTATG GATTGCCTGA ACCTCTTCCA CCATAGTAAG CCTGCCACTT
AGTCGAACTT CATCATTTAT CTCTAATAAT TGGGTTGTTT AGTGATGCAG ATGGACGTCC
ACAACCATCT GAGGAGATTG TACTTTCCCA CGTTCAAGCT TTAGAAAATA TCGGTTACAG
AACCGTTGGT ACCCACGAAG CTCTTGCTGG CGAACAATAC GTCCTCAACC AAGTGCTGGA
GCTAGTGGAG AAGTGCAATG CAGGCGGAAT ATTGAACTGC GAATGGTGGC ATCAAAAGGG
CTCGGGTTTC CATGCGTGAG CTGCTCTCCA CTTACCTTGA GAGCGAAGCT GAAAGAGGCC
CAGGTTCGAA ATCATCGATC ACGAAGTGTT GAAAGGCTAT GGGGGCATTT CTAACATCAT
TCTTCGAATT GCTGCTTTTC ATCCGCCTTC ATATAATATT TCACAGCCGA AGGTAGAGAA
GGACGCGATA TTATTAGGTT CCCATATCGA CTCCACTATG CCTTCTCCTG GCGCTTCAGA
GTAAGCCGCA TTCCAAGAGA AGTTCTGCCA TATATACTGA TGCGAGTGTA GTGACGGTAT
TGGAGTTGGA GTCATGTTAG ATACAGCTCG AATTCTGGTG GAAAGGAATG AAGCCTTTGA
TGGTGCCATC ATCTTCAGTG GGTTGCTTCA TAAATTCGAT TTCATGGGGA ATCTCGCTCA
CTAACATGTG TCAATAGTGT GGAATGGCGG AGAGGGTAAG TTTGGTCCCT TGAGTGCCGA
CTAACGGCAT GTTAGAGACC TTGCAGGACG GCTCTCATCT TTATTCGACT GAACACAGTA
CAGCACCTAC AGTAAAAGCA ATGATCAATC TTGAAGGTAT CTTGATCCAT TCGTTCGCGC
TTCTAGCACC ATGGCTGACC TTCTGCTAGC CGCCGGATCT ACTGGAGGCG CTCTTCTATT
CCAAGCCACT AGCAAAGAGA TGATTGAGGC TTATGTCCAT GCCCCTTTGT AGGTGATCAA
GTTGTACATG ACCACATGGC TGACATAAAA TAGCCCCCGG GGAACGGTTA TCGCTGCTGA
TGTCTTTGCC TCTGGTATCC TTATGTCAGA GTACGTGTTT TCGGCTGAAC AAGATGAGCA
CATAACTAAA GTGTCGCTCA GCACCGACTT TGGGCAGTTT GAAAAGTACT TGGGTGTCTC
TGGACTGGAT GTAAGTTCTA CTGTAATATG ATGATCAGCC ATGTAGCTCA CATATAATAG
ATGGCCATTG TTGGGCATAG CTACTTCTAT CATACTCATA GGTAAGTCTC GATGATTACG
ACAAATAGCT GACAACACCT AGGGATACGA TCAAGCACCT CGAAAAAGGT ACAGCCCAGC
ATTTTACCTC CAATATTCAA GCCATTGTCG ACCATCTTCT CTCCCCTTCA TCTCCCCTCC
TCTCCCCTGC GCCATTCTCA CCTCCTCATG TTGTCTACTT CTCCCTTTTC GACCGAGTCT
TCTTCCATTT CCCAATGTCC AGAGCCGATG GATGGTACGT CAGCATCGCT GCAGTGGCGA
CTGCTTTTGC TTTCAGGCAT TTGAGCAACA AGAAGGCAAA AGCAATTGTT GTGGCTGCTG
TCGGTACACC TCTAGGTATT CTCGGTGGAT TGGTCGGAGC CAATGCTTTT GCAGCCGTAC
TCTCTGCTAC CGATAACGGC CTACTCTGGT TCCCTCACGA ACACCTCCCG CTTCTGCTTT
ACGTACCAGT CTCATACATC TCACTCTTCT CAATTCATCT GATGCTTACC CACTTCCTTT
CCCCTGTTGA GCGTACGCAA CTCGAAGTGA CACACTATTA CATCCAACTC TTACTCTCCT
CTTGGTACAT GTTGCTCTTG CAAAGCTTCA GAGTCAGGTC GGCGTATCTC TATGCGATGA
TCACCGCTTT GCTTTTGGTT GGTGCGGTAG GTAACGAGCT GGGTCGAATG GGACGAAGGG
GATTGTGGGA GGGAATGTCA TTCAAAATGA CCTATTTAGT GCCTTCCGCA TGTCTCATGG
CGCTGGCCGT GGAAGCCGTT ACTACTGTAG GCAAAAGTAT TAGCAATACC ATCATGAATT
CAGACACTGA CACAAACTAC AGGCGTTGGA CATCTTCACG CCTCTTGCAG GTCGTATGGG
TAAAGAAGCC CCAGCCGAAC ACATCGTGGC CTCTCTCTCT GTCATTTGCG GCTTTGTCTT
CTTCCCCACA GTTCTTCCCC TGTTCCACCG CGTATCTCGC ATGACTCAGA GAAAAGTCGT
CCTTGGCTTG GTTCTGAGTG TCCTTGGTAC CGTTGTTGCG ATGGTGGGAC CTTGGTACTT
CCCTTACGAC GAAATGCATC CGAAACGAGT TGGAGTTATA TACAATTACA ATGTGAGTAC
TAGTATGATG AAAACACGCT TGGGAGCTGA CGAAATGTGC AGCATACATC TGACAAGCAT
GTTGCGCATC TTGCGTTCAT GGACCGAGGG CCTGTGGCCG ATATCGTACC CTCCCTCTAC
TCCCGCTATG GGACACCCGA CCTGCCTCTC GAACACACCT CTCTTACAGA TTACGATTCT
GATTGGGATG TGCTTTACCC TGTCTCTACC TTCTTAGATA CATACAGATT CGATTTGCCC
GTAAGCGAAG AGACTAAGAA ATTTACTTGG CCAGAGATGA AATGGGGAGT GAAGGATACC
AAATGGGAAA ATGGGGTTAG AAAGATGGTT CTGACATTCA ACTTTGTTCG TCTACCATTT
CATTTCAGAT ACATACCGCT GATTCGATCC ACACAGACTG GGCTTGTTTG GCCAACTTTG
GCATTTGAAG CATCTGTATT GGATTGGTCA TTCCACTTTG ACCCTCCTCC CAAGAAGATG
CAGCACCATA TCAAGATTGC AACTTCTGTT GATGAACCAG TGGTGAATTT GAGGCTCGAT
ATTAGAGCAG ATGAAGGAGA GAGGTTGAGA ATTCATTGGA GTGCTTTTGG TGAGACTTTC
ACAAATCTCG ACATGTCATA TGACATTATA CTGACAAAGT GTTCGTTCTT CTCTCATAAG
ATATCAACCA GATGGTCCCT GGCACAGCGG CACGAGATGG CCCAGATATG CCCGCTTCGA
AGATGTTGCT CGATCTCGCA AGCTGGTCAA GGGAGAAATA CAATGACGAT CTGGACTTGG
TCATGAGCGG CGTCGTCTGC GGTGTCATTG AGGTGTAACT TTGTGATGTG CAATGGTCTT
CCTTTTCATG GGGGTTATGT CTGGAGGGGT GTAGCATATG TTAGTGGAGT ATACAAAGAA
TAATTGATTA GGCTAAGATG CTTTGTGATC TTTAAATACT TTTCTTTCCA CCGGTGTGGT
CGATCTGGCA TATTTTAGTG TGCCAATACT GGTTCTGTAC GGACTTCATT TCTAGGTCTA
CGTATGCGTG AACAATACTA GCCGCGTT
 
Protein sequence
MPSKRTSSGV SSPTAKSSST RKSKSTPKKT LPRPITSPPP PPHHHVPAPA PMHLPVHKAG 
RLSWSLLGFI LVVLPFWFSR LHYGLPEPLP PYDADGRPQP SEEIVLSHVQ ALENIGYRTV
GTHEALAGEQ YVLNQVLELV EKCNAGGILN CEWWHQKGSG FHAFEIIDHE VLKGYGGISN
IILRIAAFHP PSYNISQPKV EKDAILLGSH IDSTMPSPGA SDDGIGVGVM LDTARILVER
NEAFDGAIIF MWNGGEETLQ DGSHLYSTEH STAPTVKAMI NLEAAGSTGG ALLFQATSKE
MIEAYVHAPF PRGTVIAADV FASGILMSDT DFGQFEKYLG VSGLDLTTPR DTIKHLEKGT
AQHFTSNIQA IVDHLLSPSS PLLSPAPFSP PHVVYFSLFD RVFFHFPMSR ADGWYVSIAA
VATAFAFRHL SNKKAKAIVV AAVGTPLGIL GGLVGANAFA AVLSATDNGL LWFPHEHLPL
LLYVPVSYIS LFSIHLMLTH FLSPVERTQL EVTHYYIQLL LSSWYMLLLQ SFRVRSAYLY
AMITALLLVG AVGNELGRMG RRGLWEGMSF KMTYLVPSAC LMALAVEAVT TALDIFTPLA
GRMGKEAPAE HIVASLSVIC GFVFFPTVLP LFHRVSRMTQ RKVVLGLVLS VLGTVVAMVG
PWYFPYDEMH PKRVGVIYNY NHTSDKHVAH LAFMDRGPVA DIVPSLYSRY GTPDLPLEHT
SLTDYDSDWD VLYPVSTFLD TYRFDLPVSE ETKKFTWPEM KWGVKDTKWE NGVRKMVLTF
NFTGLVWPTL AFEASVLDWS FHFDPPPKKM QHHIKIATSV DEPVVNLRLD IRADEGERLR
IHWSAFDINQ MVPGTAARDG PDMPASKMLL DLASWSREKY NDDLDLVMSG VVCGVIEV
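
Both sequences are printed in space-separated blocks of 10 characters, 60 per line, so the whitespace has to be stripped before counting residues or computing base composition. The following is a minimal sketch of that cleanup; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first gene line and the last protein line are pasted in as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Sample lines copied from the listings above.
first_gene_line = "AGATACTCTG TGATTATTTG CCATCCACAA GGAAGCCAAA GAGGTCGTCC ATCTTAGTGC"
last_protein_line = "IHWSAFDINQ MVPGTAARDG PDMPASKMLL DLASWSREKY NDDLDLVMSG VVCGVIEV"

print(len(clean_sequence(first_gene_line)))    # 60
print(len(clean_sequence(last_protein_line)))  # 58
```

With 14 full protein lines of 60 residues plus the final line of 58, the total is 898, matching the Protein Length field. Passing the complete gene listing to `gc_fraction` should give a value comparable to the GC content field, assuming that field was computed over the same span.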