Gene CNA01340 details

Gene Information

Locus tag: CNA01340
Symbol:
ID: 3253710
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 357523
End bp: 359374
Gene Length: 1852 bp
Protein Length: 383 aa
Translation table:
GC content: 49%
IMG OID: 638252466
Product: cytoplasm protein, putative
Protein accession: XP_566593
Protein GI: 58258361
COG category: [R] General function prediction only
COG ID: [COG0496] Predicted acid phosphatase
TIGRFAM ID: [TIGR00087] 5'/3'-nucleotidase SurE


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TAAGCCATTC AGCTAACAGC CCGTAGACCA GATATATGTC CGCTGTGCTC CTATCGTATT 
CTATCCCCAC TTTCCATATA TACACCATAA TATGCCTCAG CTCAAGACAT ACAGCGAGAA
GCCAGTCGTT CTTTTGACCG TGAGTGCGCC TTACAGGGCT TGATATAGTC TTGGCTCATG
TAGCTCATGT ATGATAAATG TACCCGGAGC AGAATGACGA TGGTCCTCCA TGCGCCTCCT
CTCCCAATAT TTATGCATTC TGCAAACTCC TTCAGTCACG TCTCGGGTGG GACGTACGAG
TAGTCATCCC TGACTGCCAG AAATCTTGGC AAGTTAACAA ATTGTGTTTC ATGGCGAGCT
CTTTGTGCTA ATCAGCCAAC GCAGGGTCGG AAAGTCATAT GCTATTAGTG ACATCGTCAC
TGCCAACTAC TTCTATCCGC TTGGTAAATG TCATATGATT AGACACTTGT TCCCGACGCT
GATGAAGCTT TCATCTGAAA CAGAACCGGA TGGATTGAAA GGAGAAATAA CTCAGACTCG
CCGTCCGCTG AAAGAAGGAG AGTCAATGGA ATGGGTTCTT CTATCTGGAG TATGCGTTCT
TTTAGTTAAT GCGGTTGTGG CTGACATCTC ATGCCCGTTC TAGACTCCCG CAACATGCGC
CAACATCGCA CTGCACAACA TTTACCCTGG CCAGATCGAC CTTGTCATCT CCGGTCCTAA
TCGTAATGCA TCCCTTACAG AAAAAGTATA ATAACGCTGA TATCCGATCT AGATGGCCGT
AACTCCTCAA CAGCATTCGC CCTCTCGTCC GGTACTCTTG GCGCTACCCT TGCCGCTTCC
CTCTCTGTCC CTATTCCCGG TCCCTTGACC TCCCCGTCCT TACATGAAGA CCACATGCCC
TGTATAGCCA TCTCTTACGG TGTCGTCACC CGTCCAGTTT CCGATAGAGT TCTTGAACTC
GCAACCGAGA CAGCGGTGGA TGTGTGCCAG CAGTTATTCG ATAACTGGGG AGAAGATAAA
GAAGTGGGTG GGAAGGGACT TGTGCCGATA TATAGCATCA ATATACCGCT TGTCGAGGCG
GCTCTTGAGA AGAACGAGAG AAAGATAGTG TCAACAGAGA TGTGGAGAAA TGCGTATGGG
CGATTATTCA AGACTACTAA ACTGTGAGTG TCTTTCCTGC TCTTCGTTGT CTTTTGGTGT
GTATCTCGGC CTTGGGGATG GGAGCTTAAC CACCTCTTTG TGGATGGCGA CCCTTCCCTC
AACTGGCGAT CCAACGCTTC CAAGCCAGAC CAACCAGTCA TCTCTTCATT TGTGACCTTT
CTACAAAGCG TGGCCGAGGC TTCCGAAGAG GAAGAAGAGA AGAAGAATAA TCAAGATTAA
GAGCAAGGAC AGAAATCTAA CATTCCCCCT CCTAGGTCAA AAGCGTTGTA CGATCCCGGA
GATGACCCCG TCCAGATTGC TCATGGCTAT ATGAAGAGCC AAGACAAGCC CAACACCGTA
CAGACCTCCA CCTCCCTCCC ATCTCATCCA CATACCTCTA AAACATCCAC TGCCGGCCCC
GCCGCGCTCC CGACTCCTGC CCCACCATCC CCAATCACGC CCAAAAACAC GAAGGACGAG
GAGCAACAGC TCAAGTTCCA CTTTGCGCCC AACATGCACC CGTTACTATT TCCACCTGAA
GGGAGCGTGC CTGAAGGCAC AGATGCGTGG GCGTTCGCAA AAGGATGGAT CAGTGTGACG
CCTATGAGAG CCGAGTATGC TTGTTTGGGA GCGGCAAGTA TCGAGTAACA TGAAAGATGG
AACATTAACC ATCATAACGT ACATTATAAC TGTGTAACGT AGACATTTAT GA
 
Protein sequence
MPQLKTYSEK PVVLLTNDDG PPCASSPNIY AFCKLLQSRL GWDVRVVIPD CQKSWVGKSY 
AISDIVTANY FYPLEPDGLK GEITQTRRPL KEGESMEWVL LSGTPATCAN IALHNIYPGQ
IDLVISGPNH GRNSSTAFAL SSGTLGATLA ASLSVPIPGP LTSPSLHEDH MPCIAISYGV
VTRPVSDRVL ELATETAVDV CQQLFDNWGE DKEVGGKGLV PIYSINIPLV EAALEKNERK
IVSTEMWRNA YGRLFKTTKL SKALYDPGDD PVQIAHGYMK SQDKPNTVQT STSLPSHPHT
SKTSTAGPAA LPTPAPPSPI TPKNTKDEEQ QLKFHFAPNM HPLLFPPEGS VPEGTDAWAF
AKGWISVTPM RAEYACLGAA SIE