Gene CNI04350 details


Gene Information

Locus tag: CNI04350
Symbol:
ID: 3259710
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 1160390
End bp: 1163742
Gene Length: 3353 bp
Protein Length: 966 aa
Translation table:
GC content: 52%
IMG OID: 638258929
Product: hypothetical protein
Protein accession: XP_573020
Protein GI: 58271728
COG category: [T] Signal transduction mechanisms
COG ID: [COG2453] Predicted protein-tyrosine phosphatase
TIGRFAM ID:
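
As a quick consistency check on the coordinates and lengths listed above, a minimal Python sketch follows. It assumes that Start bp and End bp are 1-based and inclusive (an assumption, although it reproduces the listed gene length) and that the protein length excludes the stop codon. The function and variable names are illustrative and are not part of the source database.

```python
# Values copied from the record above.
start_bp = 1160390
end_bp = 1163742
protein_length_aa = 966


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


gene_length_bp = span_length(start_bp, end_bp)

# Coding length: one codon per residue plus one stop codon (assumption).
coding_length_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not accounted for by the coding length.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, non_coding_bp)
```

Under those assumptions, the gene span is longer than the coding length, which would be consistent with the record marking the gene as spliced.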


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCGAG CAGCTACCGC CACTCCTCCA CCTCAAACTT ATTTTCCCCA GCGGAACGCC 
ACTCCGTCTA TGAGGCATGC CGCGCCTAGC CCGGCGATTC ATTTTGGGCG ACAACATCAG
CATTTGGGAC AGAGCCATGT GAAGAGGAGG GTGAAGGGTA ATGGTGGATC CGGTTTGAAG
AAGCAGCATA TACCGAATAG CAGAGAACAG AACGAGGAGG ATCAATTGGG GATGTCATTC
GAGGATCTCG GCGCGGAGCG CGTCCTTCAA AATCCAAACT CAGACAAAGC CGATATCGAT
GTCGATATCG ACATTGACTT GGACCTCGAC CCTAACCTCA CTCCGGCCTC TTCCCGAGCC
TCCATCGTCA GTGGCCTAAG CCAAGACCAA GGTCCTGTCC TTCTCACCCC CGTCCAATCC
GATTCAGAGG GTAGAGGCCC AGTGTTAGTG TCAAATTCCG GCAGGGTGTC GTCCGCAGGT
TCAGAAGGGG GCGAAGAAGA AGGCCGAGCG TCGCCGGGAT GGATGGGAAA GATGGCGATG
GCGGTTGTGA ATACGGGGAT GAGTGTTTCC AGAGGGTTTG GAAATGCACA AGCGCATGGT
AAAAGAGTCC AATCACAGCC ACAGCCACAA CCGCAGGCGC AAGCTCAAGC CCACGATCAG
ACTTTGGCAC ACCCAGCCGA AAAGATTGAC GATCAGCAGC GACAGCGGGA ATGGGCTGAA
GCAGAGAACC GGAGAATCCA TGAGTGCGCA AGGTTGTGTT CTCAGTGGCC GTTAAGTGGG
TACAACTTGG GCAAATACGG TCCCGGTGGT GAGTCAACTG TCCTTCACAT GTTCGCATCC
GGTAAACATG TGTTAACATT GGGAATGAAG GCGCTGCAAC GTTCTATCAA CCCCAGTCAT
TCTGTAACCC CCAACACGTC GCTTGGGTGA TGCATCGTCA AGCCCAAATC GAGCAGAGGT
TGGCATTGAC CGAAGGGACA TTCTTCGATT GCCGCAAAGT GAGGCAAAGG AAGAGGCAAG
GTGATTGTGA GAATGGTCGA GGGTGTAGAG CTAGAGAGGA AGATGAGCAG AGTGAAGTTT
CAACCTCGAC GGATGACTCG ACGTTGTCAT CTCGGTTGTC GGGATCGACA TGCTCAACCG
ACTCTCGCTC CAACTCTACC GCCAATTCTT CCAATCCTTC TTCCTCGCAA GACGAAAACA
ACTCTGAACC TCTGTCCCTC GTCCAACCCC AACGACCTTC TGCGCCAAGG TCATTGACGG
CTGAGCACTA TGCGTATGGG CGTACAGCGC AAAAGATGGA GAGAGAGAAG GAGATGGCGG
AGGATTTGAG AGAGGCGATG GCGTGCTCCT TATTGGAGTT TAGTCATCCT GAGCGTCGCG
GTTCAGTGAC TTCTTCGGCC ACCAAGGATG TTAGCGCTGA TGGAAGGGAC GAGAATGGAG
AGAAGTTGGA GAATGCCGAA GTGGAGGGCA GGAGAGAAAG GGGTCGCAAG ATGTTTGCCG
AGTTGGAAGC TCTTGCGAGG GATGTAGGTT TGGGGCATGT TGGGGAGGGG ATGGATGTGG
ATGAGGACTT GTGTGAAGAG TCTCAACCAC AGCCTCAACC CGACACGCAA GCTACCCCCT
CTGCCCAAAC TCACGCCGCA AGCTATTCTT CCGAACCATC TATGCCTCGC ACCCGAACAC
AACGCGCTCA GTCATGCGGC TCCAAACGTC CTATTCCTAG TACACGCTGT AGCGCCGATG
GGGAAGAGGA GAAACGTCGA AAAGTTCGTC CTGTCGCTCC TCCTACTACG TCTTCAGGTT
TTGACCCAAT GTGCACCGCA ACGATGGAAG TGGAAGAGGC TGTGGATGAA GCTATGGTTG
TTGAGGAAGC TGTTGATAAC GGCGTCATCA TGCCCGCTAC CCAAACGCAG TCTCAGAATC
AATCTCATAC AGGGGAAAGT AGGCTCAAGA CTGGAAGGGG AAAAGGAATG TCATCTTCCG
TACCCGACTT GTCCAAAACT TATCCTCATT CCGAATCAGC GACGACGTAC CCTCCTCAAC
GACAGCAACA GCCTGAACAA TCAAACTCTC GTGATATATA CCCTCCTCCG GCTGTATTCG
GTGTGGTCGT CAAAACTTCC GAGTCACATC CCATCATCAT TTCGCCCTTC TTTCCGAAAG
ATTTGTTGGG CATTTTGGCG GAACATCTCG TCCTGCCTCT TGGGGGAGCG AGGAAACCGT
TGTTGTTGGG CTCGAAGCTT GATGTTCCAA GTCTTTTACT CTCTTATTCA CCAGGAATGC
CCTTCCCCTC ATCCGGCTCG ATGCGAAATC AATCCCAATC TACTCAGACT CAGCCCGTGT
CCAACCTTTG TCAAGCTCCT CGGGTACCAG CATTAGGTAA CCTTCTTCTG AGCAGCTGTC
CTGGTAAGAG ACTGAGAATG GAAGGGCCGA GTAAAGGTAG GGGACCCGTT TGTAGGGATT
TAGCGACAGA TTTGAAGAGG ATCAAGGGGG AGGGTGTTGG GTGTTTGGTC TGGTGAGTTC
AAGCTTCGTT GCTTAAATCT TTTGGTTGCT AATGTGGGGT GGGGACGGGC ATTAGTTGTT
TGGATGATGA AGAACTTGAA CTGCTTGGTG TACCTTGGGA GACGTATCGC GATGTCGCTG
CGGAAACGGG GTTGGATGTC ATTCGGTATG TAATATGAAC TTCTTCCTTC TCTCCTTCTT
CCCCTCCTTG CCTGTACATT GGCATTAGCT CATCATTACT CCTAGGTTGC CAATGCCTGA
CGGCTTCACT CCCGTTTCGA TGGAACTTTT TGACTCTCAA GTATCCCTCA TCGCTACAAA
ATACACTTTG CAGGGCATCA ACGTTCTTGT TCACTGTCGA GGTGAGTTTC CATCTTTTCT
GCGCCTTCGA CCTCGTGATC GAGCGGGTGT GTGTTATGTG TATGAGGAAT TTGATGATTG
ACACTGTTGG CTCTGCCCTC CCCCTTCTAC CCCGTCCGCC TTTTTTCCCT TGGACAACCT
TCGAACACAT AATCATAGGT GGAGTTGGAA GAGCAGGTAT GACGGCTTGT GCGTGGGCTA
TCAAAATGGG TTTCGTCCAG CCTCATCCTT CATTGGTCAT CGTTGAAGAA GCTGCTCGAC
AACGATACAA TCTCACTCAC GGTATTTCAC CCACGTCAAA CTCCCCGACA CCTATCGCTC
CCTCTGCCGC TGTGCCCGCC GAACTCGAGC ATCAAATCGT TATGAGCATG GTTGAGAGAG
TCATCGCGAT GATCAGGTGT CGAAGGGGAT TGAAAGCGAT TGAAAGTTTC GAGCAGGTGG
CGTTCTTGAT GCGATATGTG GGATGGTTAA GGCAGGGTGC AAGGAGCGCC TGA
 
Protein sequence
MSRAATATPP PQTYFPQRNA TPSMRHAAPS PAIHFGRQHQ HLGQSHVKRR VKGNGGSGLK 
KQHIPNSREQ NEEDQLGMSF EDLGAERVLQ NPNSDKADID VDIDIDLDLD PNLTPASSRA
SIVSGLSQDQ GPVLLTPVQS DSEGRGPVLV SNSGRVSSAG SEGGEEEGRA SPGWMGKMAM
AVVNTGMSVS RGFGNAQAHG KRVQSQPQPQ PQAQAQAHDQ TLAHPAEKID DQQRQREWAE
AENRRIHECA RLCSQWPLSG YNLGKYGPGG AATFYQPQSF CNPQHVAWVM HRQAQIEQRL
ALTEGTFFDC RKVRQRKRQG DCENGRGCRA REEDEQSEVS TSTDDSTLSS RLSGSTCSTD
SRSNSTANSS NPSSSQDENN SEPLSLVQPQ RPSAPRSLTA EHYAYGRTAQ KMEREKEMAE
DLREAMACSL LEFSHPERRG SVTSSATKDV SADGRDENGE KLENAEVEGR RERGRKMFAE
LEALARDVGL GHVGEGMDVD EDLCEESQPQ PQPDTQATPS AQTHAASYSS EPSMPRTRTQ
RAQSCGSKRP IPSTRCSADG EEEKRRKVRP VAPPTTSSGF DPMCTATMEV EEAVDEAMVV
EEAVDNGVIM PATQTQSQNQ SHTGESRLKT GRGKGMSSSV PDLSKTYPHS ESATTYPPQR
QQQPEQSNSR DIYPPPAVFG VVVKTSESHP IIISPFFPKD LLGILAEHLV LPLGGARKPL
LLGSKLDVPS LLLSYSPGMP FPSSGSMRNQ SQSTQTQPVS NLCQAPRVPA LGNLLLSSCP
GKRLRMEGPS KGRGPVCRDL ATDLKRIKGE GVGCLVWLPM PDGFTPVSME LFDSQVSLIA
TKYTLQGINV LVHCRGGVGR AGMTACAWAI KMGFVQPHPS LVIVEEAARQ RYNLTHGISP
TSNSPTPIAP SAAVPAELEH QIVMSMVERV IAMIRCRRGL KAIESFEQVA FLMRYVGWLR
QGARSA
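
The sequences above are laid out in space-separated blocks of 10 characters, 60 per line. A minimal Python sketch of how such text might be joined into one continuous string, and how a GC percentage like the one listed in the record could be computed, is shown here. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTCCCGAG CAGCTACCGC CACTCCTCCA CCTCAAACTT ATTTTCCCCA GCGGAACGCC"

seq = clean_sequence(first_line)
print(len(seq))                        # number of bases after removing spaces
print(round(gc_fraction(seq) * 100))   # GC percentage of this sample only
```

Passing the full gene sequence text to `clean_sequence` would give a string whose length can be compared with the listed gene length.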