Gene CNC01950 details


Gene Information

Locus tag: CNC01950
Symbol:
ID: 3256196
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 539342
End bp: 540486
Gene Length: 1145 bp
Protein Length: 233 aa
Translation table:
GC content: 46%
IMG OID: 638255415
Product: conserved hypothetical protein
Protein accession: XP_569446
Protein GI: 58264580
COG category: [R] General function prediction only
COG ID: [COG0693] Putative intracellular protease/amidase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAATAATAAT AGTAGTTGTA TATCAATAGT ATTGTTCACT GCCTGACTAA CCGTACTTCA 
AAAACCTCAG TTCGTAAATT CAGCCATGTC ACAGCCTAGC AAAGCCGTAC TTTTCGTCTT
CACCTCGGCC GAGAAGCTTC TCAACGGTGC AGTACGTGTG ACCTCCCACC CTTTACCTTA
TTTATCTTTA GTACCTGCCA ATGACTGATA TTGTTTTGCA GCCGACAGGA TGGTATCTTC
CTGAAGCCGC CCACCCGTAC TACGTCCTTT CCCCCCATTA CCGTATTGAA GCCATCTCCA
CTAAGGGTGG CCCCGTCCCT GTCGACGAAA CCTCTGTCAA GAATTTCCAG GACGAAGATT
CGCAGAAGTT TTTGAAAGAT CCTGAAGCTC AAAATTTGGT CAAAAACACC AAAAAGGTAG
AAGACGTCAA GGCCGCGGAT TATGAAGCCA TGTTTGTCAT CGGCGGAGTG AGTGCTGATT
CCAAAAGGGA AGAGCGTAAT TCCAGACCGG AAGAGCGTGC TTGAATGGAA GTGTTTAGGA
GCTGATCATG AAGCATCTCG TAGCATGGGC CTTTGATTGA TTTGGCGAAG AGTGAAAAGT
TTGCCAAGCT TGTAGAGGAC TTCTACGTTG CAAAAAAGGT AAGACAATTG AACATTCTCA
TGGGTGTCAT TCGGCTAAGG AATAGGGCTA GCCGGTGTCT GCAGTGTGTC ATGGTCCTGG
CGCTTTCATC CTTGCTACCA ACCCGGCGAC TAGGAGGTCT ATTTTCGCTG GCGCACGTGT
CACAGGCTTT TCTAACAGTG AAGAAGCACA AACTCCTTAC AATGATTTTG TCAATATTCT
CCCTTTCAGT TTGGAAGACA AAATCAAGGA ACTTGGTGGG CAGTATGAGA AGGCCGACCA
AGACTGGGGT GTCAAAGTCA TTTGGGATCA GGGAGTTTTA ACTGGTAGGT CACCATTGCG
CATTACCCAA GCACCATCAC ATCACGCTCT CCACTGAACA TATATCCAGG CCAAAACCCT
GCTTCTGCTG GACCTCTCGC CGTAAAGTTG AAGGAAATTT TGGAAGCCTG ATATGGCATA
CAATTAAGCT AAGGACGGCA TATGTAAAGA ATACAAGGAG ATGTGACTTA TCATGATGTG
CAAAC
 
Protein sequence
MSQPSKAVLF VFTSAEKLLN GAPTGWYLPE AAHPYYVLSP HYRIEAISTK GGPVPVDETS 
VKNFQDEDSQ KFLKDPEAQN LVKNTKKVED VKAADYEAMF VIGGHGPLID LAKSEKFAKL
VEDFYVAKKP VSAVCHGPGA FILATNPATR RSIFAGARVT GFSNSEEAQT PYNDFVNILP
FSLEDKIKEL GGQYEKADQD WGVKVIWDQG VLTGQNPASA GPLAVKLKEI LEA