Gene CNL03980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL03980 
Symbol 
ID3254739 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp91855 
End bp93112 
Gene Length1258 bp 
Protein Length315 aa 
Translation table 
GC content47% 
IMG OID638253870 
Productconserved hypothetical protein 
Protein accessionXP_567954 
Protein GI58261088 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5160] Protease, Ulp1 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.401732 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCTCAG GGAACGTAGG TGCTTTAAAT CGGAGGAGAA CGCTTAGAAA AGGTGGGTAT 
GAAAGGATTC GTTGGTGGAT AAAGGTACTG ACAATGGATG GAAAGTCGAT GATAGGCTAG
CGGGTATACG GACGGTTCTG CGCAACGATC AAGCCACCAA GTCATTTGAC GAAGTTGTAC
AATCAAAACC GCAGCAAGAG GTTTACAAAC CTAAACAAAA ACGGGAAATA GGCGTCAAAG
CCCAAGAGCA AGCCTCCAAG TCGTGCGTCC AAACCCGCCA CCTCCCCGCA AAGCTAATAA
CCTACCGTAT AGATCCAGTT TTGAATTCGT CTCCATTCTC AAGAACCTCA AAGCACTGCA
GCTCGCCAAA GAAAAGGCCC TCAAACCTTC CGTGCCATCC AAACTGTCTC CTCAACAAGA
ATCCAAAGTT GACGCACACC TTCGAAATCC CAAATTCAAA GTCACTCTCA ACGTCTCTGA
AGTGGAAGCT GGAAGTCTCA GGAGGCTCAA GCCTAGTACG TGGTTGGATG ATGAGGTGAT
GAACGCGTAC TGCGATTTGA TGTGTAGTCG GTTCAAGGAT GGGAAGGCGG GGAGAAAAGT
TCATTCTTTG AATTCCTTTT TCTATGGCAA GCTTGTGGAT CAGGGGTACG CCGCTGGACG
GTTGAAGCGA TGGACTAAAA AAGTGAGCTT GTGCCCTATG CTCGTCCTGT CCATCCCGCT
AATCCTGGCA CGCTATTTCA AGATCGATAT CTTCTCGCTC GATGTTCTCA TCTTCCCTAT
CAACCAAGGT AACATGCACT GGACCGCATG TGCCATTAAT TTTGCCAAGA AACGGATAGA
GTACTACGAC TCGATGGGAG ATTATGGGAA TGCGAGGAAA CAAGTGTTTA GAAAAGTGAG
AGGATATGTG GAGGCTGAAC ACAAGGAAAA GAAAGGAAGG GCAATGGATT GGGAAGGATG
GCATGATTAC TTCAACAAGG TGTGTATCGC CAAATCTTAC CAATCTCATG CGCCATCAGA
CTCATGTTTT TTTCTTTTCT TTCTTTCTTT CTTTTAGAAC ACACCACAAC AGAATAACGG
TTCAGACTGT GGCGTCTTTT CATGCCAAAC ATTAGAGATG ATCACTCGCG GTCGGGATAT
TGTCACCCAG GGTTTCGAGT TTACTGCGAA GGACATGCCG TTCATGAGGA GAATGATGAT
TTATGAGATT GGGGAAGGCA AATTAGAGAA GAGGACCTGG GGTTCGCCTG CGTTATAG
 
Protein sequence
MFSGNVGALN RRRTLRKATK SFDEVVQSKP QQEVYKPKQK REIGVKAQEQ ASKSFEFVSI 
LKNLKALQLA KEKALKPSVP SKLSPQQESK VDAHLRNPKF KVTLNVSEVE AGSLRRLKPS
TWLDDEVMNA YCDLMCSRFK DGKAGRKVHS LNSFFYGKLV DQGYAAGRLK RWTKKIDIFS
LDVLIFPINQ GNMHWTACAI NFAKKRIEYY DSMGDYGNAR KQVFRKVRGY VEAEHKEKKG
RAMDWEGWHD YFNKNNGSDC GVFSCQTLEM ITRGRDIVTQ GFEFTAKDMP FMRRMMIYEI
GEGKLEKRTW GSPAL