Gene CNN01520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN01520 
Symbol 
ID3255445 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp441032 
End bp442417 
Gene Length1386 bp 
Protein Length316 aa 
Translation table 
GC content47% 
IMG OID638254567 
Productconserved hypothetical protein 
Protein accessionXP_568626 
Protein GI58262432 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5539] Predicted cysteine protease (OTU family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.999443 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGGA TAGTAAGCAG TGACTTCCGA AATTGCAACT TTTATCCTTA TTATTACCCT 
GTAGTTAAGT ATGGTTATCC ACCCAAGCCA CTTCCATCAA CGGCGGGGCC GTTGTCTTCC
ATCCCCATTG GCAAGGGAGA GCAGCTTATT GTCAGCTCTA TACCATCGGG TGGCCCTTCA
AAGAAGGTAC CAGTTGTTGC CCAACCTTCA ACAACTACTA GCTCAGCTCC TGCTGCTTCA
AGGCCTACCA ATGCGTCACC AGTTCTTGCT GCCCCCTTAG TGTCGAATGC TTCTCAAGGC
GAAGGTGTAG AAAGCGGCGA GAGCGTTGCC GTACCGGGCA GAGACGCTGG ATACCTGCAG
CTGAGGGTTG TACCGGATGA CAACTCTTGT CTTTTCAGTG CTATTGGTAT TGTATTTGAG
GGCGGTATCG AGGCAGCTCA GAGGCTGCGT ATGGTGGTGG CCAACGCCAT TAAGGATGAC
CCTTTTACAT ACTCCGAGGT TATGCTTGGG TAAGTGATCG GGATCGTATA AGTCAGGATA
ACTTTGAGCT AATGTGGATG TAAGCCAACC GATCGATCAG TATGTGAAGC GGATCCAAAA
GCCGCAGACA TGGGGAGGAG CTATCGGTAT GTTTCTATAC ATTTCCTGTA CAGATCTCAT
GCATGGCATT GTTGACGTTG TTCAGAACTC TCAATATTTG CCAAACAGTA AATCAAAAAA
ACCATTACAT ATCAGACTTT GCTTACCAAA AACCACAGCT ACAAGACTGA AATTGCCTCG
TTTGACGTAG CAACAGGGCG TTGCGATAGA TTCGGCCAAG ATGAATATGA CACACGGTAC
GTTCGTGTAA CTTGGGGTCC GTTCTGGTGG CTCATGGCGA TCTGCTCATT TAGTTGCATT
CTCGTCTACT CTGGTATTCG TGAGTCCTAA TTGACCTTAG TCGCCTTGCC ATCTCTGATT
TTTTAACCCT CCACAGACTA CGACGCCATC AGTCTATCAC CTCTCCCTGT TTCCCCAGCT
TCTTTCCACA CCACAATATT CCCTGTAACT GATCAAATCA TTCTTACTAC TGCGGACAAG
CTCGTCTCAC AACTTCGAGC TAGGCATTAT TATACCGACA CTGCAAACTT TGATCTCAGG
TGTGCGATAT GCAAGAAGGG TCTGAGAGGA GAAAAAGGTG CGAGAGAACA CGCCATGCAG
ACTGGTCGTG AGTTATCTCC TGTCAACTAT TCGTGACCTC AAGCTAATTC CGCCAGATGT
CGAGTTTGGC GAGTACTAAG GTTCCATCAA TTATCCTTCC GGCACAATAT ATGAGCGTTA
TTCTGTCTTT TATGTAGTGT TAAACTACCA TAATATCCAT CTAATGAAGT ATATCAAATC
CGTCTG
 
Protein sequence
MSRIVSSDFR NCNFYPYYYP VVKYGYPPKP LPSTAGPLSS IPIGKGEQLI VSSIPSGGPS 
KKVPVVAQPS TTTSSAPAAS RPTNASPVLA APLVSNASQG EGVESGESVA VPGRDAGYLQ
LRVVPDDNSC LFSAIGIVFE GGIEAAQRLR MVVANAIKDD PFTYSEVMLG QPIDQYVKRI
QKPQTWGGAI ELSIFAKHYK TEIASFDVAT GRCDRFGQDE YDTRCILVYS GIHYDAISLS
PLPVSPASFH TTIFPVTDQI ILTTADKLVS QLRARHYYTD TANFDLRCAI CKKGLRGEKG
AREHAMQTGH VEFGEY