Gene CNG01950 details

Gene Information

Locus tag: CNG01950
Symbol:
ID: 3258791
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006692
Strand:
Start bp: 545630
End bp: 548552
Gene Length: 2923 bp
Protein Length: 803 aa
Translation table:
GC content: 48%
IMG OID: 638257813
Product: hypothetical protein
Protein accession: XP_571889
Protein GI: 58269466
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0464] ATPases of the AAA+ class
TIGRFAM ID:
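
The Gene Length field follows from the Start bp and End bp fields if both are read as 1-based, inclusive positions on the replicon. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic, using hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_length_aa + 1)


length = gene_length(545630, 548552)
coding = coding_nucleotides(803)
print(length)           # 2923, matching the Gene Length field
print(length - coding)  # 511 positions not accounted for by codons
```

The 511 bp remainder is consistent with the gene being flagged as spliced: it would cover introns plus any untranslated sequence included in the gene span.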


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ACAAAGAAGG ACAATCCAGA CGCTAGGGCT TCTCTGCTCG TCGTATATAT TTTGTTCATC 
TGCTAAATCG ACGTTCGATC CAATATTACA AAATTACCAT GTCTGCCATT GTCACACTCC
GTGCAATCCC TACTGAGGCT GACACGGATG CTTTGAAATC GATTGTCAGC CGTGATGTCC
GTCGTGCATA CATCTCTCCG GAGACACTCC GGACGCACAA AGTAGCTGCC GGTGATTGGA
TGCGATTTAA GTCAGGCAGT ACTTTCATTA TGGCCCAGGC ATGGCCCAAA GCTAGCATAG
ATGATGACGG TGAGTTCACA AACTCTATAG GAACCTGTCA GCAGTTAAAA TAACCTGTCT
TTCAGTTGTG GCACTTTCAT CAGCTCAAAT CAATAATCTT TGCGGGTCAG AGGTAGAGAT
GCATCATTTC AAAGCCGGAC AAAGCCAAGG CCGCTTGGTA TCAATTGTGC TGAAGGAAGT
ACCTTTGACC GAAGCACCCT CGTCTTCCAA GCTGAACGTC GTTACTCGAC CGCCCATAAA
CTTGGATAGC CCAAGAGAAT GGGTTTGGTG CAAAGCTGCT ATCAAGGAGG CACTCAGTAA
GACTTCTCTA GCTGTTGTCT TCTGCATAAT GTTGACACCT GATATAGCTT CCTTGCACTA
TGTCAGGACG GGCTACTCTC TTATCGTCGG TGACACTGAA AAGGCACCTC GCAAGTTTGA
AATCATCTCC GTCGAAGTGT CTCCCAAGGA TATTGAGAAA CGTTTAGCGT CGATCGAAGA
CGGAATGGAG GAACTTGCTA TTGATGAAGA AAACCAGCGA CTGTATGAGA TGCACTGGAA
AACTTCAGTG TCCCTTGAAG TAGAGAAACT TCAGGAAAAT AAAGAGGTGC ACGCGGCCAT
TCCAGACGAA AAAAAGGTGC TCAACACTAA AGACTTCGTA AGTTTTCGTC GTAACCAGAA
GATTAAATCC TGAGAGATTA TAGTCGAAGA TGTCTACAAG TTCCGTCCCT CATTATATCA
ACTTTTTCAC CCCCGCCGAA TCCCCTGTTT CCGCTTATAC ATTTCTGGGG GGCCTTCAAT
CTCAAATTGA TCAAATCAAG ACCCTTCTAG ATCTTCCGAT GCTTCATCCT GACCTGTATA
TCAAGTTTGG ACTCAACCCT CCTAGAGGCA TCCTCCTTCA CGGCCCCCCG GGAACGGGTA
AGACGGCGCT GGCCAGGGCG GTCGCATCGT CTGCTGGATG TTCTTGCATT GTCGTCAACG
GACCTGAACT TTCCTCAGCA TATCATGGCG AAACGGAAGA ACGGTTACGT GGAGTATTTA
CGGAAGCAAG GAAACGTAGT CCATGTATTG TAGTATTGGA CGAAGTGGAT GCGCTTTGTC
CTAGGCGGGA TGGTGGGGAG GGAGGCGAAG TTGAACGAAG GGTGGTAGCT ACTTTGCTGA
CCTTGATGGA CGGTATGAGT CACGAGAGCT TGGAAGGTGA ACGGGTATTT GTAGTAGCTG
CTACAAATAG ACCCAACAGC ATTGATCCTG CTCTTCGTCG GCCAGGGAGG TTTGATAGAG
AGATTGAAGT CGGTGAGCTA ACTTTCTTTT TTCAAGCAAC AAGCAGAGGC TAACATACTT
TCACAGGTGT GCCAGATGTC AAAGGCCGTC GAGAAATTCT CGACATTATG CTCTCAAAAA
TTCCTCATTC ACTTTCAGAA AAGGACCTCT CTTCTCTTGC CGCGCGCACT CATGGCTATG
TCGGTGCTGA TCTCTTCTCC CTCGTGCGAG AATCCGCTTC TGCGGCCATC TCCCGCTTTC
ATCTGTCTCC GTCATCAACC CTCTCCGAAC CTGTCTTGAC CAATGCGGAC ATCCTCTCAA
CACTTCCTTC CATCCGGCCG TCTGCCATGC GCGAGGTATT CATCGAAACT CCGACTGTTC
GATGGTCAGA TATAGGTGGT CAACAAGATG TAAAGCAAAA ACTGAGAGAG TGCATTGAAT
GGCCATTGAT GCATAGAGAC ACGTTCAAGA GACTAGGCGT AGAAGCTCCT AGGGGAGTGT
TGTTGTATGG GCCTCCGGGC TGTAGTAAGA CCATGACGGC TAAAGCACTG GCCACAGAGA
GTGGTATCAA CTTCATTGCT GTGAAGGGTC CGGAGGTGAG CTACAGCGAA TTTCTATGAA
GCGAGAGTGC TGATCATCTC CAGCTTCTCA ACAAGTACGT CGGCGAGTCC GAAAGGGCAG
TAAGAGAGAT TTTCCGCAAG GCACGCGCTG CTTCCCCTTC TATCATTTTC TTTGTGAGTC
GAAAGGCGGT TTTCAGTCTC ACAACTAACG AAGTATCGTT ACAGGATGAG ATAGACGCCC
TTGGCTCAGC ACGATCGGAT GACCATGCTC ATTCCGGTGT TCTTACTTCT CTCTTGAATG
AGATGGACGG TGTAGAAGAG TTATCGGGTG TGACAGTAGT GGCAGCTACC AATCGACCCG
ATGTCCTGGA CTCTGCTTTG ATGCGTCCCG GAAGATTGGA TCGTATCTTG TACGTAGGGG
CACCCGACTT TGAGACTCGA AAAGATATCT TCCGAATCAG ATTGGCCACG ATGGCCGTGG
AACCAGGCAT TAATGTTGAG CAACTGGCCG AAATAGTGAG CCATACTAAC AGAGCAATGA
CCCACAGGAC CTAAACTGAC ACATGGTTTA GACTGAAGGC TGCTCTGGTG CAGAAGTCGT
CTCAATCTGC CAGGATGCGG CTTTGGCAGC CATGAACGAA AGTCTTGATG CTCCATATGT
AAGTATATCT GGACGCCATA CAAGTGCGAA GACTCATCCT CGCTCGCAGG TCAAAGCTTC
CCACCTTGTC AACTCAGCTC ACACTGTCCG AAGAAGAATT ACTCCGGAAA TGATTGCATT
TTTCGAGGAA TGGAGAGACT TATCTGGGGT TCGTAGTGCA TAA
 
Protein sequence
MSAIVTLRAI PTEADTDALK SIVSRDVRRA YISPETLRTH KVAAGDWMRF KSGSTFIMAQ 
AWPKASIDDD VVALSSAQIN NLCGSEVEMH HFKAGQSQGR LVSIVLKEVP LTEAPSSSKL
NVVTRPPINL DSPREWVWCK AAIKEALTSL HYVRTGYSLI VGDTEKAPRK FEIISVEVSP
KDIEKRLASI EDGMEELAID EENQRLYEMH WKTSVSLEVE KLQENKEVHA AIPDEKKVLN
TKDFSKMSTS SVPHYINFFT PAESPVSAYT FLGGLQSQID QIKTLLDLPM LHPDLYIKFG
LNPPRGILLH GPPGTGKTAL ARAVASSAGC SCIVVNGPEL SSAYHGETEE RLRGVFTEAR
KRSPCIVVLD EVDALCPRRD GGEGGEVERR VVATLLTLMD GMSHESLEGE RVFVVAATNR
PNSIDPALRR PGRFDREIEV GVPDVKGRRE ILDIMLSKIP HSLSEKDLSS LAARTHGYVG
ADLFSLVRES ASAAISRFHL SPSSTLSEPV LTNADILSTL PSIRPSAMRE VFIETPTVRW
SDIGGQQDVK QKLRECIEWP LMHRDTFKRL GVEAPRGVLL YGPPGCSKTM TAKALATESG
INFIAVKGPE LLNKYVGESE RAVREIFRKA RAASPSIIFF DEIDALGSAR SDDHAHSGVL
TSLLNEMDGV EELSGVTVVA ATNRPDVLDS ALMRPGRLDR ILYVGAPDFE TRKDIFRIRL
ATMAVEPGIN VEQLAEITEG CSGAEVVSIC QDAALAAMNE SLDAPYVKAS HLVNSAHTVR
RRITPEMIAF FEEWRDLSGV RSA
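
Both sequences above are displayed in blocks of 10 residues, 60 per line. To compare them with the Gene Length, Protein Length, or GC content fields, the grouping whitespace has to be removed first. A minimal Python sketch, assuming the sequence text is pasted in as a plain string; the helper names are hypothetical and not part of any database tool:

```python
def ungroup(display_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(display_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C among the bases of an ungrouped DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First display line of the gene sequence, used as a small example.
first_line = "ACAAAGAAGG ACAATCCAGA CGCTAGGGCT TCTCTGCTCG TCGTATATAT TTTGTTCATC"
dna = ungroup(first_line)
print(len(dna))  # 60
print(round(gc_fraction(dna), 2))
```

Applying `ungroup` to the full gene or protein sequence gives a string whose length can be checked against the corresponding length field. Applying `gc_fraction` to the full gene sequence gives a value to compare with the GC content field.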