Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG01950 |
Symbol | |
ID | 3258791 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | - |
Start bp | 545630 |
End bp | 548552 |
Gene Length | 2923 bp |
Protein Length | 803 aa |
Translation table | |
GC content | 48% |
IMG OID | 638257813 |
Product | hypothetical protein |
Protein accession | XP_571889 |
Protein GI | 58269466 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0464] ATPases of the AAA+ class |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACAAAGAAGG ACAATCCAGA CGCTAGGGCT TCTCTGCTCG TCGTATATAT TTTGTTCATC TGCTAAATCG ACGTTCGATC CAATATTACA AAATTACCAT GTCTGCCATT GTCACACTCC GTGCAATCCC TACTGAGGCT GACACGGATG CTTTGAAATC GATTGTCAGC CGTGATGTCC GTCGTGCATA CATCTCTCCG GAGACACTCC GGACGCACAA AGTAGCTGCC GGTGATTGGA TGCGATTTAA GTCAGGCAGT ACTTTCATTA TGGCCCAGGC ATGGCCCAAA GCTAGCATAG ATGATGACGG TGAGTTCACA AACTCTATAG GAACCTGTCA GCAGTTAAAA TAACCTGTCT TTCAGTTGTG GCACTTTCAT CAGCTCAAAT CAATAATCTT TGCGGGTCAG AGGTAGAGAT GCATCATTTC AAAGCCGGAC AAAGCCAAGG CCGCTTGGTA TCAATTGTGC TGAAGGAAGT ACCTTTGACC GAAGCACCCT CGTCTTCCAA GCTGAACGTC GTTACTCGAC CGCCCATAAA CTTGGATAGC CCAAGAGAAT GGGTTTGGTG CAAAGCTGCT ATCAAGGAGG CACTCAGTAA GACTTCTCTA GCTGTTGTCT TCTGCATAAT GTTGACACCT GATATAGCTT CCTTGCACTA TGTCAGGACG GGCTACTCTC TTATCGTCGG TGACACTGAA AAGGCACCTC GCAAGTTTGA AATCATCTCC GTCGAAGTGT CTCCCAAGGA TATTGAGAAA CGTTTAGCGT CGATCGAAGA CGGAATGGAG GAACTTGCTA TTGATGAAGA AAACCAGCGA CTGTATGAGA TGCACTGGAA AACTTCAGTG TCCCTTGAAG TAGAGAAACT TCAGGAAAAT AAAGAGGTGC ACGCGGCCAT TCCAGACGAA AAAAAGGTGC TCAACACTAA AGACTTCGTA AGTTTTCGTC GTAACCAGAA GATTAAATCC TGAGAGATTA TAGTCGAAGA TGTCTACAAG TTCCGTCCCT CATTATATCA ACTTTTTCAC CCCCGCCGAA TCCCCTGTTT CCGCTTATAC ATTTCTGGGG GGCCTTCAAT CTCAAATTGA TCAAATCAAG ACCCTTCTAG ATCTTCCGAT GCTTCATCCT GACCTGTATA TCAAGTTTGG ACTCAACCCT CCTAGAGGCA TCCTCCTTCA CGGCCCCCCG GGAACGGGTA AGACGGCGCT GGCCAGGGCG GTCGCATCGT CTGCTGGATG TTCTTGCATT GTCGTCAACG GACCTGAACT TTCCTCAGCA TATCATGGCG AAACGGAAGA ACGGTTACGT GGAGTATTTA CGGAAGCAAG GAAACGTAGT CCATGTATTG TAGTATTGGA CGAAGTGGAT GCGCTTTGTC CTAGGCGGGA TGGTGGGGAG GGAGGCGAAG TTGAACGAAG GGTGGTAGCT ACTTTGCTGA CCTTGATGGA CGGTATGAGT CACGAGAGCT TGGAAGGTGA ACGGGTATTT GTAGTAGCTG CTACAAATAG ACCCAACAGC ATTGATCCTG CTCTTCGTCG GCCAGGGAGG TTTGATAGAG AGATTGAAGT CGGTGAGCTA ACTTTCTTTT TTCAAGCAAC AAGCAGAGGC TAACATACTT TCACAGGTGT GCCAGATGTC AAAGGCCGTC GAGAAATTCT CGACATTATG CTCTCAAAAA TTCCTCATTC ACTTTCAGAA AAGGACCTCT CTTCTCTTGC CGCGCGCACT CATGGCTATG TCGGTGCTGA TCTCTTCTCC CTCGTGCGAG AATCCGCTTC TGCGGCCATC TCCCGCTTTC ATCTGTCTCC GTCATCAACC CTCTCCGAAC CTGTCTTGAC CAATGCGGAC ATCCTCTCAA CACTTCCTTC CATCCGGCCG TCTGCCATGC GCGAGGTATT CATCGAAACT CCGACTGTTC GATGGTCAGA TATAGGTGGT CAACAAGATG TAAAGCAAAA ACTGAGAGAG TGCATTGAAT GGCCATTGAT GCATAGAGAC ACGTTCAAGA GACTAGGCGT AGAAGCTCCT AGGGGAGTGT TGTTGTATGG GCCTCCGGGC TGTAGTAAGA CCATGACGGC TAAAGCACTG GCCACAGAGA GTGGTATCAA CTTCATTGCT GTGAAGGGTC CGGAGGTGAG CTACAGCGAA TTTCTATGAA GCGAGAGTGC TGATCATCTC CAGCTTCTCA ACAAGTACGT CGGCGAGTCC GAAAGGGCAG TAAGAGAGAT TTTCCGCAAG GCACGCGCTG CTTCCCCTTC TATCATTTTC TTTGTGAGTC GAAAGGCGGT TTTCAGTCTC ACAACTAACG AAGTATCGTT ACAGGATGAG ATAGACGCCC TTGGCTCAGC ACGATCGGAT GACCATGCTC ATTCCGGTGT TCTTACTTCT CTCTTGAATG AGATGGACGG TGTAGAAGAG TTATCGGGTG TGACAGTAGT GGCAGCTACC AATCGACCCG ATGTCCTGGA CTCTGCTTTG ATGCGTCCCG GAAGATTGGA TCGTATCTTG TACGTAGGGG CACCCGACTT TGAGACTCGA AAAGATATCT TCCGAATCAG ATTGGCCACG ATGGCCGTGG AACCAGGCAT TAATGTTGAG CAACTGGCCG AAATAGTGAG CCATACTAAC AGAGCAATGA CCCACAGGAC CTAAACTGAC ACATGGTTTA GACTGAAGGC TGCTCTGGTG CAGAAGTCGT CTCAATCTGC CAGGATGCGG CTTTGGCAGC CATGAACGAA AGTCTTGATG CTCCATATGT AAGTATATCT GGACGCCATA CAAGTGCGAA GACTCATCCT CGCTCGCAGG TCAAAGCTTC CCACCTTGTC AACTCAGCTC ACACTGTCCG AAGAAGAATT ACTCCGGAAA TGATTGCATT TTTCGAGGAA TGGAGAGACT TATCTGGGGT TCGTAGTGCA TAA
|
Protein sequence | MSAIVTLRAI PTEADTDALK SIVSRDVRRA YISPETLRTH KVAAGDWMRF KSGSTFIMAQ AWPKASIDDD VVALSSAQIN NLCGSEVEMH HFKAGQSQGR LVSIVLKEVP LTEAPSSSKL NVVTRPPINL DSPREWVWCK AAIKEALTSL HYVRTGYSLI VGDTEKAPRK FEIISVEVSP KDIEKRLASI EDGMEELAID EENQRLYEMH WKTSVSLEVE KLQENKEVHA AIPDEKKVLN TKDFSKMSTS SVPHYINFFT PAESPVSAYT FLGGLQSQID QIKTLLDLPM LHPDLYIKFG LNPPRGILLH GPPGTGKTAL ARAVASSAGC SCIVVNGPEL SSAYHGETEE RLRGVFTEAR KRSPCIVVLD EVDALCPRRD GGEGGEVERR VVATLLTLMD GMSHESLEGE RVFVVAATNR PNSIDPALRR PGRFDREIEV GVPDVKGRRE ILDIMLSKIP HSLSEKDLSS LAARTHGYVG ADLFSLVRES ASAAISRFHL SPSSTLSEPV LTNADILSTL PSIRPSAMRE VFIETPTVRW SDIGGQQDVK QKLRECIEWP LMHRDTFKRL GVEAPRGVLL YGPPGCSKTM TAKALATESG INFIAVKGPE LLNKYVGESE RAVREIFRKA RAASPSIIFF DEIDALGSAR SDDHAHSGVL TSLLNEMDGV EELSGVTVVA ATNRPDVLDS ALMRPGRLDR ILYVGAPDFE TRKDIFRIRL ATMAVEPGIN VEQLAEITEG CSGAEVVSIC QDAALAAMNE SLDAPYVKAS HLVNSAHTVR RRITPEMIAF FEEWRDLSGV RSA
|
| |