Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA05340 |
Symbol | |
ID | 3253303 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 1427059 |
End bp | 1430042 |
Gene Length | 2984 bp |
Protein Length | 822 aa |
Translation table | |
GC content | 49% |
IMG OID | 638252852 |
Product | transposable element- crypton-Cn1, putative |
Protein accession | XP_566846 |
Protein GI | 58258867 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGAACAACAT TGCTCATATT ATCCATCGTT CAACTTTTTT TGACTGCCAC CACCTTGCAC AATGCCAGCT ACCAGACGCA CAAAGGGGCT TGACGCATTC CAACCTGCTG ACACTCTCGA TCCTCGCACG GCTCACACGA TGCAAGATAT CGCTGCTCTT CGCAAAAGTA ATTTGACATT GAAAACAGCA CGACAATATG CAACCAAAAA TAAGCAATGG TATGCGTACT GTGCCCACAT GAATTTTCTT ACAAAGTGAG CCGCCATCTC CCAAAGATGT ATGTGTACTA GTTGTTGGCT GACGTGTTGC TCTTCAACCT GAAGAGATAC TGTAACTCCG GCAAACGCAG CGTCGTATCT GTACAATTGG GTACTTAAAC TTCCTCCTGA CCATCACCGT AAATGATTTC CCCACAGGCT TTGTAGAGCA TACGTTGATA TACTTCGCAG GTAAGAAGCA GGCGCACATC CTTGTAAGAG AAGCCCAAGG TGGGCAAGTG CCACAAACTG TGCGGGGTCA GGAGCAAGGT CAGACGGCAC TGTTTATTGA GGAAGACGAG GAAGTCGACG ATTCAATGCT TGCAGGTATT GATGAGCCTG CCGACCCAGA CACCGTTCAG GACGAGCGCC AGATGCTTCT GAATGGTGTC AATGACTTGC CGGATTTTAA GGAATTGCGT GAGAGTCTTG GATATCATGG TGCAGGTGAG CTATCTCAGC TTCAGAGCAG GGTGGCTGGT AACGAGCCTA CAGACTCGAG TCTTGAGCAT CTGCTGCAGT GCAACCAATC TCTTGCTTCT GTCAGGCTCT ATTTGGCTGC TCTTGTCGAC CTTTGGGAGA CCCAAAGGCA GGCGGGAATG AACGCCTTTC CCTCGCCGCG CACGAAAGCA ACCAACTCTA TCCTCAACGC TCTACGTCGT GTACGCAACG AACAAAGTAT CCTTCGTTGT GATGATAAAG GTGACGGTGA GCTTTATCCT GCCTTTCCAT GTAATAAGAA TCTAAGCTGA GCCCGCCCTC CAGATCTGTT CTACGATGGG ATAGCGACGA CCGAGAACAT GAAAAAGTTG TTTCTCCATT ATCTGCATCG GGACAGCGTC GAAGGTCTGC GCGATCTCGC TGCGCAAGCC GTCGGCATCC ATGGACTTTT ACGTGCGGAT GATCAGCTAA GGATTACCCT ATCGTCCATG TCCCTCAGAC TCTTCGAGGA TGAGGGACCC ACACCTTGTC GTGGTGTCGT TTTCGCCATA AGAGAGGGGA AGACGACACA TGACGGTCAG ATCCAGTACT CAACTCTGTT GAGGAACAAG GATGTGACCC GGTGCCCCGT CTCTTTCCTT GTTCTGTATC TGTTTGCCCG GTGAGCGGTT TTGTTTCTGC CGCCTTATGA AGTTGCTGTC CAAATGATCT GATTTGGTCT ACTTAGGTTC CATTTCAGTG AAGAGCCCTT TATTAACTCA GATGTCTCTT TTCCCTCTTT AAAAAATCGC CAGGACTGGT ATCACATCCC ACTCTTTGTG TCCCGCCAAT CCAACGCAGT TACGCGTCTG AAATATGACG CTTTGAACAA GAGTGTGCGA AAAGCACTTC AAAGCTGCAA CATCCATTGT AGAGCGTCCA CTCACACGTC TCGCAAATGG GGAGCCCAGC TTGCCGAAGA TGGCGGCGCG CCTGAAGAAG ATATCATGAG GCAAGGAAGA TGGTGTACAA AGGTGATGGA AACTGTCTAC CTCAGTAAAT TCCCTCTCAA AGCATTGAGA GCCCTTGCAG GGTTCCCAAA AAAGAAAGGT TCTTATTACC TTCCTCGCGA CATGGAAGTG CCTCAGGAGC TTATCGAGAG CGTCTTCCCA TGGGTTGATG CTGCGTTGAG TCATATTCCG TTACATGTGG AAAGTTAATT AACTGATGCT GAATATGCAG TGAGGCTGAG CTCTTCGACC CCGATCGCTT CCAAGGCGAC AAAGCTGGTC GAGCATTCAT CAAACTCATG GACTGGTTTC GCTCGGTATT AATCCAAGAT GCTCCCTTTA TTCGCCAGCT CGAACCTGAC CTCTTTGTCT GGAAACACCC TGTCTTCTCC ACACCCACTT TCCTTGCCTT TGAAGCAAGG GCGCTAGCCG AAGCGCAAAG CGCGGAAGCA CGCATGAGCG AGGATGCAAG GCAACTTATC CCTGAACTTT CCGATTATCT CTCTACCAAC TTCACTGCCT TGTTTAAGGC CACATATAAC ATCGACACCA CGTTAACAGG ACTAGCTGCG TCCGTTGCCA GCAATTCCCA ACTTATACAG GAAGAGCGGC GTGCTGAGAA ATATGATGCA CTGTTGAATG GTATTGGAGA TGCATTCCAT GCAATGGCTC GCCGCCAACA CAGCGGCAGC ATTTCATCTC GATCAAATGG TGAGCATGTT GTCACAGATG GTCATGGATG GCGTGAATGT TAGCTGATAA TATGTCAGTC AACGCCCAAG AGAGCCAAAC GACCAGTGTA CTTGAAGGTC AACACCATGG GGGGTTTAAC CCATCCGCTT CTTCCAACTC CGTCGCTTCT CAGCCCAACT CCGCTAGCGA CTCTGGTGAC CTAGTTCAAC TCGAAGCATT GGTCTACAAG ATGGATCGCG AGGTAGGAGA TGTACTAGAA TTATGGGATG AGTACATCGT TGGAAGGAAT GGTCGATTAC CTGTCAGGGA AATGAGTCAA CGAAACGAGT TTAAAAAGAA CGAAGCCGAA AAAAAGATGT TCAATCGGAG AAAACCTATC TATGAGGCCA TCAGAGACCT AGCTCGGGGG ATGAACATGG GTGAGAGGGA AGCTGCGGGG TTAATTGAGG AGTATAGGAT TAAAAACTCG ATGGGGCTAA ACAAGCTGAG CAATGTCGTC AAGGAAGTTG TCAAGAACAT GATTGTGCAT CAGTCTTATC GATATCGAAC GTTGTAACTG TTTTAAATTG TAACCATATG ATAGTTGTAC TTACGTTCAT GCAC
|
Protein sequence | MPATRRTKGL DAFQPADTLD PRTAHTMQDI AALRKSNLTL KTARQYATKN KQWYAYCAHM NFLTKDTVTP ANAASYLYNW VLKLPPDHHR KKQAHILVRE AQGGQVPQTV RGQEQGQTAL FIEEDEEVDD SMLAGIDEPA DPDTVQDERQ MLLNGVNDLP DFKELRESLG YHGADSSLEH LLQCNQSLAS VRLYLAALVD LWETQRQAGM NAFPSPRTKA TNSILNALRR VRNEQSILRC DDKGDDLFYD GIATTENMKK LFLHYLHRDS VEGLRDLAAQ AVGIHGLLRA DDQLRITLSS MSLRLFEDEG PTPCRGVVFA IREGKTTHDG QIQYSTLLRN KDVTRCPVSF LVLYLFARFH FSEEPFINSD VSFPSLKNRQ DWYHIPLFVS RQSNAVTRLK YDALNKSVRK ALQSCNIHCR ASTHTSRKWG AQLAEDGGAP EEDIMRQGRW CTKVMETVYL SKFPLKALRA LAGFPKKKGS YYLPRDMEVP QELIESVFPW VDAAEAELFD PDRFQGDKAG RAFIKLMDWF RSVLIQDAPF IRQLEPDLFV WKHPVFSTPT FLAFEARALA EAQSAEARMS EDARQLIPEL SDYLSTNFTA LFKATYNIDT TLTGLAASVA SNSQLIQEER RAEKYDALLN GIGDAFHAMA RRQHSGSISS RSNVNAQESQ TTSVLEGQHH GGFNPSASSN SVASQPNSAS DSGDLVQLEA LVYKMDREVG DVLELWDEYI VGRNGRLPVR EMSQRNEFKK NEAEKKMFNR RKPIYEAIRD LARGMNMGER EAAGLIEEYR IKNSMGLNKL SNVVKEVVKN MIVHQSYRYR TL
|
| |