Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA00840 |
Symbol | |
ID | 3253883 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 235743 |
End bp | 238630 |
Gene Length | 2888 bp |
Protein Length | 764 aa |
Translation table | |
GC content | 51% |
IMG OID | 638252416 |
Product | conserved expressed protein |
Protein accession | XP_566511 |
Protein GI | 58258197 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.460464 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAACT CACCTACAGC CGAGCCAATA CCGCCAGCAC CGCCTGCTGC TCAACAGCCG GTAGAGGCTC ACGAAATCCC CGTTCCAGAT GCCCCGTCCA CGACCATTCC TACAGCGCCA CTTGACAATC CCGAGGCGCA GACTCAGGCT CCCACTGCGT CGATAGACCC TAACAAGCCA TCGAAGGAAT CTGCCCCTAC TGCTTTGCCT ACCAACCCTG CTACCCACCC GGAGGAGCAG AAAGAAGCCA TCAAAGAAGA GCATCCTCAG GTCATTAAGA GGGAAGTCGA AAAGACCAAA GCTGAAGAGA GGGATTTGAA GGGCGAGAAG CCGCAGGGAA CGGTAGTGGC TGGTCTCGAA GATGACCGTC TTTGGGCCAT GCTTCGACGG TTCGATGTAG TGAGTGCTTT TTCGTTCTTC CCGATTTGGC TGTTACAGAA TTATGAATTG GGACAGGGCT GACTACGCAT TCCCAGCAAA TTACCCATGT CCTGCACCCT GCTCGTAACC TTCCCGCTGC GGAGCCTGAC CTTCGTCCCT CTACACTTCC GAACCTCCCC TCCCACACAG ATGTGCTCCG TTCCAACCTC GAGCGTTGTA TTGCCGCCGT AGGCCCTTCG TCTCTACGAG GGGCACGCGA GCTTCAGCGC CTCATGTCTT GGTCTCCCGA AGAGCGTTGG AGAACTGGGA CGTATTGTGT GGTGTACTTC ACTGCTTGGA TTTTTGGCTA CGCTGTTGCC GCAATTATGA TTTTCTTATC CATCTTGGTC TGTTTCCCTC GCACGAGGCG ATTCCTATTC CCACCGGTGC CTCCTGCACC ATTCACGCCG CCTAGTGCTA CCGACCCCAC CAACCAGAAG GGAGACGAAA GTCTCTTGGG AAACATCGAT GGAAAGACAG TACATAGGAC CAAGGCAGAA CAGGCTGAGG AACAGGCTTT TGAAGCTACC TCGATTTTGA AGGCTTTCAC CACCAGATTG CTCTTTGTAA GTGCCCTAAA CCATCGGAAG GTCTAGCAAT ATGAAGCGCT AATCATTTAT TGGTCGTAGG ATGGCAAGAA GAAGGGTAAA GAGGCTGGGA ACTCCAACGT CGGTGAGAAA GAGGAGGAAC CCGAGTCATC ATCTTCGGAG GACGAAGCTG CCGTCGAGAC TGCCCCCGTT GACGGCAAAC CCCAAGGCCT CGAATCTGCC GATATTGTTA TAGGTGGTGA GAAAATCACG CCCGCTGAGC CCTTGAATGA TAAGGAGAAG AAGAAGTTGG CTCAAAGGGA AGCGAAGAGG AAAAGGGATG AGATGGTTTC AAAAATGACC AAGCTGTCAG AGGATGGTTT GGGAACCGTT GCGGATTCTA TCGAGCGATT CGCCAAGTAA GCTCTCGCAA TTACTGAGCG GCAGAATATA ATGGTCTTAA CTTTTCATAG TGCTCTTTCC CCTCCAGGCC CTTACCCCGA CAATTTCGCC AGGTTCAAGA TTGCTGGCGC CTTCCTCATT CCTCCTGCTT TTCTTCTTAC TTTTGTCCCC GCCTGGGTTT TTGGCCGATT GGCGACATTG GGTTTTGGTG TCGGCATGTG GGGTCAGCCC TTGATCATCA AGGGAATTAA CAAATTCGTC GAAGTCGTTC CCAATTGGCA AGAACTGTTG GACATGCGCA AGTGAGTCGT CTTCAAAATG CTCTATTGAG AAGTGAGAAT TGACTCTGGT TCAATGAAAG CTCGATTCTG TCTCGCGTCC CTACCGATAC TCAGTTGACC TTGCACTTGC TGAGGGTCAC AGAAGCGTTG GGCAAGCCCT TGCCTCGACC TCCGTGAGTC ATGCGCTTGC TACTCTCCTC ATTCACACAC TTACCTTTCT CAACTCAAAT CTAGTCCTCC CCCTCTTTCA GGTACTCCCA AAGTAAGCAA AAAGTCACAG CCTTGACTTA TACCCACTAA CACACCTTCA CTCCCAATAG GAAGCAATTA AAGACACTAC ACCCGCTGCG GTCACAGCGG ATGACGACGC CGAAGTCTTG GAAGCCGCAC AAGAAGGTGC TACCGCAGAG GTAGCTGCCA AGACCAAGCA CAAGACTAAG TCGCACATTA CAGGTGTCTT TAAGAGTGCC GGAAAACACA TGGCGGGATT TAGAGGAGAC GTCAAGGTCG ATGGGGCAAG GAAACAGGTG AGTATGGAGG ATTGGTGGTA CGAGTGGCTT GGGGGGCAGC TAATAAAGTT GGGGAAGATT GGAGATAAGG TGGATAAAGT CCTGTTCCGA GGTAATATTA AGGATGATGG TAACATTGAT TGTACGTGTG TCGATTGGCA CCGATCTGCG CGATTCCACT GATAGTGCTG TAGCCTACCC CGCAAAACTC AACGGTACCT CTGGTCACAT TATTATTGAC AGTAGAGAAG AGGGCATCTC CATGCCTACA ATCAGTTTTG TTCCCGTTTC CGGTACAAAA GCGCATTTTG TACGACCTGT CGATGACATC GTCGAAATTA AAAAGGCGAG TCGGCCACAT TTCCCAGTCC ATCCTATCAT ATTTAGATTC TGACTTCTCC TTCAGAGCCA CGTTTCTATG CCTCGTATGG CGCTTGGCTG GGCTTCCGGC GCGGATGTAG AAGGATTGGG CTTGACCATT CGCTTCAAAT CAGGAGTGCA ACAGGCGAAA GAATTTGCCG CCGGTCCTGA TGCTCCGAAG GATGATGGAG AGACGATGCA TTTCCGAAGA GTTGGCAGGC GAGAGGGGTT ATTTACGAGG TTGATCAGTA TCGGCAGACA GAGGTGGGAG GTTTTGTAAG ATGGCGTTGA CCTTGGAGAA GACAATGGGT GCCAAGAATA ACTGTCTACT GCATAAATAC AGTAGCTATT AGTGATTGGT TTTTGAACAT GTAGCCCTAT ATGTGACT
|
Protein sequence | MSNSPTAEPI PPAPPAAQQP VEAHEIPVPD APSTTIPTAP LDNPEAQTQA PTASIDPNKP SKESAPTALP TNPATHPEEQ KEAIKEEHPQ VIKREVEKTK AEERDLKGEK PQGTVVAGLE DDRLWAMLRR FDVQITHVLH PARNLPAAEP DLRPSTLPNL PSHTDVLRSN LERCIAAVGP SSLRGARELQ RLMSWSPEER WRTGTYCVVY FTAWIFGYAV AAIMIFLSIL VCFPRTRRFL FPPVPPAPFT PPSATDPTNQ KGDESLLGNI DGKTVHRTKA EQAEEQAFEA TSILKAFTTR LLFDGKKKGK EAGNSNVGEK EEEPESSSSE DEAAVETAPV DGKPQGLESA DIVIGGEKIT PAEPLNDKEK KKLAQREAKR KRDEMVSKMT KLSEDGLGTV ADSIERFANA LSPPGPYPDN FARFKIAGAF LIPPAFLLTF VPAWVFGRLA TLGFGVGMWG QPLIIKGINK FVEVVPNWQE LLDMRNSILS RVPTDTQLTL HLLRVTEALG KPLPRPPPPP LSGTPKEAIK DTTPAAVTAD DDAEVLEAAQ EGATAEVAAK TKHKTKSHIT GVFKSAGKHM AGFRGDVKVD GARKQVSMED WWYEWLGGQL IKLGKIGDKV DKVLFRGNIK DDGNIDSYPA KLNGTSGHII IDSREEGISM PTISFVPVSG TKAHFVRPVD DIVEIKKSHV SMPRMALGWA SGADVEGLGL TIRFKSGVQQ AKEFAAGPDA PKDDGETMHF RRVGRREGLF TRLISIGRQR WEVL
|
| |