Gene Information |
Locus tag | CNA07770 |
Symbol | |
ID | 3253596 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 2143858 |
End bp | 2147495 |
Gene Length | 3638 bp |
Protein Length | 847 aa |
Translation table | |
GC content | 51% |
IMG OID | 638253100 |
Product | conserved hypothetical protein |
Protein accession | XP_567124 |
Protein GI | 58259423 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
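The coordinate fields above are consistent with 1-based, inclusive positions: End bp minus Start bp plus one gives the listed Gene Length. The sketch below checks that arithmetic and compares the span with the minimum coding length implied by the Protein Length. The function names `span_length` and `min_coding_length` are hypothetical helpers, not part of any database API, and the assumption that the coding region ends in a single stop codon is ours, not the record's.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein: 3 per residue, plus one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(2143858, 2147495)  # Start bp, End bp from the record
cds_len = min_coding_length(847)          # Protein Length from the record

print(gene_len)            # 3638, matches the Gene Length field
print(cds_len)             # 2544
print(gene_len - cds_len)  # 1094 bp not accounted for by coding sequence
```

The difference is what one would expect for a gene marked as spliced: the genomic span can include introns and untranslated regions in addition to the coding sequence.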
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.622601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATCCATCCTT CACGCCGTCT CCTCATTCCC CTCCACCTCC CGCCCGACAC TTCCACACCG AACGCCGATT CTCTACAGAC AGTCCTTTAA CACCCGACTA GTTGCACTGC GTGCACGAGC CTCTGCCTTT GGCCCTTTAC CATCAAATTC CTCAAGGACC CCTGTAACTT GCTATAAACA GGTGAGTCCT CTTTTCTGTT TGCCTCTTTT TATTCTTCCT CATTCTTTCT TCTTCCTTCC TTCTCCATCC TCTTTCGCGC CGCCCCCGTC TCCTGTGCCA TCATCGCGCC TTTCTCCCAA CAGCGCAGAA TTCTTCCTTC CTCTTGGCAT CATCCACTTG GCACTATCCA CTATTACGAC CCATCATTAC GAAAAAGTTT CGAATTGTTC CCCTTATCCA GATCGCTCAC GCCTTTAACG TCTGATAGTT ACGACCCACA TTCATTCTCT ATCACGATTC GTTCAAGCGG AAATCACAAA TCAACTTCTT AACCCTATAA ACGAGGCTGG TACTCGTCTG TCTATCATCG ATCATGTCTG CCTTTACCTA TACCGCCGCT CTCTTTTCCC TTATCTCCCT CCTCCCTTCC GCGCTCGCCG GTCCGACTGC TGGAACTACC TACTCCGTTT CTCCTAACCA ACACCCTTCC ATGTGTCTCG CTCCTGCCAA TAATTGGGAA GGAGCAGATG TTGTGCTCAA GGACTGTGAC GAGGATGACA CCACCTGGTT GTGGACCGGC CAATCGTTTC AGAACACGGC GACCGACCTC TGCATTGATA TCCGAGACTA CGGGGCGTGG TCAGGAAACA AGGCTCAGGT TTGGGGCTGC TTCTCTTACA ACACCAACCA GCAGTTCACT GTTGAGGAAT CTATGATTCA TTGGGACAAC TTTTGCTGGG ATTTGACAGA TGGAAGCTCT TCGGCTGGTA CGATGCTTCA GATCTGGAGT TGTTACAGCT ACAATGACAA TCAGCAATGG ACGCTTACTG AGATAGAAGA GGTGGATGAG TGCGATGCCA GTAAGTTATT CCCTCTTGAC ATTCAAGCGT TTTATCAAGT CTAATGGCCG TCGCAGCATC GGTCACTGAA ACTGCCACCA TCATGTCGAC TGCTACTGCT TCCGTCTCTG ATCTTTCTAC CGCCACTGCT TCTGTGTCTG CATCAAACAT CACCGAAGCT GTCACCGCCA CCGAATCGCT CACCGCGTCA GTCAACGCTA CGGACCCTTT CTTCACCGCG TCAGCCACCG ACTCGGGTTA TCAGGTCAAC GCCACTGCCT CTGCGACTTA CTCTAGCTAC GACGTTAATG CTACCGCCTC TGCCACTGAC TCTGGCTACG AGTCCATCAA TGTTACTGCT TCTGCTACCG AATCTGGCTA CGAGTTTGTC AACGCGACGG CCTCTGCTAC TCTCTCCGCC GAGACTTCTA CAGCCACCAA TAGCAGCATC GGTGAAGGTC TTTGGTCTCC TCACAAATCT TCTTCCGTTT CCTCCGATGA CTGGTCATCC GAGACTGCTA CCCGTTCCAA CACCGAGTGG TGGGCTACTT CCACTGGTTC CGACTCTTGG GCGTCTGCCA CCGCCTCTGC TTCCAACCCT GGGCAGAATG CTTCCCAGTC TGACTCCTGG AACGCCACCA GCACAGCGTC CAACCCCTGG GAGACTGCTT CTTCTCAGGC CTCGAATGAG ACCTCCACCG ACTCTTGGGG TGCCTCTGCT ACTGCCACTG CTTCCCAGTC TGACTCCTGG GACGCCACCA GCACAGCGTC CAACCCCTGG GAGACTGCTT CTTCTTCTCA 
AGCCTGGAAT GAGACCTCCA CCGACTCTTG GGGTGCCTCT GCTACTGCCA CTGCTACCGA ATCTGGCTCT TACGGGAATG CCACTTCAAC TTCCACGTCT TCCGCCATCA CTGCCACCGC TACTGTTGGC ACCATCTCTT CTGGCTACCT CCAGACTAGC GGCACCAAAA TTGTCGACTC TGACGGCAAC GAGGTGATCC TCCGCGGTAC CAACATTGGT GGCTGGCTCG TCCTCGAAGA CTGGATGTGT GGTATTACTG ACACATCTGG ATCTTCCGAC CGATTCTCTC TTAGTACTCT CGAGAATCGG TTTGGTACTG ACCAGGCCAG GACTCTTGTT GAGGCTTGGG CTGAGAACTG GTTGACTACT TCTGACTTTG ATGAGCTTGC CGCCATTGGT TTCAACGTCA TCCGTCTTCC CTTCTCTTTC CGAACTGTCC AGAACGCCGA TGGCTCCTGG AGAGACGACG CCTTCACCCG TATGGACTGG GCAATCAGTC AGGCCAAGGC TCGTGGTATC TACACCATTG TCGACTTCCA CATGTGGTCC GGCCAGGAGG CTGACTACTC TGCCATCTCT GAAAACACCG ATGAAGGACA GAGCCAGCGA GATGCTGCTG GCGAAATCTG GAAGAAGGTT GCTACTCATT ATCTCGGCGA GAGCAGCATC TGTGCTTTTG ATGTTATCAA TGAACCTACT GGTTCTTACG GCGATTATCT CCAGCAGGAT CTTTACAATG CTGTAAGGTC TGTTGATGCT AACCGTATCA TCATCGTGAG TGCTTCATTG ACTAGATACG AGTATTCACT AACATTGTCC CAGCATGAAT CAATCTCTAC CGACCCCTCT ACCTACGGCT GGACCAATGT CATCTACTCT CTTCATGAGT ACGACATGAT GGGCTCTGAC CTCTCGTCCA ACCAGGCCAC CTGGACTAAT GGTGTTCAAG CTTACATTGA CTTGTGGCAC GGCTATAACA TCCCCTTCAT GCTCGCCGAG TTCATGGCCG ACGGGTAAGT TGAGAAACAA TAATTAGACA CATGGACGTT GCTGACATCA TAACAGTGAA ACCCTTGACT ACATGCTAAA CTCTATGAAC TCTCAAGGCA TTTCTTGGCT CACTTGGGCT CACTCTACCG TCAACATGGG GCGATGGGGT ATTTGGAACC ACGAGGCTTT CAACGTTGAT GTTTCTTCTG ACTCTTACGA CACCATCTAT AGCGCCTGGA CCAACATGCC CAGCACTTTC CACACCAGTA TTTACGACCA GATGAAAACT GCCGCTACTG GCTCTACCAA CGTCAGCAGC AGGAAGCGAG ATCTCGCCTC TGCTGCGAGG GCTACCAAGC GCTTCCATGG TAGCCATGGT GGTAGGTCAA GAAGAAATGG TATGGCCCAC GCTGTTAGGG GTGCCGCTGG TGTCTCAATA TAGGCGAGAG GGAGTCGCTT CTTATTTTCA TCCATCTTTG AGAGAGCATT TTCTTCATTC ACGACATGAT CTGTTTTATT ATCTAGGGTT TATAGAAACG CATTGTTTAG CATTTTTTGT TTGTTACTTA GGTTTTCATA CCTTTCTCCC ATTCGCTCCA CTCATAACGC TTGTTTTGCA CTGCGTGCAA AGCAAAGCGG TATCGAGGAG GAGATGGGAT CGTGTAGAAC ACATTGGATG GACCTCGGGG GTATCTTTTT TATAAATACT TCGATTACTC TTAATACGTA CGGGCTACGG CGACAGAGGT CTCTTTGATA GGACTGTTGG GCCCTGCTGC GCAGGTGCTA CGTATTTGTA AGACGTAGAC GTCCATAGAT 
CTGGATCTAT GCACCACCCT CTCCTGTG
|
Protein sequence | MSAFTYTAAL FSLISLLPSA LAGPTAGTTY SVSPNQHPSM CLAPANNWEG ADVVLKDCDE DDTTWLWTGQ SFQNTATDLC IDIRDYGAWS GNKAQVWGCF SYNTNQQFTV EESMIHWDNF CWDLTDGSSS AGTMLQIWSC YSYNDNQQWT LTEIEEVDEC DATSVTETAT IMSTATASVS DLSTATASVS ASNITEAVTA TESLTASVNA TDPFFTASAT DSGYQVNATA SATYSSYDVN ATASATDSGY ESINVTASAT ESGYEFVNAT ASATLSAETS TATNSSIGEG LWSPHKSSSV SSDDWSSETA TRSNTEWWAT STGSDSWASA TASASNPGQN ASQSDSWNAT STASNPWETA SSQASNETST DSWGASATAT ASQSDSWDAT STASNPWETA SSSQAWNETS TDSWGASATA TATESGSYGN ATSTSTSSAI TATATVGTIS SGYLQTSGTK IVDSDGNEVI LRGTNIGGWL VLEDWMCGIT DTSGSSDRFS LSTLENRFGT DQARTLVEAW AENWLTTSDF DELAAIGFNV IRLPFSFRTV QNADGSWRDD AFTRMDWAIS QAKARGIYTI VDFHMWSGQE ADYSAISENT DEGQSQRDAA GEIWKKVATH YLGESSICAF DVINEPTGSY GDYLQQDLYN AVRSVDANRI IIHESISTDP STYGWTNVIY SLHEYDMMGS DLSSNQATWT NGVQAYIDLW HGYNIPFMLA EFMADGETLD YMLNSMNSQG ISWLTWAHST VNMGRWGIWN HEAFNVDVSS DSYDTIYSAW TNMPSTFHTS IYDQMKTAAT GSTNVSSRKR DLASAARATK RFHGSHGGRS RRNGMAHAVR GAAGVSI
|
| |
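The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for turning such blocked text back into a contiguous string and computing a GC fraction follows. `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the Gene sequence field, so the result is not the record's overall GC content.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace (block spacing, line wraps) and uppercase the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the Gene sequence field, used only as sample input.
example = "ATCCATCCTT CACGCCGTCT"

print(len(clean_sequence(example)))  # 20
print(gc_fraction(example))          # 0.55
```

Applied to the full Gene sequence field, the same functions would give a length and GC fraction that can be compared against the Gene Length and GC content fields.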