Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA07840 |
Symbol | |
ID | 3253651 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 2164497 |
End bp | 2168368 |
Gene Length | 3872 bp |
Protein Length | 1117 aa |
Translation table | |
GC content | 53% |
IMG OID | 638253107 |
Product | hypothetical protein |
Protein accession | XP_567131 |
Protein GI | 58259437 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGCGG AAGAACGGCT TGATCGGAAA TTGTGGAATG CAGATATCCT AGCGGGATAT TCGTTGGAGG AGAGGGTGAG GAGGGAGTGG GATAGGCGGA AGCTAAGGGA GAGGGAGCAA TCGCAATCGC AATCGCAATC GCAAGCACAG GGGCAAAACG AGGAGGAGGA GGGGGGGCGA GAGGGTGAGC TCGTACAGGG GACAGAGGTG GGAATGGGGA CTCTCTCAGA GAGGCAAGTG TCTGAAGCGA CAGACGCGAC AGAGATGATG GAGGAGGGTC AAAGGGCGGA AGGGGCGGAG ACAGCGGAAC CCAGACTTTT GAGTCCAAGT TTGGAAAATG GCTCAGGCTT GGGCTTGGGT TCGGGCTCGG GGTCGGGCTC GCCAACTGTG CGTTCGAGAT CTTTGGAAGG TTGGTCTCCA GATGTGGAAC CGGGATCCGT ATCCAAACAC GAGTATAAGC ACGAGCGCGG GCCCAAGGCT GAACATCAAC TTGGATCAGA TCCACGACCA TCGCCGCAAC TCCAGCACAA TCAAGACCAA AATCAAGCGC AAGCGCAAGA CGCGAAAAAG CATTCAGAGT CCTCTTCCTC CGAGGTTGGA GCTCCAGCTG ATGCTGGACA GCCCTCTTTC AGCAGGAAGC AGCCTACAAG CACAGGCACA AGCACAAGTA TAGCGCCATC CACCGCACCG CTAGCCGCAC CTTTACTCTC CCCACCAATA ATTTTGCCGT CTCATGAGCA TGAGCATGAG CTGCCGCAAC CATCTACCAC CAAGCCCAAG GAAAAAGTGG GGAGGAAGAA AAATGCTAAT GTCGTTGATG TTGGCGCAAG CAAGGCTGCT GATCAGCAGT CAACCGGGAT TGAACTTACC ATTCCCACAC CCACACCCAC ATCCACATCT ACGTCCACAC CCACATCCAC ATCTACGTCC ACACCCACTC CTCATTCGAA AGAAAAAGAA AAAGCACAGG CTGCTCAGAC TCAAATTTTA GAGGAAAAGC AATCTCTTAC TGCAGAACAA CCTCCCGAAA GAGTCAAAGA GAAGAAGGAG AAAGAAAAGT CCAAATTCTC AGCTCCCGAA TTATCGTCGT CCTTACCAAA GGAGAAGAGG AAAGACAAAA GAGCAATAGA GAAGAAACCT TCTCAGCCTT TGCAACCTTT GAGACCGCTT TTCCACAAGA TCCCTTGGGG CAGCTCGCCC AACGTCGATA CGGCTGGTGT TGGCGTCACA TCATCTCAAG ACAAGCCCTC CGGTCGTTCC ACTCGTCTTG GGGAAAGTGA AGAGAATGTG CAAGCTGCTG GAGCGTCATT TGAGAAGAGG AGAGAGAGTG AGGGACTGCA AGGAAGGAGG TTGAATGAGG AAGTGCAGAG AAGGAAGAGT GAGAATGATG TTGGGTTTAG GGTTGGGAGC CCGAGAGAGG CAGGTAAGGG GGAAAAGACG GGATTACCGC CTAATAGGGA GGCAGCGTTG AAGCGAAGGG ATTTGCAACT CCCAGCACAG ACGCCCAAAC TTCAATCGCG ACCGGCTCAA CCCCAGGGGT CTACGACGCA ATATAAATCT GCCTTGCCAC GATTGACACC ACCTGTGGAA CAGCCCGTAC AAACCGCCAA ATCCAAAGCT GACATTGCGC CACCTGTACC AGCCAAGGCA AAGGTGGCGG AGGACAAGCA GCCCAAGGAA GAAAAGCTAC TTCAGCGCCT CTCAGGACCT TTGGTCGATA TTGATGATCC TGAGCCTAGA CTTGTTCCCA AGTTCCAATC CCAGGCCCCA TCTAGCACCT CCACCACTCA ACCTCAATAC CCCCCAATTA CCCAAGTCGG AGTTAATCAA GAGGCGAAGG GTAAAAAGGA TAAACGTGTT TCTCGAGAGA GTATAACGGC GCTGGCAGCT TCGTCGGCTG ATCTTTTACA GCTCCTCGAG TCGTCTGCGG AGGAAGATCC TTCCAACTCT AATTCGAGCT TGAAGGATGT GAACCAATCG AGAACAGAGG AACCTGCTCG AGATATGACA GAGGGGACCT TGGGCATAGG TACTTCCACC ACTCTTACGC CTTTTTGCGC CGGTGCTGAT GCGAAATTAG AAACGACGCC TAGCCGAGGT CCATCTTCTC TAGGCGCGAT GGCAGCTGCA ACCAAGAAGA AGGTCCCTCC GCCTCCACCA CCTCCTGTTT CGAGATCTAG GTTGCATTTA TCAAAGTCAA TGGGTACCTT GCGTCCTTCG GAAGGGGTCG AACGGGGGAA ACAAGATGGA CAGGCTACTG GTGAGGACGG GACACAGAAT GCACGGGTAC CTCGTACTCC TCCCCTACCG CCTCAGTTAC CTAAACGACG TCCACCTCCG CCTCCACCGC CTAGGCATCC TGCGGCTATC GCCAGACTCC CACCCCCTCC TCCTCCTCCT CAATCACAAG CAGAACCACG AGCTAGAGCT CCACCTCCGC CCCCGCCACT CCCGCCTCGT CCAAGGCCTT TGAGCGGCAT CTCATATCAG ACTATATCTT CTGTCTCTCA GGTTTCGTCC GCTGGTGCGC CTGCGCACGG GCAAAAAGTT GATACGGCTC TGCAAGCTCA ACGACAAGCA CAAGCTCAGG AGCCAGAGGA AGATGAAGAA GGAGATCTCC CAGGAAGTGA AAATGTTTCT CCAGGTCTGG TGAGGCGACC TTTAGGTCCT CGCCCCGCTC CTCCTCCCCG TCCTACTTTA CCTTCTCGAC TCAGGTTGTT TGGAAATAAC CAAACGAACT CTGGCCAGAG AATGGATTCC CGGATGGCGG ATGTACCTCA GAGTCAGAGA CTGGGCCAGG AAGAAAGAGT TGGAGAAATG GATAGTGGCT CGAAGTTTGT AGAGCACATT GAACGAGGAG GGATCCCACC TCCCCCTCCA CCTACTCTGC AAACTCGTGC TCGGGAGACA CTTCGACCTA CGCCTGTGTC AACCCTCACG CCCACCCCAA TCCCGGCTCT TAGGCTGAAC AACTCCACGA CAACCAATGG TTCCCGCCCA TCCCCAGTCG AAAGATCACA TAGCGATTTC CCCCCTCTCC CGACCCAGTC TCAGCTGCGG ATACATCAGA TGAACGAGAG GGAGAGCGAT AATCGATGGG CGTCGACTGT TGATTTAAGG GAGAGGCCTT TAAGTCCTGC GACTTCTGCG ACTCCTGTGA TTGGCAGCCG GGGCGCGGGA GATGGGGATG AAAGTAGGAT AAGCCCGGTG GAAACGCCTA GACGTGAAGA AGGTGAAGGA CCCGGTGTGG GGTTGCACAG TCAAGCACAG GGACAGAGGG GGATAAGAGA GTATACGGAT TTGGACCTGT TTGTTTCGAG GTTGGAAGGA AGTGGGCGCG AGTATGAAGT GAGTCCGTCT CCCATTGTCA TGTCTTGTAA AGTTTAAAAG TCAAAATGAC AGTGGCTGAT GTGATTGACT TCTTTTTTTC TTTCTTTTTT TTTGGGAAAA CAGGGTTATT CACAACTAAC GACCTTCCTC GGCCCTTCTA AACCTACTGC CGCATCTCCC GAAGCGATCT CCACCCTGCT CCCAGGACTC ATCACCATCG ATTCTCGCCG TACCACGCCA CAAGGTAAAG TGAAGCTTAA ACTTTCGCTC TTGGGGGTCA GAGTGGGTAA ATGCCCAATC TGCTTGAGTC AGTTTAGGGG GGGTGAGAAA GGGGTGTTGA CACCGACTTG CGTGCATGCG GCCCATCAAA GTTGTGCATT AAGGTGGTTT AGGGAGGATA GAAGGTGTTT TGTCTGTAGA GAGATTTTGA AGGAGGAAGA GTAAGGAGTC AGAGTTAAGG GAGATGGAAG AATGGATCAG TTAGATATTG TAGCGATTAC TGAGAAAGTG GGATAGTATA ATTAGCATCA TGTATATTTT GA
|
Protein sequence | MTAEERLDRK LWNADILAGY SLEERVRREW DRRKLREREQ SQSQSQSQAQ GQNEEEEGGR EDPRPSPQLQ HNQDQNQAQA QDAKKHSESS SSEVGAPADA GQPSFSRKQP TSTGTSTSIA PSTAPLAAPL LSPPIILPSH EHEHELPQPS TTKPKEKVGR KKNANVVDVG ASKAADQQST GIELTIPTPT PTSTSTSTPT STSTSTPTPH SKEKEKAQAA QTQILEEKQS LTAEQPPERV KEKKEKEKSK FSAPELSSSL PKEKRKDKRA IEKKPSQPLQ PLRPLFHKIP WGSSPNVDTA GVGVTSSQDK PSGRSTRLGE SEENVQAAGA SFEKRRESEG LQGRRLNEEV QRRKSENDVG FRVGSPREAG KGEKTGLPPN REAALKRRDL QLPAQTPKLQ SRPAQPQGST TQYKSALPRL TPPVEQPVQT AKSKADIAPP VPAKAKVAED KQPKEEKLLQ RLSGPLVDID DPEPRLVPKF QSQAPSSTST TQPQYPPITQ VGVNQEAKGK KDKRVSRESI TALAASSADL LQLLESSAEE DPSNSNSSLK DVNQSRTEEP ARDMTEGTLG IGTSTTLTPF CAGADAKLET TPSRGPSSLG AMAAATKKKV PPPPPPPVSR SRLHLSKSMG TLRPSEGVER GKQDGQATGE DGTQNARVPR TPPLPPQLPK RRPPPPPPPR HPAAIARLPP PPPPPQSQAE PRARAPPPPP PLPPRPRPLS GISYQTISSV SQVSSAGAPA HGQKVDTALQ AQRQAQAQEP EEDEEGDLPG SENVSPGLVR RPLGPRPAPP PRPTLPSRLR LFGNNQTNSG QRMDSRMADV PQSQRLGQEE RVGEMDSGSK FVEHIERGGI PPPPPPTLQT RARETLRPTP VSTLTPTPIP ALRLNNSTTT NGSRPSPVER SHSDFPPLPT QSQLRIHQMN ERESDNRWAS TVDLRERPLS PATSATPVIG SRGAGDGDES RISPVETPRR EEGEGPGVGL HSQAQGQRGI REYTDLDLFV SRLEGSGREY EGYSQLTTFL GPSKPTAASP EAISTLLPGL ITIDSRRTTP QGKVKLKLSL LGVRVGKCPI CLSQFRGGEK GVLTPTCVHA AHQSCALRWF REDRRCFVCR EILKEEE
|
| |