Gene Information |
Locus tag | CNE04080 |
Symbol | |
ID | 3257735 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | - |
Start bp | 1148387 |
End bp | 1151969 |
Gene Length | 3583 bp |
Protein Length | 908 aa |
Translation table | |
GC content | 50% |
IMG OID | 638256990 |
Product | conserved hypothetical protein |
Protein accession | XP_571062 |
Protein GI | 58267812 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTCCAGGCGG TGGCAGGCCC CAAGAGGTAA ACAAGAAGTT ATTACTTTAT TAGGCCAAGC AGCGGACTAG GCCACTGCGA AACTTTATCA CTTGCCCCCC CGTATCATGC CATCACCATC GCAGTGCCTT TTTGAGGACC AGGAATATGA CGCTCGTCAT AATCCAATTA TTGAAATGAT GTCATGAAGC CAACTTCCGT TCCACTATGG TAGCTTTCTT CTTTCCTACT TAAGAACAGT GCCACCTTCC CATCCTTCCC CCTTTTTCCT CACGATTCTC AGGTTTTTGT TTACACATAA CAACTTGTCT CTTTACCCAT CAATACCACT ACTGTAGCTG TGAGTTTTCT CTTGGTCGGA CCTCCTTGCA AGTGAATTCC TTCTCACATT TATCCTTCCC AGATTACTTT CCCACAATGA GTGTAGCAGC CACCAGATCC GAAATTTCCC ATAACCCCGG CAAGATGAGT GTTACTAGTG CCGTCTCCGG GGAGAAGGTT GAGGGCGATG TTCAGTCTAG AATGAAGTTG TTCGGTGCGA TTCAGGCCTT TAGGGATGGG TGAGTTTAAA TATCTCGATA CCATTGATTT TGCAAAATAC TTAAGATGTC TGTGTTCCGC AGTCGTATGC CTGACAATGA ACAAATCGGC TCTGTTCTCG AATATGCCAT CGGCCACTCC CCTGTCGACC TCCAAAAGCT CTCACCCGAA GGACGAGTCC TCATCGATGA CTTTAGGGAC ACTCTCGAGA CTCTCCGCAT GATCGTTCAC GAGAAGAACA CTGATGAGCT TTTCCAAAAT GCTGTCTGGT CATCATATCA TAGTGATGTT TCAAAAGCTA AGCAAGATGG TGTCATTCCC GTCAGCAGTG AACAGGCCAA GCAAGATGGC AAAACTGGTG AGTGAACTGG GTTTGCCCAT TTGTCTAGGA ACTCGACTAA CATTTATTTT GCAGCCGCTT CCCACATTCG TGTCCTCATC ACTCTCTTCC TTACCAACTC TGAAGCCCGA AAACTTCTGA AAGACTTTGG CATCGTTGGC CGAGATATCT TCGCTACCGC CGCGACTAAG GCTGCCGACA AGTCTCGACC CTCTCAAGAA AAGCTCGACT CCGTTGACCA GGAAGCTCCT TCTCATGAGT GGATCGGTGC CGATGGTAAG AGGCTTGGTC CCAACGAGAC CCCCGACATC CAGCTCAAGG GTCCCAAAGG TACCCAGGCT AGATACCACC CCAGAGACGA CCCTCGAGAT GCCCAGTGAG TTATTATTTT CATACATTCT TCGCCTTCCT ATTTGCTGAT TTGTTTTCTA GGCTCATTGA CGACAAGGGC AACTCTCGAT CTGCTGGCGA GGCCTATAAC CAGGCTCAAG AAGCCAAGGC TGATGCCCAG TCCAAGGCTC AGGACCTCAA GTCTTCCGCT AGAGACTACA AGGAGACTGG CAAGCAGCAG GCTCGTTCCC ACGCTCAAGA TGTGGCCGGT AACCGTGACC CCAATGCTTC ATTGTCTGAG CAGAAGGAGC AGGTTAAGGG TGCTGCCTAC GACAAGAAGG ATGCAGCTAG TGCCCAGGCC GGCCAGAACC TTCCCGATCC CAATGACGAG GGTAACCAAC AGAAGGCCAG GGGCAAGGTT GCCGAGTTGA AGGACCGAAT CCCTGATGAA CACAGGCAAA AGGCGGCCGA CTACATTCAA AAGTCCAAGA ACTTCGTCAA CGATGAGTTG CCCGAGGAGA GGAGAGATCA GTTCATCTAC AGACTTAAGA AGGTAAATAT CTTGTCTCTT TTGGCACCAA CAGGACAGCA 
TGTTTACCCA ATATTTTTAG GTTGTCGTCG AATGCCAGGG TCACAAGGAC TACCAGGAGG CCATGACTTG GCTCCTTGAC ACCCTCGAGA ACTACCGAGG TCACGCTAAG CACGTTACCA ACAAGGGCAC CGAGTCTGCC CAAACCGTTT CCAACGACCC TGCCGTGGGC GACTCCACTA TCCAGTTCCG AACCCTTCTC GAACGATTTG CCAACGGCAA GTCTCTCGAC AACGTCTTTT CTGCTCTCGA CCAGATCTAC ACCGACGTTC AGAACGACTC TGAGCTCCGC GAATGGTTCA CCACTTTCAA CGACTACATG CACCGAGTGC TTCTTGAGCC CGGTTACATC CTCGACGAGG ATTCTGACCG TGAGGCCAAG CAGCTTCGAG AGTCTGGCAG AAGATTCTTC CAGGAGAAGT ACAAGGCCCA CCAGGAACTT CTTTTCGACG AGCTCCAAGT CTGGCTCACA GCCTTTGGCG AAGACCCGCT CAACGTCCGA CTTGGTGACG ACATCAAGCG ATTCTTCAAA GACTTGCTCT TCAACCACGA AGGTAACCTT ACTTTCAAGC CCAAGCTTTG GAACGATGTT CGACAGGTCT TGATCCCTAT GCTTCTCAAG CAAGTCAGCT ACGTCCCCAT TCCCCGCGCT GAGTACTCCG ACAACAGCAT CGACTTGGTT ATTGAGGACT TGATCCTTTC TGGTCCTAAC CTTTTCCCCA ACATTGTCCA CATCGAGTCT TTCAACTCCT TCTCTTTCAG CCCTTACCCC AAGCTGAACA AGACGATGGA CAACCAGCAC CACAAGTTCA GGTTGAGTCT CAGTCAGATC CAGGCCGACA TCCGAGATGT TGCCTTTGCC TTTAGGCGTA AGAGCGGATG GCCCAAGCTT TCTGACCACG GTCTTGCCGA TGTTATTCTT GCTGGAAAGG GTATCTCCGT TGACGTCGAG CTTGAGTCTA TCGAGAACCG TCGGGACACC GTCTTCAAGA CCAACTTCAT TCACGTCAAC ATTGACACTC TCAAGTTCGC CATCAGGAAC TCCAAGCACG ACTTACTCTA CAAGGTGAGC ACTAGTCAAT AATAACAATG GGGTATACTG ACTTGATTAC ATTACAGTTT ATCAAATCTA CAGCCACTGG TCTTATTAAG AGGGCTATTA CTGCTGCTGT GCAGAATGCC ATGCACACCG CTCTTGGTCA CCTTGATGAG CAACTCGTTG AGATCCGAAA CAGGGTCAGT CTATCACACT ACATAAGATT TTGAAGTATT GCTGACACAA TTTTTTTTTC AGGTCGATGA GGCCAAACAG TCTGATGAGA CAACCCGAAC TGAAGCTCTC AAAGACCTCT ACTCCCGCAA GAAGGAGAGT GCCCAGGAGA AGAAGGCTGC CGCCGACGAG AAGACTGGTA CCTTCAAGAT TGTAACCGAC CGTGACAGTC AGCTTAACCC CGAGCTCACC CATGACGGTG GCAAGTCTTG GGCCAAACGA GCGTTTAAGG TTGAGGACGC TGCTCGTACG GGTAAGGAAT GGAGGTCACC GGCCTTCAAC CTTATTGACC CTGCCCACCC TGCGGTGACT GGCCAGCATC ACCCTGCTGT CCAAAATGCG GATGTCGATG CGGAAAGGTT GAAGCAGAAG GCTGAGGCTG CGGCTCCTGG TGTGGCTGCT GGAACTAAGA GGCTGTAAGA TTTACCGGGT TTAAGTAATG TAGAATATGA TCAGAAGTTT GTTTGTATAT TACCCGTATA AAATAATTTA ATCGATGAAT ACGCTTGAAT CAA
|
Protein sequence | MSVAATRSEI SHNPGKMSVT SAVSGEKVEG DVQSRMKLFG AIQAFRDGRM PDNEQIGSVL EYAIGHSPVD LQKLSPEGRV LIDDFRDTLE TLRMIVHEKN TDELFQNAVW SSYHSDVSKA KQDGVIPVSS EQAKQDGKTA ASHIRVLITL FLTNSEARKL LKDFGIVGRD IFATAATKAA DKSRPSQEKL DSVDQEAPSH EWIGADGKRL GPNETPDIQL KGPKGTQARY HPRDDPRDAQ LIDDKGNSRS AGEAYNQAQE AKADAQSKAQ DLKSSARDYK ETGKQQARSH AQDVAGNRDP NASLSEQKEQ VKGAAYDKKD AASAQAGQNL PDPNDEGNQQ KARGKVAELK DRIPDEHRQK AADYIQKSKN FVNDELPEER RDQFIYRLKK VVVECQGHKD YQEAMTWLLD TLENYRGHAK HVTNKGTESA QTVSNDPAVG DSTIQFRTLL ERFANGKSLD NVFSALDQIY TDVQNDSELR EWFTTFNDYM HRVLLEPGYI LDEDSDREAK QLRESGRRFF QEKYKAHQEL LFDELQVWLT AFGEDPLNVR LGDDIKRFFK DLLFNHEGNL TFKPKLWNDV RQVLIPMLLK QVSYVPIPRA EYSDNSIDLV IEDLILSGPN LFPNIVHIES FNSFSFSPYP KLNKTMDNQH HKFRLSLSQI QADIRDVAFA FRRKSGWPKL SDHGLADVIL AGKGISVDVE LESIENRRDT VFKTNFIHVN IDTLKFAIRN SKHDLLYKFI KSTATGLIKR AITAAVQNAM HTALGHLDEQ LVEIRNRVDE AKQSDETTRT EALKDLYSRK KESAQEKKAA ADEKTGTFKI VTDRDSQLNP ELTHDGGKSW AKRAFKVEDA ARTGKEWRSP AFNLIDPAHP AVTGQHHPAV QNADVDAERL KQKAEAAAPG VAAGTKRL
|
| |
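The Start bp, End bp and Gene Length rows above are consistent if the coordinates are read as 1-based and inclusive, which is an assumption here, not something the record states. The sketch below checks that arithmetic and shows one way to compute a GC percentage from a sequence printed in space-separated 10-base groups, as in the Gene sequence row. The function names `gene_length` and `gc_content` are hypothetical helpers written for this illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between base groups is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp rows of this record.
    print(gene_length(1148387, 1151969))  # 3583, matching the Gene Length row

    # Only the first two 10-base groups of the Gene sequence row are used here;
    # pass the full sequence to compare against the GC content row.
    print(gc_content("GTCCAGGCGG TGGCAGGCCC"))
```

The strand value is not used in the length calculation, since a feature spans the same number of bases on either strand.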