Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE00110 |
Symbol | |
ID | 3257899 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | - |
Start bp | 18342 |
End bp | 21324 |
Gene Length | 2983 bp |
Protein Length | 728 aa |
Translation table | |
GC content | 48% |
IMG OID | 638256593 |
Product | conserved hypothetical protein |
Protein accession | XP_571101 |
Protein GI | 58267890 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.441864 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGCATTCAA CATACTTATC CAATAATTCA CTAGCACATA CATGTCAGCT CAAGACCATC TCGATTCCCC AGAACTGCGA TTTCCATCGG CGGAGATTAA TGACCCCACA CCGCAAAGAA AATCTGGTGA ACTGGACGAT ACTAAAGTCA AGCGTGCGAC TGTAGCTTGC AACGCCTGTC GTGCTAGGAA AATCAAATGC TCGGGTGATA AACCGATATG CACGACCTGT GCTAAAGGAT CAGTACGATG TGAATATCCG ACTGTTCGGA AGCGGAAGCG GGATAGTACT AAGAAGAAGA AGGATTCTGG TAGTGATCAA CTCGATCCGG CACTGGCTGC TCCATTATTC ACTCAAACTT CAAATATGCC AGAGTCCACG GCGGGGACAT CTCCATTGGA TAGTAATCCT TTCGCGCCTC CATTTAATCC TTTCGCCGTT ACACCGCAAC ATGCTTCGAC ATCGTTCGTC GCCCATTCCA TTCCGGATTG GGCTCAAAGT CCATGGAAGA ACATTACTGG AGGCGATATA GTGGGCCAGG GGATGGACAA TGTTGATCAA AGTACCCTAG ATATCCTCTC TTGGGACTTA GGTGCGAAAT TTCCTTCCGG AACCGAGCCA GGCACTGGCA ATCATACGTC CACATCGGAC ATTGATTCCG GTCGCATGTG GGAAAATACA AGCAGTACAT CGAATGTGGA CCGTCGATCA GGTGCTCACC GAAAGGCTCG CTTCAGAGTG CCCTATTTCC GGTACGCTGT CCTATATTGC TTCTAGATCA TGCTGATGTA TCCTAGATTC TTGTTAGTTG CTCCGGTGTT TGATGAAGAG AAACGTTAGA ACTGACCTGA CTTCTAGTGG ACCTACTGCC ATAGCCCCCG GGTACAGACA AGTGGTTTGC GATGTATCCG CTCCCGTATC CCCAACTAGA GCACCTTCCG ATTTACCTTG GGATCCGATG TTGCAAAACG ATCAAATGAT CATGGCATCC GATGGAGAAT CGCCCGTAGG TCCTGCATGA TTTAGCGCTT TTTATTGTTG TCGTGAACTG ACTAGGATTA GTCCATTGAT GCCCTCAAAC AACTCTTGCC GATTTTCAAA CTTCATTTCG GTTATTTCTT TCCTTTCATA GATTTATCAA TCGATGATCA GGGCCTCCTA TTATCTCGTC CTCCATCTCA TTTGCTGAAC ATTGTTTGTG CCTTGGCGGC GAGACATTCA CAGGTGTATG GAATGCAATC TTTACTAGGG GGCGCGGACA CCGGTTCTCC AAGAGAAATA TGGGCATCGA AAGCCAAAGG ACAAGTGCCT CGGAACTTGG CGGTTGCATC GATAGAGATG GTGCAAACAT TGTTGCTCAT CAGTTGGTAT GAGTTTTCCC AGGACAGAGA TGGGGTACGT TGCCCTCGTC GCATTAGATG TAGTTGCTGA TGAAAGCAGG GTCTATGGAT GTGAGCTTCT CAGCGGCGAC TGTTAAAAGC TAATCCAATT AAAAGGTATT CGGGTATGGC GCTGCGAATG GGTCAGGACC TTGGTGGGTC GTTCTGTACC AAGCCAGATA CTCCTTACTG ATACCTTTGA AGGTCTTGAC ACCTTCAAGA ATCAGATACC ATCAAGTGAC AATCCAGACC AGCACGCTCA CCAGTCATTA CGATGTGCAC TGTTCATGAT GGATGCCATT ATGACTATTG GCAGTAGGTA TACCTTTGGC CTACAGGAAA GGGCTGACAT GGCTCGTATA GCTGGTCGAG CGGGGATGTA TAAAACCAGT TTGGACCAGG TGCCTTCCCT TCCACCATTC ACAACACCCA GCGGACTCAC TCTCACAAAC CCATACCCAT ACATGACACG CATCTTCTCG CTTGCAGATC ATGTCACTCG TATATTGGTC GACAAATGCA CAACCACAAT CGACGAAGCG GCTCTTGAAC AAGCTCAAAG CCGGCTGAAC GAATTTCACA CCTCATTACC GGCTGATTTA CGGTTCGAAA CTTCTGCATT CCAGAAGTAC GCCGCGATAG CTCAAGGTGG AGCATTTGTA CTGCTTAATG TGAGTCCATT GGCTGTTCTC ACGACTCAAC TGATTTGAAA TTGCAGTTGT GGTTCCATAC GTGCGTACCG CTTTTTCTGT GCCGCCATTG TTTTGGCTCA TTCTAATAGA TTAATCATTC TTGTATACCG TCCGTCGCTC TTGGTTTCCC CAATCCCACA TGAGAGTCGG CAACAGTATG ATGTTGCCGG GAAAGAGGTG TCTGCGAGCT CGGCAAAAAC CATTTTAGAT ATCGCAATCT TTGCAGAATT AGTTCGTCTG CCCCAATACT TGGGATATCA ACACTAACGC AAGCTTTCAG ATAGATCCCA AAGCCATCAC GCAGCCCTGG ATCAATTACC CTCTCTACAT TGCTGCTCGA ACGTTTCGTA GGTCCTCCTT CCTACAGCGC CTTTTGCTCA CATTCCTCGC AGTTTCGCAA ATAGCTCCCA GTCAGTCGGG GCAATCAGCA GACCCAACGG CTGGCGAAAT TCATATCGCC AAAACAGCAC GTGCAAACTT TCAACGAATT ATCAGTATTT TCGACGGATT ACAGCCGTAC TGGAATGGTG TGCGGTATAT CCGCAGTGTA CTTCTTCAAA AGGCAGAGGG CGTCAGCCAA GTCTCTCTTA TAGATGGAGA AAATGATGTC ATGTCGCCTG ATACATTGCC CCCGGAATTG GCTGCAATCC TTGCAGGGAT GACTAATGAT CGGAGACAAG ACGGTATGTC TCTGCTTGAT TCGCACTTGA CGGATACTTT GGCTGACATT GATTGTAGTG CTCGGAATGG GATTGACAGG AACTATGAAT TCGCCTTCGG ATAATTTGTG CAGCTTGATT CTCGGCACTG GCACTGAGGG GTAAGCGGCG AGGATGAAAG ATGTAAGGAT AGGAAGTTAC GAGGGTTATA GAGGTTGCAT TATTTCGTTG TTTAATTTAT TGGGTTGTCG TGT
|
Protein sequence | MSAQDHLDSP ELRFPSAEIN DPTPQRKSGE LDDTKVKRAT VACNACRARK IKCSGDKPIC TTCAKGSVRC EYPTVRKRKR DSTKKKKDSG SDQLDPALAA PLFTQTSNMP ESTAGTSPLD SNPFAPPFNP FAVTPQHAST SFVAHSIPDW AQSPWKNITG GDIVGQGMDN VDQSTLDILS WDLGAKFPSG TEPGTGNHTS TSDIDSGRMW ENTSSTSNVD RRSGAHRKAR FRVPYFRFFG PTAIAPGYRQ VVCDVSAPVS PTRAPSDLPW DPMLQNDQMI MASDGESPSI DALKQLLPIF KLHFGYFFPF IDLSIDDQGL LLSRPPSHLL NIVCALAARH SQVYGMQSLL GGADTGSPRE IWASKAKGQV PRNLAVASIE MVQTLLLISW YEFSQDRDGV FGYGAANGSG PWYTFGLQER ADMARIAGRA GMYKTSLDQV PSLPPFTTPS GLTLTNPYPY MTRIFSLADH VTRILVDKCT TTIDEAALEQ AQSRLNEFHT SLPADLRFET SAFQKYAAIA QGGAFVLLNL WFHTLIILVY RPSLLVSPIP HESRQQYDVA GKEVSASSAK TILDIAIFAE LIDPKAITQP WINYPLYIAA RTFLSQIAPS QSGQSADPTA GEIHIAKTAR ANFQRIISIF DGLQPYWNGV RYIRSVLLQK AEGVSQVSLI DGENDVMSPD TLPPELAAIL AGMTNDRRQD VLGMGLTGTM NSPSDNLCSL ILGTGTEG
|
| |