Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH02700 |
Symbol | |
ID | 3258982 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 362341 |
End bp | 365431 |
Gene Length | 3091 bp |
Protein Length | 796 aa |
Translation table | |
GC content | 48% |
IMG OID | 638258215 |
Product | conserved hypothetical protein |
Protein accession | XP_572443 |
Protein GI | 58270574 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.738564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATACCGTCCA TGTCCACAAC AATACAGCAA GGCGACACAG ACACAGGTGC AGGGAACCAT CTCCCAGCAT GGCTCCTCGC CATGTCTGGA ATATTCACAG CAGTTGGTCC GTAACTTCCT CCTCGTCGTT CCCAGCAGGA CATCTTGTCA CTCACCCGCA ACGTCGCTGT AGCTACTGCG ATATCTATGA TGTCCATCGT TCTCCAATTG AAGAATTATC GAAAGCCGAC ATTGCAGAGG GCAGTAGTGA GAATCATGGT GATGTTCGTC AGACCCAAAA TATTCATGAA GTTCGCTAAC AAATTCCGTC TGTCACAGGG TTCCCCTCTA TGCCATATCT TCTCTCATAG CTTTGTTCTC ACTTGAGGCT GCATTCTTCA TTGACGCCAT TCGTGATCTA TACGAGGTAT ATGCTTTCTC TATCTGTGGC GACTAGAGAT AGCTTACACA GCTTTAGGCG TTTGTGATTT ACACCTTTCT TCAACTCCTC ATTACGTATC TTGGTGGTGA ACGCTCTCTC CTTATAATAC TCCATGGACG CCCACCAATA CCTCATCCAT TTCCTGTAAA CATCTTTCTT CAGCCAATGG ATGTTAGTGA TCCATGGGTA CTTTTGAACC TGAAACGGGG TGTGCTTCGT AAGTCATATT CATTGTCACA ATTACAGTCG CAATCAATCG CCCATCGATT TAGGCTGGAC TAGCAAACTA AACAACAAGC TTATACATAT AACGCTTTGT AGAATATGTG CAAGTGAAGC CTTTGTTAGT GCTTGCCACT GTAGCTCTCA AAGCCACTGG AACCTACCAA GAAGGCAGAT TCGCCGCTGA TTCAGGATAC ACGTATGTCA GTATTGCATA CAATACCAGT ATCTGTCTGA GCCTCTAGTG AGTTTTTCCT TTTAAGCCTA ATGCTCCAAA AGCCAAAATT AGGGTGGATG GCGGGGGGAT TAGAAATCAG TTAGGCTGAC CACAGTGGCC GGCACAATAG TTGTTTAGCT ATGTTCTGGG TTGCGGTAAA CAAAGACTTG AAACCCTTCA GACCAGTTCG TAAGTAGTGT TTATTTCCCC CCAAGGCTTC ATGACCCTTA GAATTAAGTC TCTTCTGTAG CCAAATTCCT CTGTGTCAAA GGAATTCTGT TCTTTTCTTT TTGGCAAAGT ATCGGTATAT CCTTACTCGT AGCCATGGGC GCCATCAGAA AAGGTTAGTC CCAGGCTAAG ATATGAGCTC AATGATCGGC CAAGCTAATT TTGATGAGGA TAGTTGGGCC GTATACGGAT CCTGAACACA TGTCTCTTGC TCTTGTGGAC TCTTTGATCT GTTTCGAGAT GCCAATCTTC GCTATCGCTC ATGTAAGTCC TTTCAAGCCA CCATTCCCTT TTTATTCCCT CCCTAATCTT GATGTTCAAA ACGCATCCAG CAATACGCAT TCCAAGCCAG CGATTATATT GACCACAACC TCGTCTATGC GGCCCGTCTT CCATTCATCT ATGCTTTCCG CGATGCCTTT GGGTTCAAAG ACGTCTGGCA AGACACGATC GACACATTCA AAGGCCGCGG CGTATCATAC CAAGCCTACG AACCTGCTGA GGGGGGTCTG CATTATGGCG TTGGTCGACA AAAGAGGATA AGAGCGGGAT TGAGGTACGC GAAAGGCGGG AAGATGAAGT ATTGGATGCC AAAGCCAGGG GATGAGGCGA GGATGAAGGG ACAGAGTGGG CTGATTACGA GTATGAAACG GAGAGTGGAT GAGAGGTTGG CAGAGAGAGA AGGGTATGCA CCATTACTCC CTCAACAGGC GGCGAGGGTT GTGCATCTTG ATCCCGGGCG CTATACCACC TATACAGGGG GGGAACGGAT GTTCGACAGT GATTCTTCGG ACGATTCGGA TGCCCCCAGC CTCACTTTCC ACTCTGCAGA CGAGATGGAA GATGCAATGT ACGACCGAGC AAGGAGGATA GGGTATGCGG GGTTCCCGAA TGTGGATGTG AGTAAAGAGG AAGCGAGAAG GAAAAGGAGG GAGGAGGAGG AAGGCATTTT GAAGGGGAGA TGGACGAGGA GTGGGTTGAG GGTAAGAGAT GGGACTGGAC TTGGCGGTGG GGAGGGGCAG GGGGAAGGGA GAATGAGGGC GGGTTCAGCA GCCAAGGGTA AGAGTAATGG TAAAAATAAA GATAAAGGGA AGGGGAAAGG GCAAAGCAGG AAAGTGTACG GAACATGTAA GTCTGACATT ATTGTCATTT GCTGGGTCGA AAAGGGTTGA CAAAGGCGGT ACAGGGGCAG ATCACCCGCC ATCCGTACAA CGCTTCAACT CTTCATCATC ATCAACTCGC AACGGAAACG GAAATGGACA TCCGGCAGAC TTTGATGAGG CTGATGATGA AGGCGTCGTG CGGCAGGAGG AGGAGGGAAG GGCTATTAAA AAAAAGAGAC AGAGACAAAG TCCACCTGTT TGGGTTACAC GCACGGGCGC GACAACGGGT GTAGGCTCCT CTTCGTCCCC TTTTTCCATT GGGGATGATG ATGATGATGA CGATGAGGAC GGAGAGGGTG TCAAGAACAG AGAGACTCAA GGCGACAGAT CTATGAGCTT CAATCACCCT GATAACTCCA GCCACAACTC TGACTCAGAT TCTGAACATT CAGTTAAGCC GTCAACCACA AAAAAATTAC CCGCCGACGC GGTCGATCTG GTCAAGGAAG ATTACGAAGC CGTTGAAGCC GCTCGGGAAC GCGAACGTAG ACGAGGAGAA CCGCAGACGA ATGCCCCTGC TCATGTGTAT CGCAAAACCA TCATTGACGA GATACCAGAG ACGGATACGA ATGCAGGTGC AATGAAGAGG AGGAGAGGAA TAGGAGGAAG AATTGAGGAA ATACAGGAAG TGTATGCGCA TGATCCTCAT GTGATGACAA AGGATGAGAT ACAGACAGGC GTTAAGGAAG TGGTGGATCA CGTGGAGACA AGTTTGACTG TCGAGCCTCC AAAGCACGCG ATGAGTCTGG ATTTAGAAGA TAACCCGTGG TCGTGAGGTT GCTTGGGCTT GTGTGATGGG GTGAATAGAC AAGATCCTTA GATGTCCATT TATTGTTTCT T
|
Protein sequence | MSTTIQQGDT DTGAGNHLPA WLLAMSGIFT AVATAISMMS IVLQLKNYRK PTLQRAVVRI MVMVPLYAIS SLIALFSLEA AFFIDAIRDL YEAFVIYTFL QLLITYLGGE RSLLIILHGR PPIPHPFPVN IFLQPMDVSD PWVLLNLKRG VLQYVQVKPL LVLATVALKA TGTYQEGRFA ADSGYTYVSI AYNTSICLSL YCLAMFWVAV NKDLKPFRPV PKFLCVKGIL FFSFWQSIGI SLLVAMGAIR KVGPYTDPEH MSLALVDSLI CFEMPIFAIA HQYAFQASDY IDHNLVYAAR LPFIYAFRDA FGFKDVWQDT IDTFKGRGVS YQAYEPAEGG LHYGVGRQKR IRAGLRYAKG GKMKYWMPKP GDEARMKGQS GLITSMKRRV DERLAEREGY APLLPQQAAR VVHLDPGRYT TYTGGERMFD SDSSDDSDAP SLTFHSADEM EDAMYDRARR IGYAGFPNVD VSKEEARRKR REEEEGILKG RWTRSGLRVR DGTGLGGGEG QGEGRMRAGS AAKGKSNGKN KDKGKGKGQS RKVYGTWADH PPSVQRFNSS SSSTRNGNGN GHPADFDEAD DEGVVRQEEE GRAIKKKRQR QSPPVWVTRT GATTGVGSSS SPFSIGDDDD DDDEDGEGVK NRETQGDRSM SFNHPDNSSH NSDSDSEHSV KPSTTKKLPA DAVDLVKEDY EAVEAARERE RRRGEPQTNA PAHVYRKTII DEIPETDTNA GAMKRRRGIG GRIEEIQEVY AHDPHVMTKD EIQTGVKEVV DHVETSLTVE PPKHAMSLDL EDNPWS
|
| |