Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB00770 |
Symbol | |
ID | 3256020 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 223436 |
End bp | 225928 |
Gene Length | 2493 bp |
Protein Length | 763 aa |
Translation table | |
GC content | 50% |
IMG OID | 638254729 |
Product | conserved hypothetical protein |
Protein accession | XP_569085 |
Protein GI | 58263350 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.104071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAAA CACCTCAAAT ACCTCTGATG AGCGATTCAG GACAGTTTGC TACACCACAA TCTACCCTGC TTCCTCATTC ATCCCAGGCG GCTGGACGTC CTGGGGCTTC CTCGTATTCT CGACAGAATG ACTATCGATC TTCGAACTAC TCAACCGGCA TCCCAGCAAG CCTACAGGGA GAACACATTG TGTCGGGCTC TGTGGTGAGT CACCGTCAAT GTCTCGATAA GAGCTGAGGC TGAACCTCTT GACTAGGTTC CATCGACGAC ATTTATTCCG GAACCGACCA GCTCTATGCC ACCTCCGACG TCAAGCTTTG ATCCAAGACC GTTTGCAACT ATTGAAGCCC AGCCGACGAC CTCAAACAAC TTTTCCAATT CCGAGGCCGG AGGTTTTGCA CCTGCTCCTC CAAGGTCATC TTCACTGAAC CAGCCATATC TTGGTCAAGC ACAGACACCT ACGCTACCAG CACCCAGACT TACAACACCT ACCAATCCTA TTAAACTCCA CCCGTGCCCG ACTGCCGAAT CACTTACATA TGCTTCTGCA AAACTTGACA GTTGGACAGA CGCACACCAG ATTACCTGGG CTCAAGACGT AGTGAGATTG GTGGAAAGAT ATTGGCAGCA AACCAGCAAT GCCGACCATT TCGAACAACC ACCGTCGCCT CCATCAGACA GTCAACACCT CAGTATTACC TTGCAGCGTC TACTCGATAC TGCAGCTCCC ATCGTGATTG CAATTTCAGA CAGCGATATC AAAGAGCATG CCTCACTAGC CCTGTACTTG AAAGGGAAAC TGCTCTCTTC AGGTGCTTCT CCAGCGCTTT TACCAAAGGA TAAGAGGCAA TCGTTCAAGG ACTTTGAGGG AGCTGCGAGA AAGGGCGAAA AGCGGGCATG GTATAGACTC GGCAAGGATT ATGAAGCTGT GAATGATCTT GACCGGGCTG GTGACTGCTA TGATCGTGGT GCAAAAGCAG GAGATTGTGA AAGTGCGTTT GTGAGTGACA TTACTTAACT GGTGAAGCAT CGCAACATCT AACGAGAAAA TACAGAGAAT GGGCATGGCT CACTTGCTAG GCCAACTCAA CTTTCTTCCA GATCCGGCCA CTGGCTTGTA CTTTCTCCAT CAATCATCCG ACACTGCGAG CATCGATTTT CCCCAAGCAT CCTACGTATA CGGCATGCTC CTCGCAGGTG ACATCACTCT ACAAACCAAC TTGCCACCGA GTCTCATCAT CCCTCCAACA TCTCCACCCA CCGACGCTCT CTTGGCCCAG CAGAATCTCG CGTGCGAATC AATCGCGCGC GCCGCCTACC TCAATTATCC CGCAGCCCAA TTCAAACTCG GCCAAATGTA TGAACACGCC GATCTGGGCT GCGTATACGA TCCTATCGCT AGTGTTGCTT GGTACACGTT TGCAAGCCAG AACGGGGAGA TGGAAGCAGA TATGGCGTTA AGCAAATGGT TTTTATGTGG AGCGGAAGGA CATTTCCCGA AGAATGAGAG TAGCGCGAAG ACGTATGCGG AGAAGGCTGC TAGGAAAGGT CATCCGAATG GGTGTTTTGC ATTGGGGTAC TATAATGAGT AAGTGCATGT AATTTCAGGT TGTTGAAATA ATGTGCCGAC TCGAAACCTC AGAATTGGCG TAGGCACAGA TGTCGACCTG GAACAAGCCA GGAAATGGTA CGAAAAAGTA AGCACACTCA TTCAATTTCC CCAATATCCA CTAACGCTGT ACTCAGGCTG CAAAGGCAGG CAATGCCGAA GCCTCTACAC GCCTTGCTGC ACTATCCCTT CCCGTACCAA CGGCCATCTC AATGAACGAA CACCAATCCC GTCTTAATGA CACTCTCGTC CGCAAACACA CTACGGCAAA ACTTCGTTCT GACCGCAACA AATCTTCTCG ACCTGTGCGT CAGCAAATCT ACGAGTCTCA ACACCCTCCT CCTCTTCCTA GCTATGGCCA GACAAACTCA CTGCTCGAGT TTGGGCAAGT GAGTCAAATA CAAATGCCCA TGGCGTTACC TAAGTCTCCA CCTCCCATCT CACACACACC TTCCCCCGTA TCGTCATTAA CTGTGCCCCC TATGAATTAT TCCTCCCATC AACCTCAACC TTCTTACGCC CTTCAAGACC CTCCTATTAA CCCTTCCCGA CCTCCTATGC ATGAGCACCA ATATTCTTCC CATTCCACTT CTACCACCTC CACTGGAGGT GGACGCCAGT CGAGCAGCAC CCCAAGGCCG ATATCGGTGG TTGACCATCG TCGAATGAGT ACAGTTAGTT CAAGCGTTAA TGACTTTCCT GTTCTCAGTG CATCTTCATC TACATCCATT TCTATACCCT CTTCCGGCCC AACCAGTACG ATCAGCGGAG GAATATCGGG GACGATGGAG AAGAAGAGGA AGAAGAAGAA GGGCCCTCAG ACGTTTGCGG AGATGGGATT TGTGAGCAAA CCGGTTGAAG AGGATGGTTG TGTAATCATG TGA
|
Protein sequence | MSETPQIPLM SDSGQFATPQ STLLPHSSQA AGRPGASSYS RQNDYRSSNY STGIPASLQG EHIVSGSVVP STTFIPEPTS SMPPPTSSFD PRPFATIEAQ PTTSNNFSNS EAGGFAPAPP RSSSLNQPYL GQAQTPTLPA PRLTTPTNPI KLHPCPTAES LTYASAKLDS WTDAHQITWA QDVVRLVERY WQQTSNADHF EQPPSPPSDS QHLSITLQRL LDTAAPIVIA ISDSDIKEHA SLALYLKGKL LSSGASPALL PKDKRQSFKD FEGAARKGEK RAWYRLGKDY EAVNDLDRAG DCYDRGAKAG DCESAFKIQR MGMAHLLGQL NFLPDPATGL YFLHQSSDTA SIDFPQASYV YGMLLAGDIT LQTNLPPSLI IPPTSPPTDA LLAQQNLACE SIARAAYLNY PAAQFKLGQM YEHADLGCVY DPIASVAWYT FASQNGEMEA DMALSKWFLC GAEGHFPKNE SSAKTYAEKA ARKGHPNGCF ALGYYNEIGV GTDVDLEQAR KWYEKAAKAG NAEASTRLAA LSLPVPTAIS MNEHQSRLND TLVRKHTTAK LRSDRNKSSR PVRQQIYESQ HPPPLPSYGQ TNSLLEFGQV SQIQMPMALP KSPPPISHTP SPVSSLTVPP MNYSSHQPQP SYALQDPPIN PSRPPMHEHQ YSSHSTSTTS TGGGRQSSST PRPISVVDHR RMSTVSSSVN DFPVLSASSS TSISIPSSGP TSTISGGISG TMEKKRKKKK GPQTFAEMGF VSKPVEEDGC VIM
|
| |