Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF03000 |
Symbol | |
ID | 3258228 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | - |
Start bp | 856841 |
End bp | 859961 |
Gene Length | 3121 bp |
Protein Length | 768 aa |
Translation table | |
GC content | 44% |
IMG OID | 638257427 |
Product | ER-associated protein catabolism-related protein, putative |
Protein accession | XP_571596 |
Protein GI | 58268880 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGAATTGTA CAGGCATTAC TCAAGACATT TCTACCCGCA TAACAACATT GTAGTGCCAT ATTTCAAGTC ATGCCAGCAT GGCCCAGGTT TATGTCTTTG CGACGTCAAT ATCTCTACAA AGCACTCGAC AAAAATCATC AAAAGCATCC CATCAAACAT CCACATACAC GGTTCCTATC GCGACTTTTT TGCGCATTGG CTGTCCTCTT TTCATATTCA ATCTATCAAT CATTCCGGAC TGACCTCAAA CAGTCAGGAT CTTGGGGATG TGAAATGAGC TGGATGTCGC CTTCGTATCG CCGGCTGGAA TGGACAGAAT TCATTTCAAC GAGATATGCC TTGTACCTTT ATCGCGAGCA AGGCTTGGAC TCGGAAGATA CGGTGAGTTA CTCTGCTGCT GGCTGTAGTC CTAAAGCCTC ATAGACCATA CTAAGGACAG TACATAGCTT TCGGGACACC CTGTTCTCTT TGTACCTGGG AATGCTGGCT CATATCAGCA GGTCCGCTCG ATCGCCTCCT CAGCATCCAA GCAGTACTAT GAGCAAGTGA AGGCGAGGGA ACGTAATGTG GTGACTGGGA AGAAGATTGA CTTCTTCACG GGTGAGTTAC CGCGGTCTTT ATCTTGGGAG TTTTACTGAG CCTCATCGTC TAGCTGATCT GAAGGAAGAG TTCTCTGCGT TTCATGCGCG GACTGTACGC GAACAAGCAG TTTTCATCCA ACACTGCATC AAGGGGATAC TCCAGGAGTA TACGCATCTA CCGCAAGAAA AGAGGCCTAC GCAGGTTACT CTACTTGCGC ACTCCATGGG GGGTGTTGTT GCTCGCTTAG CAATGGATCC AATTACTTCG ATTTCAGTTG ACATTATCGT GACCTTGTCG ACGCCTCATA TCTTACCACC ACTTGCTCTC GAACGTGACA TGGATTCCAT ATACTCATTG ATAAGGTGGA GGAGACAGCA TATCAGTACC CATCCGCCAT TAATATCCAT ATGTGGCGGC ATTTCAGATA CACAAATTGT ATCGGATAGC TGTGCGCTGC CCTTTTTTCA GGCAGGCAAT AACAGTGACA TTGCCGTTTT CACCACTGGC ATACCTGGTG TATGGACTGC TGTCGAACAC CAGGCCATAA TCTGGTGTCA TCAAATCCGC TGGCGAATTG CTAGGATGTT GCTTGACATG TCAAGTAGGG CAAATACAAC TGCAAAATTG GTTACTGCAA AAGAGTGGCT TCTCGATTAT CAGGAAGATG AAACCTTGAA AGAACCACGT AGCGAGAGAC AACATGATTA TTCGGTGTCA TCTCGCAATA TGACCTTCAT TGGACTTCAT CAACCAAGCA AGGCCTTCGT TGCCCAACAA TGCAATGGTC TCGAGCGTTG TAGAACGGTT CCTTCAGTAA TGAGTCTTCT TCCATTCCCA AATAATCCTA GTGACCCATT TCCGCTACCG GGAGAGGGGA TTAAACCAAG TGAAGTAATG CTAGTAGCTG AAATAAGCCT CTCATCGACC AACACAGTGG TCAAAATAAA TGCATCGCAG TATGGGCAAA CCATTGCAGG ATCAAGAGAA CACCATTTAG TCAAAGGGAA TAGCTGGAGT GAGTTTACAA TCACAATGCG ATATGTGGAT TAACCTCAAG CATAGAAGTT AATTCTTCAT TGCCATCTTT GACCACTCAT CACCTCTTCC ATTTTGAGAA TGCTTTTCTC TCCTCACTGG TTACTCATTC TCTTGATATC ACTCTTGGCC ACTGCAAAGG TATGTAATGT AATGTATGAT TAAATCCACT AATGAAATTC CAGATTTTAA ACCATTAATT AAGCACATAT CTCAACCAGC TTTGGAGTTG CAAAGCGCTA CATTCGAATC CAATTATTAT TTTGCATCAG GCCGACCCAT ACATCTACAT TCTCATTCTA CTGCGGGACC CTTCTTACCT TACCAAGACA GAGCTGGCAT TTATCTGGAG ATATTTCAAT CGCCCTTATG TCCTGTACAA CAGGTGTCGC TCAGAAAAAA TTACTACAAC GTACTGGCCA AATCTGTGAC TCGGTACCGT ATGGTCGTTC TTGCATGGCC AGTAGGGTGG GCTACTGTGG TTCTTCTGTT TCAATTATCA GATTTTATCA ACACAGGTCA GTTAGAACAT GTAAAGGATT CAAGTTAATT TTGGTAGGTG AGATACTTCC CTGGAACTCG GCATTAGAAA GGATTGCAAG GCGCCGGATG CCAATTTGTA TTGTTCTGCT CCTTCTGGGA GCGACTATAC AATCACAGCT ACCGGATTTT CCAATGTTAC ACACCTTTTT TCTGGGTGTT AATCAACTGG AAATGGTCCC TCTGGTGGGA ATTCTGGGGG TGTGGACGTT TGGGTTGTTA TGTGTTGTGT CTTTTGTAAT CACCGCCTGC CTCTGGCTTC TTGGCTGCAT TATACAACAG CCTCACGGTC ATGAGCGATT GGAAGATGAG TAAGTATTTA TTCATCTCAT GCTATGTGGC TCTAATCGCT TATGTACCTA AGGACTAAAT CAAAGCATGA TTGGCTGGGA GTTGTGATGA TTGGGGCCGC TGCAGTGCTT GTCAATCAAG TGATTCCCCA TCAACTGATA TTCCTCTTAT GTGTTATTCT TTTGTGGTTA TCCGCTGCAC GGTCAAAAGC CCTAAACGAT CGATATGTAA GTTGATTATG GCTCTATTCA CTGCTGATGT TTAGCATCTA ATTTCTACTT GTGCAATATT CACAACTCTG TTGATCCCCT TCAAAATCCT TCACGTAGCA ATCTGGAGCA GAAACATTTG GACTGGAAGT GCAGCTCTTG TCAGTACAGA TAATAATTTC TACTATGCCA TACCTCCAGT CCTTTTGGTT AAGTGTGCAT CCTGTGGTGG GACAATACAG AAGAGACATG TGTAGGTCTT TTTTTATGGC AGGACCTATG GGATAGCAGC TGACATATCA GTAGTTGCTT AAAAGCATGT CGAATTGCTC TCATAATATT GATCATGTCT TCTTTCTCTG TTGGTGCGCG GTGGACTTGG ATTCTATCTC CTATAGCAAA CGCGGTATTG ATTTTGTTTG TTGCTTCCAT AATTTAGATA CTATGATATG CGACAAAACC A
|
Protein sequence | MPAWPRFMSL RRQYLYKALD KNHQKHPIKH PHTRFLSRLF CALAVLFSYS IYQSFRTDLK QSGSWGCEMS WMSPSYRRLE WTEFISTRYA LYLYREQGLD SEDTLSGHPV LFVPGNAGSY QQVRSIASSA SKQYYEQVKA RERNVVTGKK IDFFTADLKE EFSAFHARTV REQAVFIQHC IKGILQEYTH LPQEKRPTQV TLLAHSMGGV VARLAMDPIT SISVDIIVTL STPHILPPLA LERDMDSIYS LIRWRRQHIS THPPLISICG GISDTQIVSD SCALPFFQAG NNSDIAVFTT GIPGVWTAVE HQAIIWCHQI RWRIARMLLD MSSRANTTAK LVTAKEWLLD YQEDETLKEP RSERQHDYSV SSRNMTFIGL HQPSKAFVAQ QCNGLERCRT VPSVMSLLPF PNNPSDPFPL PGEGIKPSEV MLVAEISLSS TNTVVKINAS QYGQTIAGSR EHHLVKGNSW SEFTITMRYR YIRIQLLFCI RPTHTSTFSF YCGTLLTLPR QSWHLSGDIS IALMSCTTGV AQKKLLQRTG QICDSVPYGR SCMASRVGYC GSSVSIIRFY QHRSVRTCKG FKLILVGEIL PWNSALERIA RRRMPICIVL LLLGATIQSQ LPDFPMLHTF FLGVNQLEMV PLVGILGVWT FGLLCVVSFH LISTCAIFTT LLIPFKILHV AIWSRNIWTG SAALVSTDNN FYYAIPPVLL VKCASCGGTI QKRHVCLKAC RIALIILIMS SFSVGARWTW ILSPIANAVL ILFVASII
|
| |