Gene CNF03000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF03000 
Symbol 
ID3258228 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp856841 
End bp859961 
Gene Length3121 bp 
Protein Length768 aa 
Translation table 
GC content44% 
IMG OID638257427 
ProductER-associated protein catabolism-related protein, putative 
Protein accessionXP_571596 
Protein GI58268880 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCGAATTGTA CAGGCATTAC TCAAGACATT TCTACCCGCA TAACAACATT GTAGTGCCAT 
ATTTCAAGTC ATGCCAGCAT GGCCCAGGTT TATGTCTTTG CGACGTCAAT ATCTCTACAA
AGCACTCGAC AAAAATCATC AAAAGCATCC CATCAAACAT CCACATACAC GGTTCCTATC
GCGACTTTTT TGCGCATTGG CTGTCCTCTT TTCATATTCA ATCTATCAAT CATTCCGGAC
TGACCTCAAA CAGTCAGGAT CTTGGGGATG TGAAATGAGC TGGATGTCGC CTTCGTATCG
CCGGCTGGAA TGGACAGAAT TCATTTCAAC GAGATATGCC TTGTACCTTT ATCGCGAGCA
AGGCTTGGAC TCGGAAGATA CGGTGAGTTA CTCTGCTGCT GGCTGTAGTC CTAAAGCCTC
ATAGACCATA CTAAGGACAG TACATAGCTT TCGGGACACC CTGTTCTCTT TGTACCTGGG
AATGCTGGCT CATATCAGCA GGTCCGCTCG ATCGCCTCCT CAGCATCCAA GCAGTACTAT
GAGCAAGTGA AGGCGAGGGA ACGTAATGTG GTGACTGGGA AGAAGATTGA CTTCTTCACG
GGTGAGTTAC CGCGGTCTTT ATCTTGGGAG TTTTACTGAG CCTCATCGTC TAGCTGATCT
GAAGGAAGAG TTCTCTGCGT TTCATGCGCG GACTGTACGC GAACAAGCAG TTTTCATCCA
ACACTGCATC AAGGGGATAC TCCAGGAGTA TACGCATCTA CCGCAAGAAA AGAGGCCTAC
GCAGGTTACT CTACTTGCGC ACTCCATGGG GGGTGTTGTT GCTCGCTTAG CAATGGATCC
AATTACTTCG ATTTCAGTTG ACATTATCGT GACCTTGTCG ACGCCTCATA TCTTACCACC
ACTTGCTCTC GAACGTGACA TGGATTCCAT ATACTCATTG ATAAGGTGGA GGAGACAGCA
TATCAGTACC CATCCGCCAT TAATATCCAT ATGTGGCGGC ATTTCAGATA CACAAATTGT
ATCGGATAGC TGTGCGCTGC CCTTTTTTCA GGCAGGCAAT AACAGTGACA TTGCCGTTTT
CACCACTGGC ATACCTGGTG TATGGACTGC TGTCGAACAC CAGGCCATAA TCTGGTGTCA
TCAAATCCGC TGGCGAATTG CTAGGATGTT GCTTGACATG TCAAGTAGGG CAAATACAAC
TGCAAAATTG GTTACTGCAA AAGAGTGGCT TCTCGATTAT CAGGAAGATG AAACCTTGAA
AGAACCACGT AGCGAGAGAC AACATGATTA TTCGGTGTCA TCTCGCAATA TGACCTTCAT
TGGACTTCAT CAACCAAGCA AGGCCTTCGT TGCCCAACAA TGCAATGGTC TCGAGCGTTG
TAGAACGGTT CCTTCAGTAA TGAGTCTTCT TCCATTCCCA AATAATCCTA GTGACCCATT
TCCGCTACCG GGAGAGGGGA TTAAACCAAG TGAAGTAATG CTAGTAGCTG AAATAAGCCT
CTCATCGACC AACACAGTGG TCAAAATAAA TGCATCGCAG TATGGGCAAA CCATTGCAGG
ATCAAGAGAA CACCATTTAG TCAAAGGGAA TAGCTGGAGT GAGTTTACAA TCACAATGCG
ATATGTGGAT TAACCTCAAG CATAGAAGTT AATTCTTCAT TGCCATCTTT GACCACTCAT
CACCTCTTCC ATTTTGAGAA TGCTTTTCTC TCCTCACTGG TTACTCATTC TCTTGATATC
ACTCTTGGCC ACTGCAAAGG TATGTAATGT AATGTATGAT TAAATCCACT AATGAAATTC
CAGATTTTAA ACCATTAATT AAGCACATAT CTCAACCAGC TTTGGAGTTG CAAAGCGCTA
CATTCGAATC CAATTATTAT TTTGCATCAG GCCGACCCAT ACATCTACAT TCTCATTCTA
CTGCGGGACC CTTCTTACCT TACCAAGACA GAGCTGGCAT TTATCTGGAG ATATTTCAAT
CGCCCTTATG TCCTGTACAA CAGGTGTCGC TCAGAAAAAA TTACTACAAC GTACTGGCCA
AATCTGTGAC TCGGTACCGT ATGGTCGTTC TTGCATGGCC AGTAGGGTGG GCTACTGTGG
TTCTTCTGTT TCAATTATCA GATTTTATCA ACACAGGTCA GTTAGAACAT GTAAAGGATT
CAAGTTAATT TTGGTAGGTG AGATACTTCC CTGGAACTCG GCATTAGAAA GGATTGCAAG
GCGCCGGATG CCAATTTGTA TTGTTCTGCT CCTTCTGGGA GCGACTATAC AATCACAGCT
ACCGGATTTT CCAATGTTAC ACACCTTTTT TCTGGGTGTT AATCAACTGG AAATGGTCCC
TCTGGTGGGA ATTCTGGGGG TGTGGACGTT TGGGTTGTTA TGTGTTGTGT CTTTTGTAAT
CACCGCCTGC CTCTGGCTTC TTGGCTGCAT TATACAACAG CCTCACGGTC ATGAGCGATT
GGAAGATGAG TAAGTATTTA TTCATCTCAT GCTATGTGGC TCTAATCGCT TATGTACCTA
AGGACTAAAT CAAAGCATGA TTGGCTGGGA GTTGTGATGA TTGGGGCCGC TGCAGTGCTT
GTCAATCAAG TGATTCCCCA TCAACTGATA TTCCTCTTAT GTGTTATTCT TTTGTGGTTA
TCCGCTGCAC GGTCAAAAGC CCTAAACGAT CGATATGTAA GTTGATTATG GCTCTATTCA
CTGCTGATGT TTAGCATCTA ATTTCTACTT GTGCAATATT CACAACTCTG TTGATCCCCT
TCAAAATCCT TCACGTAGCA ATCTGGAGCA GAAACATTTG GACTGGAAGT GCAGCTCTTG
TCAGTACAGA TAATAATTTC TACTATGCCA TACCTCCAGT CCTTTTGGTT AAGTGTGCAT
CCTGTGGTGG GACAATACAG AAGAGACATG TGTAGGTCTT TTTTTATGGC AGGACCTATG
GGATAGCAGC TGACATATCA GTAGTTGCTT AAAAGCATGT CGAATTGCTC TCATAATATT
GATCATGTCT TCTTTCTCTG TTGGTGCGCG GTGGACTTGG ATTCTATCTC CTATAGCAAA
CGCGGTATTG ATTTTGTTTG TTGCTTCCAT AATTTAGATA CTATGATATG CGACAAAACC
A
 
Protein sequence
MPAWPRFMSL RRQYLYKALD KNHQKHPIKH PHTRFLSRLF CALAVLFSYS IYQSFRTDLK 
QSGSWGCEMS WMSPSYRRLE WTEFISTRYA LYLYREQGLD SEDTLSGHPV LFVPGNAGSY
QQVRSIASSA SKQYYEQVKA RERNVVTGKK IDFFTADLKE EFSAFHARTV REQAVFIQHC
IKGILQEYTH LPQEKRPTQV TLLAHSMGGV VARLAMDPIT SISVDIIVTL STPHILPPLA
LERDMDSIYS LIRWRRQHIS THPPLISICG GISDTQIVSD SCALPFFQAG NNSDIAVFTT
GIPGVWTAVE HQAIIWCHQI RWRIARMLLD MSSRANTTAK LVTAKEWLLD YQEDETLKEP
RSERQHDYSV SSRNMTFIGL HQPSKAFVAQ QCNGLERCRT VPSVMSLLPF PNNPSDPFPL
PGEGIKPSEV MLVAEISLSS TNTVVKINAS QYGQTIAGSR EHHLVKGNSW SEFTITMRYR
YIRIQLLFCI RPTHTSTFSF YCGTLLTLPR QSWHLSGDIS IALMSCTTGV AQKKLLQRTG
QICDSVPYGR SCMASRVGYC GSSVSIIRFY QHRSVRTCKG FKLILVGEIL PWNSALERIA
RRRMPICIVL LLLGATIQSQ LPDFPMLHTF FLGVNQLEMV PLVGILGVWT FGLLCVVSFH
LISTCAIFTT LLIPFKILHV AIWSRNIWTG SAALVSTDNN FYYAIPPVLL VKCASCGGTI
QKRHVCLKAC RIALIILIMS SFSVGARWTW ILSPIANAVL ILFVASII