Gene CNF04750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF04750 
Symbol 
ID3258356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp1380511 
End bp1383688 
Gene Length3178 bp 
Protein Length930 aa 
Translation table 
GC content51% 
IMG OID638257593 
Productexpressed protein 
Protein accessionXP_571617 
Protein GI58268922 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCTGCTTTGA ATTCTACTTC TACCCGTACT TCCTACATCT CATACATTAC AGGGCAAAAT 
CAACGAGAGT CAAGGTGATT TTATCTGTCC TTAAAGTAAT CCAGCTAACC TCTACTGTTG
CGCAGCGTCC GTCCGAGCCG GCCATCCTGT CCACATCTAC TTCTTGAACA TCGCAACCCC
CATCTCATAA CCCTAAAATC GTACATTGAA CTACACACAC GCTGTAAGTA CTGGAAATTC
TGTCTGTGAT TTTGCCTAAT TCACCCGTTC AGCCTAGCGA TGTCCACTCC TGAAGCCGGG
GACCAACCAA ATTCATCACT TTCCGCTCTT ACATCTTCTA TCGACAACAC CTCCTCCCCA
GTCGACTTTC TCAACTCCCT CCTGGCACCT CTTCTTCCAC CATCTCTCCC ACCCCCTAAC
AAGCCCCAAC CACCTTCTTT GCAGCCCATC GACACCGCCC TGAATGACCT CCTTACCCAA
CTGTCGCTCC TGTCGCAAGA TACTGCGTCA GCGATCGAGC AAGGGATGAG CGATGTAAGC
AGAACTGTCC CCAGGCTAGG CTACGACCTC CAATTTATGC GGGAGAGCGC CAATGGATTG
TCTGTCAGTT TGGGGATGGT ACAAGGCCGG GTTGCGAGAC AGGCGGATTA TGAGATGCCG
AACAACAAGT TCCCAGGTGG TGAAGAGAGT GAAGCTGTAA AAGCTTTCCG TGCGCTGGAA
AAGATCACTC ATCTGGACAA GCTCAAAACA AGGCTCGAAT CTGCGCGGGA CACCTTGCGC
GAAGCCGAAT CATGGTCTAC ACTTGAGTCG GAGATCACCA CTCTCATCAG TGAGAAGGAA
TATGCAAAGG CCGGGCAGCG ACTGGCCGAG GCAAGTCGGT CAATGGTAGT GTTTAAGAAT
GAGCCTGCAG AATGGGAAGA GAGAAAGAGA TTGTTGGTTT CACTGGGAGA TGAGCTTGAG
CGGGTTGCAG GAGAAGCGTT GAGGGAGAGC TTGAAAAAGG ACGATGGTGT GGACGAAGTG
CGAGCTTTCT GGGAGGTATT CATGGATATG GAAAGGGAAG AAGAGTTCAA GGGGTGGTAT
TTCAAGGAAA GAGGAAGGGG GTTACTTGAG GCATGGAAGG AACCATTGGT GGAAGAAGGA
CAGGGCGAAA ATTCATCAAA GCTATCCGAC TTTCTCCCCA AGTTTTACTC CCTTGTACTT
CAAACCCTTT CAGCCGAGCT ATCCTATATA CCCCTCGTCT TCCTTCCAGA ATCATCACCC
TCGATTTTGG CGTCATTTTT CCAGTCCACA CTCGACTCCC TCGATCCGAC GTTCTCAAAC
CGCCTCGCCG CCGTTGCGGA CTATCACGGC CCTGGTGCCC TTCCTGAACT CGTCAAGGCA
TGGGAAGCGA CTGTTGATTT GGGAGCGGGG GTACAAGGAT TGATTGACAA GATCATATTC
AACACTCAAG GAGGCTTGCT CAGCGGTGGT GCCGGCGAAA TCGATGTTGA ATCACCCGCC
ACCATCTTAA CCTCCCCTGG CATCTCATCA TCTTCACCCA ACCATCCCAT TCCTCGCACA
AACTCCCACT CCCATTCTAA ACGGCATCAG TCCATCTCTC GCAGATTCTC CCGCGCACCT
AACGCTACCA CTACCTCTCT TTCCCCTTCC CCCGGGAACG TCGATGACGC TTGGGAGACG
ACCCTGTATG AACCATTCTT GGACTGGCAG TCGTCCTACT CTTCTTTGGA GAAGAGGTGT
TTGGAGAAGG AGGTGGCGGA CTTGAAGACA TCATGGGAGA AGGCAAACAT GAAGCAAGGT
GGGAAGGATG TAATGAGCGG GATGATATCA CGTACGGCCG AACTCAAGGG TAGGCTTGAG
GAAGCAGTAG CGCGCTGCAG GATATTCACT TTTGGCTTTG GGGCAGTCCA TCTTATCCGT
GCTGTGGATA CTTGCATCTC CAGATTTTTC GACGATGAGC GGACCTCCAT CCTCAACAAC
GCCAAGTCTA AGAGGGATAA TAACAAGCAG AAGGATAAGG CGGATGAGCT CGATTTGGAT
GAGTTGGATG ACGATGGTGG GGATTGGAGT GGGTGGCAAG TAGGTCTGCA CATCCTTGAT
TCACTCCAAA AGGTGGCAGA GAAGCTTGTG GCTATGGAAG ATGGGTTGAA AGCCGAGTTG
AGTGAGTACG CCAAGATGCT GAAAGCCCAA AAGGGGGAGA AATGGGATGG CCAATGGGAC
GGAAGAAAAG CAACGTTTGG CACTGTGTCT CTTTTGCAGC AGTCCACGCT CAACACCGCC
GATTTACATG CTCTCATTGC CTCCGCGCCA TCACCCATCT TGCCTCAATC TAAATCCTCT
CTCCTCACAT TCATCCGTGA ATCCCAAATC CATCTCCAAC AGACTATCCT TTCTCCCCTT
CTCACCCAGC TCGACACGTA CCCTTCTCTC GCGGTCTGGA TCAAGGCAGA TAAACAGACA
AAAATCAGAA AGGGAGAACT GTACGTACCG CAGTTTAGTT TGAGTCCGAC GGATGTGATC
ACGAGGACGT CGGAAGGGCT GTTGGATCTG TTGAGGGTGT TTGAAGTGTA TGGCGGGGAA
AAGGCGTTGG GGTGGAGTTT GGGGAGTTTG CCGTTCGTCG AAGGGATGGG ACATGCTGTC
GCTCTTGATT GCCTTTCCTC TTCCAGAAAG GATACGAATA AGGAACCTTC AACCTCGATA
TCCGACCCAA CGACAACAGC AACATCGACC TCAGTATCAC TCGCACCCAT CCCTTCCTCT
ACCCCCGCGC CTACACCAGA AACGATCCAA ACCACCTGGA TATCATCTCT CACCCTCTCC
CTCCTATCAC ACTTTACCTC GTACACCCTA CCTTCCATCC AGCACCTTTC TCAGGAAGGA
CAAGCACAAC TGAAAGAGGA TTTGGGATAT TTGGAGAATG CGGTGAGGGC ACTAGACGTG
GAGTGGAGTG AGCTGGGTGA GTGGGCGAGG GCGGTGGAGA TGGAGGAAGA GGAATGGAGG
GAGAATGTGA AGCGGGAAGG AAGGGAAGGG GCTTTGGCCG CTGTGGGGAG GATGAGGGGA
TGGAAGTTCT GAAGGGCTTA TTTTCTGAGG TGGGCAAATT TCCCGTTCGC TACACCATAA
AAGAGTATTT TGCTAATGGA CCCGTTTTCA ATGCTTAGTC CATCAGCGTC TTTAATTC
 
Protein sequence
MSTPEAGDQP NSSLSALTSS IDNTSSPVDF LNSLLAPLLP PSLPPPNKPQ PPSLQPIDTA 
LNDLLTQLSL LSQDTASAIE QGMSDVSRTV PRLGYDLQFM RESANGLSVS LGMVQGRVAR
QADYEMPNNK FPGGEESEAV KAFRALEKIT HLDKLKTRLE SARDTLREAE SWSTLESEIT
TLISEKEYAK AGQRLAEASR SMVVFKNEPA EWEERKRLLV SLGDELERVA GEALRESLKK
DDGVDEVRAF WEVFMDMERE EEFKGWYFKE RGRGLLEAWK EPLVEEGQGE NSSKLSDFLP
KFYSLVLQTL SAELSYIPLV FLPESSPSIL ASFFQSTLDS LDPTFSNRLA AVADYHGPGA
LPELVKAWEA TVDLGAGVQG LIDKIIFNTQ GGLLSGGAGE IDVESPATIL TSPGISSSSP
NHPIPRTNSH SHSKRHQSIS RRFSRAPNAT TTSLSPSPGN VDDAWETTLY EPFLDWQSSY
SSLEKRCLEK EVADLKTSWE KANMKQGGKD VMSGMISRTA ELKGRLEEAV ARCRIFTFGF
GAVHLIRAVD TCISRFFDDE RTSILNNAKS KRDNNKQKDK ADELDLDELD DDGGDWSGWQ
VGLHILDSLQ KVAEKLVAME DGLKAELSEY AKMLKAQKGE KWDGQWDGRK ATFGTVSLLQ
QSTLNTADLH ALIASAPSPI LPQSKSSLLT FIRESQIHLQ QTILSPLLTQ LDTYPSLAVW
IKADKQTKIR KGELYVPQFS LSPTDVITRT SEGLLDLLRV FEVYGGEKAL GWSLGSLPFV
EGMGHAVALD CLSSSRKDTN KEPSTSISDP TTTATSTSVS LAPIPSSTPA PTPETIQTTW
ISSLTLSLLS HFTSYTLPSI QHLSQEGQAQ LKEDLGYLEN AVRALDVEWS ELGEWARAVE
MEEEEWRENV KREGREGALA AVGRMRGWKF