Gene CNF04120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF04120 
Symbol 
ID3258121 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp1192578 
End bp1195845 
Gene Length3268 bp 
Protein Length372 aa 
Translation table 
GC content48% 
IMG OID638257530 
Productexpressed protein 
Protein accessionXP_571686 
Protein GI58269060 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCAGTCGCAA CTTTCAAAAC CACCTATTGA TGGCCGAGAC GAAACAGGAA TTTTGAAAGC 
TTGGTGGGTT TCCCAAATAC TGCTACTTAC CATATCAAAC TTAGATAGCT GACCTAATAG
CCCCAAACTG TTATATTATA TGCCACTGTA GTACCTGTGG GCTTCAAGGT GGAATACAAC
TGGAGAAGCG AACATGGCTA GGACACCGAC CGCTTGACCA TCTACATGAA ATGGAGAATA
CCCTCCAAAT CTTAATTTTT CACCCTCGTC GGTCACAGAG ACCTCTCCTG TGGCTCCTCC
GCTCCTAGCC TTTGCGGCAA AAGAGATAGA TTTATCGGCC AAATTCAAGC TCCACAAAGG
CCTAGTTCAG AATCACCACC TGCACATCCA ACTATATGCG CAACGCCGCT TTTTCCTAGC
GCGTCCCCTT CACGGCCATG TTCTCATGTA ATTTCCCCCC CCAACGAGAC TTTCCTTACT
CCGATCGTCA TCCAACGCCT ACGCGAGATA CCGTTGCTGT CGCAGTTGCC TCGGTGGGTT
TTCGACATCA TTTCAGGTCT AATTACGATC TCTATTCCCC TGCTATCGCG TATACACTCA
AAGCCAAGCA TGGCTTCTCT CATCGTGCCA CCAGGGATAT GATTCAATCA GCGAATGACC
TACGTCTGCT AGCAGCCATG GACCGCAATA GGGAACATGA GGTTGCAAGC TCATCATCAT
CGGTCCCTCT CTCTCCGTTG CCACCCTTAG CATTCGTCAA CTCCCTGTTA GCACATTTCG
TGCCACGACA GAGTGGCCAG GGGCAAATCC ATACATCATC ACCCATCTTC ATTGTGCAAC
TAGCGATTGC CACTATATTT GGTACGAGAT CTCCCCCATG GAAGAGGGCT ATCATCTCCC
AGATGTGTGC CCATCCTGCA AGGGGCCTTT GAAGGACCGT GATGGAAAAG ATACTTTCAA
CTGGTTCCCA ATTCGAACTA TCAAGGATGA GCTTGAGGAC CTGTTGGCCA TCACAGGAGT
AGAAAGCTTA ATGGCGAGCA GAAGGAATCA GAGCTGAAAA GACCTGATGC CTCGACTGTA
ACGGCAAATC GGGTTTTTAG CCACCAAAGC GATGGCTCGG AATGGGTCCG GTGGCAGCTG
GATGACCCAA CAGCTTTAGT CGCCAGGATA AACCTATCAG AGGATGGGGC CAACATTTCC
AGTGACCTTA ATGCCAAAAA AAGGTCGATG GGTGTTATCA CGGGCCAGCT TGCCAACCTT
CCGCTGGCGT TTCGCGGACG AACCAGCTGG CTCAGTGCTC TACAAATTCT TGCTCCCATT
TGCTATAGAG TTGATGGCTG GCAGGACCTA CGGCTTACGG ATTAAAACCC CCAAGTATCC
TGAAGGTGAG TTCATACAAC GTGTACACAT CACAAAGTTC GATTTTTTGA CATCATAAAA
ACATACAGGT CGCCTAGTGT ACATTGATCT GGGACTCGTA TGCTGTGACC GACCCCTGGC
TTGCGCTTTA ACTGGAGCCC CTCATTATAA GGGCGATAGT TCTCCCTGCT TCCGGTGCTT
GATCAGCAAA AAGGATCTCC TGGACAACTG TACCCATCCA ACTCGCTCGT ATGAGCTCCA
AACAGCCTCT GTGAGGGGAA AGGTAGCCTC ATATCTTGCC GGCTTGCGCC ACCACGATGA
AAACTCAACC CAAAACGTCT TCCATGTCGA CCGTCGTTCC GGTGCCCGCA CGAATATGGA
GGTGAATGGG ACCAAGAGCC TTTGGAAAAA GACGTTCCCA AACGGGGAGC ATCTCTCTGT
GCTTGATCTC TTTCCCTGGT TCAACAGAGC TGTGCGCACC GTCGTCGACT CCATGCACGC
AGTTTTGGAA GGAGTTCTGA AGACGTACTG GTACAAGGTT TTGTGCCAAG GGATGTACAG
TGGTGGGGAA AAAGGCTCGG TGAATGAGCT GGCCGATGTG GTGACCGGAG AATCCAAGGA
TGAAAACAGA GAAGTGGATC CAAATAACGA GCCCGAAAAA TCGACCAAAA AAAGGAAAGA
AAGGTCTCAA GATGTATCAC CGAACCGCCA AAAAAAGCGT TCCCGAAAGG TTAAGTGGGC
TTTCAAGAGC GGCAAACCAG TCTTGACTAC CAAGGAAATT GACTGGTTTA AAGCTCGGAT
CCCGTTGGTA TACAAGCCTA CTTCCCTTGA AAACCTTTCT AGCGGCTTTG GAACCAAAGG
CAACGGTAAG GTGAAGGCGA GCCAGTGGAG AGTTTTTGGG GAGATATATG GGCCTCTCAT
TGTGCCTTTC TTTTTCGAGT TCATAGAACC ATCTTTGGGG GGCAGGACTG CGGAGGAAGG
ACTGAGAGGA TTGCAAATGT CTTGAGAGGA ATTGAGCTTG AGCGAACAGA TGTACTGCAT
ATATTCCCAA TACTTTTGTT CGCGTACGTA CCTTCGTTCT CTCCCTTTCT TTTGTCATCA
TGCTGGCATG AATGTTTAGT TGAAAGGCTC TATATCCCGC GTTACCTTGA AGCTTCTTTC
TGAACAATTG GATTTGGCCA AACAAGGGCC AGGCAATTCC TGAGAGGATT CCGATAAAGA
AACGAAAAAA AGATTTATAG AATTAGAGGA AAAGCCAATA TAAAACCAAG CATGAGTGGT
ATGCCGAGGT GGCATACGAG CTATTCCACG AAGATGAGGG GCTCTCTTGG GTCTGGAGAC
AAGATCCAAA GGTCCCAGAT TCCTACCGAA GCAAATGAAA ATGGGAGAAA AAAAAATAAG
CTGGCTTCGT GAATGATTTC CCGCTTCTGC ACGTCTCCCA GGCAAACACT GACTCCCTTT
CCCAGAATGA GGAGCAAAGT CACCAGGATG GACGGCATTA TGGGTCCAAC TGGTGCCGGC
CTAGATAAGG CCGAAGATAT CCTGCCCTCC AACTCGGACA TGGTTGGAGC ATGGCGTACG
TCAATGGCTC AAAGACTTGC TTACAATGTC AAAAATCTAA CGTCGAAGTT TTCTTGCAGA
GAAGGTCCGT AAGCAGTGCC CATGGTACTT CTTATGGAAG TCTTTAAAAC CAACAAAAAA
TCCTGTCGAT GAGAGTGCCA TAAACGCCCT CTCACGACCA GATACCACTT TACTTGACCG
CAGTAAGTTA CTGGAAGATG TTGGATGAGC GAGAAAATGG GAGCCTGGGA CACGGAGGAT
TCGCAGAGAG GCCGTTTGTG GCGTGGATGT TGCGTGGCTG ATTTTCACAT ATTCTATCAG
ATTCTATAAT AGTTTCTTTC TTTTCTTT
 
Protein sequence
MGPTFPVTLM PKKGRWVLSR ASLPTFRWRF ADEPAGSVLY KFLLPFAIEL MAGRTYGLRI 
KTPKYPEGRL VYIDLGLVCC DRPLACALTG APHYKGDSSP CFRCLISKKD LLDNCTHPTR
SYELQTASVR GKVASYLAGL RHHDENSTQN VFHVDRRSGA RTNMEVNGTK SLWKKTFPNG
EHLSVLDLFP WFNRAVRTVV DSMHAVLEGV LKTYWYKVLC QGMYSGGEKG SVNELADVVT
GESKDENREV DPNNEPEKST KKRKERSQDV SPNRQKKRSR KVKWAFKSGK PVLTTKEIDW
FKARIPLVYK PTSLENLSSG FGTKGNGKVK ASQWRVFGEI YGPLIVPFFF EFIEPSLGGR
TAEEGLRGLQ MS