Gene CNF03810 details

Gene Information

Locus tag: CNF03810
Symbol:
ID: 3258037
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 1114570
End bp: 1116972
Gene Length: 2403 bp
Protein Length: 619 aa
Translation table:
GC content: 51%
IMG OID: 638257500
Product: hypothetical protein
Protein accession: XP_571338
Protein GI: 58268364
COG category:
COG ID:
TIGRFAM ID:
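
The gene length listed above follows from the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption, not something the record states, but it is consistent with the listed figures. A minimal sketch of the arithmetic, using a hypothetical helper name `gene_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(1114570, 1116972))  # 2403
```

The result agrees with the 2403 bp reported in the Gene Length field.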


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.369935
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGCTC GTACTCGCAC ACGACACCAC CCCAGCGCCG GCACCCGGCC TCGTCCGCCG 
CTTATCCAGA CAAGCCAGGC GGCTGAGCCA GCTGCCGGTG CGTATCTCGT GCTAGCAGGA
GCGCAGTGCT GACGCCGTAA TAGATGGCCG GAGAGAAGAC AAGAGCGCAG AGTGTGAGTT
ATGCGCTAGC GTACGTAGTT GACCCGCGAC TGACTTTACC CAGCCAACCC TTCAAGACCG
CCCACCCCGG CCCCAACCTC CTCCAGGGCG TCCAACCCTC TTCCGGACCT TTCTCGCGTG
GCAACCGCCG CTTATACAGC TTCTATCCTT GCCCGCCCAC CTCCGCCTAC TATCTACGGT
CGGCCTGGTA CTCGTCCATT CTCTCTCGAT CCGTTGCCGT TCACCGACGA AGATATTCCG
TTGCCCAAAC CGTACAACTC CAATTACCCG CATCCGATTA CCGCATACGC CTCTCCTGGG
GTCAATCCAG AGACTGTCAA AGGATCAGGC TATAACAAGC TGCTGCAGGA TATCGTCAAG
GAGGATGTAA AGAGCGCTCT CGGAGATATC CTTTACGACG CTGGCTTCAG AGATCTGTCT
AATTCAGTCT ATTTGCTCTT TTCTTACGAA AGCATGGGGC AAGGAAATGG GATTCCGGAG
AGTCTCTGGA GCAAATTTGA GAATGTCAAC AATGTGAGGC TGCCAGCCGC CGCGGCTGTC
ACTGCGTCGG TCGTTGCTCA GGTCGAAGAA ACCCCTCCCA GATTTACTCA GTCTCAACTC
AACTCAACCT GCAGCAGCGA CTTTAATGCT CATCGCACTG TGGAAAACAA CCAAGCGATG
GGACGCACTC TCAGACTGTT ATCCGGCGAG TCGATTTTGT CGCCGCCTTA TAGCCTGATC
CAGACACAGA CACACACACA GCCGAGAGAC AGGGAAGCAA TGGATGCTTG GAAGACTGAA
ATATGCGCGG CTTGGGAGGC TACCGGACGC TGCAGATATG GATCTAGCTG CCAGGCAAGT
AACGTTCGTC TCCCGCAACT CGCATGCTTG TTGACTGGTT CCGGATTTAC ACAGTTTGCT
CATGGCATTG AAGAGCTCAA GCTTACTCGA CAGTCACTCA TCATACGTGG CCTCGCCCCC
CAATCTCCTC CAACACCTTC TGATATACTC TCCCCTATTT CTCCACATCG TTCATCCGTC
TCCCGCTATC CCGTCACTTG CAGGTCTAGT CAGACCCAGA TCTACCATCC TGCGATCATC
TCGGGGTGCC CTTATGTCAT CAAGGCTAGT GATAGGAGAA TGTCAGTTCC ACATTCGCAG
CTGAGTAGGG TGGCAGAAGA CGAGCTGCAA TTTAACGATT TGGACTTGGG ATTCAGGCGC
CTGTCAGATG TGTCATCTGG CCCTCCTTTC GCCAACACTC AAAACCTCGG TGTGCCGTCA
TCACGTCCTC GTTTCGACCC CCTTCCATCC AAATTCGCTC CTTCACCTGG TGAGGAATAT
CAAGGCTACC TATTCCCTCC CTGCAAACCC ACGTCCCTTT CCTCTGACTC TGCTTCATCC
ATCACATCAT TCAATATTGG CCCGCTCTCA GCTCAGGAGA AACAAAGAAG ACTGGTTTCC
CAACCAAGTA ACTTGACATT GTACACTTCG TCGTCATCTT CAACTGAATC TGTCAGTGGT
GGATCGAGGT TATCCATGTT CTCGGCCTTT GACGATGGAT TGGGCGAAAG TCTGGTCACC
CCAATCGAGG TTGGATGGGA GAACAATTAC CTAGACTCCG CCTCTGGCTC ATCAAGCTCT
AAAACAGGGC TCGACATCCA GTCAAGTAAT AGCGGGACTG GCGATGAGAT CGGTTTCGAA
GCTACTGGAA TGAGAAAAGC CGGCCTAGTC AAGTCGGGAA GTATGGGCAG TGTGGGCATG
ATGGGGTTAC CGGCCCATTC GAGCTTGCCT AGTATGGTGG GGCCGAAGGT GTCAACTACT
TACGAATTTT CGAGTGGCAA TTCCATCTGG CGTTGACCTT GCTCTGCCTT TTTCTCAGCT
TCACGAAAAG AGTCATGTGT ACTTTTCAAA AACGATCTGT GTCAAGTCAT GCCTGTGATC
CTTCTTCCAC ATCTTCAGCT TTTGTTTATT GTGGGATTTT GTATATCTGC GCTGAGTTGT
CGGTCCTTTT TTTCTCGCCT TTGCAAAAAT AGACACTTTA ATAAAAGTGC AGTAATGCTG
TAGCTTCAAT CCGATTGACA GGTTTTGGCA TGATGTTATT GATAGGCAAA AGAAGTGGTT
GTGTCTGATA GCTTTTATGT TCATGTCAAG TCTAGGTACT GGCGCCTTGG ATCTTATTAA
TTATACACGC TCATACTTGT TTTTCCTTTT CTATCACTTC GTCTGTACCT GCTACAGGTG
CTT
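
The sequence above is laid out in space-separated blocks of ten bases, so the whitespace has to be stripped before any per-base statistic is computed. The sketch below shows one way to get a GC fraction from text in this layout. The function name `gc_content` is hypothetical, and the record does not say how its own GC content figure was derived, so this is only an illustration of the usual definition (G plus C over total bases).

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a sequence; whitespace and case are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence above, used here only as sample input;
# pass the full sequence text to get the gene-wide value.
first_line = "ATGTCCGCTC GTACTCGCAC ACGACACCAC CCCAGCGCCG GCACCCGGCC TCGTCCGCCG"
print(f"{gc_content(first_line):.1%}")
```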
 
Protein sequence
MSARTRTRHH PSAGTRPRPP LIQTSQAAEP AADGRREDKS AESNPSRPPT PAPTSSRASN 
PLPDLSRVAT AAYTASILAR PPPPTIYGRP GTRPFSLDPL PFTDEDIPLP KPYNSNYPHP
ITAYASPGVN PETVKGSGYN KLLQDIVKED VKSALGDILY DAGFRDLSNS VYLLFSYESM
GQGNGIPESL WSKFENVNNV RLPAAAAVTA SVVAQVEETP PRFTQSQLNS TCSSDFNAHR
TVENNQAMGR TLRLLSGESI LSPPYSLIQT QTHTQPRDRE AMDAWKTEIC AAWEATGRCR
YGSSCQFAHG IEELKLTRQS LIIRGLAPQS PPTPSDILSP ISPHRSSVSR YPVTCRSSQT
QIYHPAIISG CPYVIKASDR RMSVPHSQLS RVAEDELQFN DLDLGFRRLS DVSSGPPFAN
TQNLGVPSSR PRFDPLPSKF APSPGEEYQG YLFPPCKPTS LSSDSASSIT SFNIGPLSAQ
EKQRRLVSQP SNLTLYTSSS SSTESVSGGS RLSMFSAFDD GLGESLVTPI EVGWENNYLD
SASGSSSSKT GLDIQSSNSG TGDEIGFEAT GMRKAGLVKS GSMGSVGMMG LPAHSSLPSM
VGPKVSTTYE FSSGNSIWR