Gene CNF00520 details

Gene Information

Locus tag: CNF00520
Symbol:
ID: 3258096
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 178425
End bp: 180630
Gene Length: 2206 bp
Protein Length: 394 aa
Translation table:
GC content: 49%
IMG OID: 638257175
Product: hypothetical protein
Protein accession: XP_571233
Protein GI: 58268154
COG category:
COG ID:
TIGRFAM ID:
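
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene span runs from the start codon through the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 178425
END_BP = 180630
PROTEIN_LENGTH_AA = 394


def gene_length(start_bp: int, end_bp: int) -> int:
    """Span length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally plus one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


span = gene_length(START_BP, END_BP)     # 2206, matches the Gene Length field
cds = coding_length(PROTEIN_LENGTH_AA)   # 1185 (394 codons plus a stop codon)
non_coding = span - cds                  # 1021 bp not accounted for by coding sequence

print(span, cds, non_coding)
```

Under these assumptions, the roughly 1 kb difference between the gene span and the coding length is consistent with the "Is gene spliced: Yes" field, since intron sequence would account for it.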


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAGGA AGGGCAGCTC GTCAACGCCG TGCAAAAAAT CAGGACAAAC GCTCAGCCCG 
GCTTCTAACG TATATGCCAG CAACTCGTCA AAGAAGAAGA GCGAAAAGAG GGAAGGAAGA
GACGAAAGTG GGGATGTAGC CAATTATGAC AGCTCAGACA TGACAGATGG CAGTGAATTT
GAGAGTCCAG TGGAAGATGA AGAAGACAAT GATCAGATAT TCTATATCGG TTAGTGCACA
GGGCAGGCAG TATATTGATC AGCTCCTGAC TATCAGATCC ATGTAGATGC AATTATGTAT
GCTCATTTTC GAGATACCAG GTCACGTAAA GATGGACTGG GATGGTACGG ATGGGTGAGT
ATCAAAGTGG TTTCGGAAGT CGAAACTGAC AATGAGACTT ATGTAGCATT ATGGGGTGAT
GGCAAGTGGG CGAGCTACAT GATTCAGATA TGGGACTAAC TTCCGACAGT GGAAAGGTTA
CCTAAAATCC TTGTCGGATA CGTGAGCAAC TGTCCTCTAT CAAGCTGATT TAGTGCTAAA
TCATTATGCA GAGAGGAACC TCTGAGTTCA TTCGAAACCG AACTCAATGT CGCTCCTCTG
GTGACATCGT GTGTAGCTCG TGGGATTCAT GTATTTTAAG CTGACTAGAC TCCCCGTACA
GGTTTTGGCA GGCAGTTGAT AAACCGATAC CAAAAGGGAA ACTGGAGCCA CCAGGCAGGA
AGGGCGACTA TTACGAGATC GCGCCCGATC AGATGGGTAA GATTCTTTCG TTGTTATCTT
CATTTATTTA TCACTGAGAT GTATGCTGTT ATTTCACAGA GGATATTCTT TTAACTTGCT
ACCCTAAACG GACATGGAAA ATCTACAGCC GTCGGCGTGC CAAGCAACGA GCGCACCAAC
TAGCCCCTCC AAAAATCCGA GCTAAACCGT TACGAATTCA GAAGCGTGAC TCCGACCACT
ACAACTATGA ACGGTATAAA AGGAGGAGGA AGGTTATGCA AGAAGCTGAG GAGAAGCAGA
AGAAAGCACA TGGCATCAAG AAAAAAGAAG AAAAAGGATC ACCGGAATGG GCTTCAACCT
CTATCAGTGA TTCAGACTCT ACAGAAATGA GCTTGGATGA TGAAAATAAC GTTCCCTCAA
TGTCCGACAA GCGAGCCCAA GGGAAAAGGA AGGAAGGCCC TAGGAGGACG AAGAAGCACG
TACAAAGTTC CTCGGAGGAC GAGAATCAGA CTGAGCTGAA AGAGACGACC ATCGAGAGGA
AGAGGCGGAG AATTCCTTCA TCCGCGTTAT CGTCTGTGTC ATCACCTTCC GACAGCCCCA
GATCAAGCAA GCAGAAGGAG CAGTCATTAC AACCATCTAT CGAGGTTGAG ATACCCATTG
CGGTTGATCG AATGAGGTCA AGGCAGCCGT CACAAGGAAC AGGTCCAACC CAACTCTCTT
TTGGACAGCT TCAACCTGGG ATTTTTGATC CTGATCCTCC TGTTACGGCT ACTTTCCGGG
GAGGGCTTAG CGATGCACCG CGTACTGTTG CGGCCACTGT TGTCTCTTCG ATATCTGCCA
TCGAACCGTT CCCTGAGACT CAACAGTCAG GTTCCCAGCC TTTACCTTCC TTCCCTCCGT
CCCAGGCTCC TCGCGTATCA ACCCAAACTC AAACACGAAG CTCCCGTCAA CATTCGCCTC
AACCTCAGCC ACAACTTCAA CCGCAACCAC AACCTCAATC GCAACCTCAG CCTCAGCATC
AAGCCCAATC TCTGCCCCGA ACACAGCCGC AGCCTCAGAC ACCGAACTCA CTGTCAGCCT
CCTCTGTCTC AGGGCTTCCG GCTGCCACCA ACAAATCACA GCCAAAATCC ACATCTCTTT
CTGCAAATCA AATCCAAGAA ACAATTGCCG ACGCTAACAA TGCTTCAAAG ACACCAGTGG
CTGCCTTTCC GCAACATAAT AGCAGGCAAG GTAGTAAAGA TAATGCTGGC AACACAGCCG
GTGCCTCTCA AAATAGTGTC GCTCCGATTG ATCAACAGCA GAAGCAGACG ACTACAGGAT
CAGTTGAATC TACACTGAGA GGGACCGTCA ACCGTTTTTC CGAAGGCAGT ACCGGTGAGG
CTTCGAGACC GTCTACCACG CCTTCAACCA GTCGCACTCT GATTCCCAAT CAGTCTGGCA
ACCTGTCCCA TCAGATATCC GTCTCTTCAT TGCAAGAACC TATTGA
 
Protein sequence
MDRKGSSSTP CKKSGQTLSP ASNVYASNSS KKKSEKREGR DESGDVANYD SSDMTDGSEF 
ESPVEDEEDN DQIFYIDAIM YAHFRDTRSR KDGLGWYGWR GTSEFIRNRT QCRSSGDIKR
DSDHYNYERY KRRRKVMQEA EEKQKKAHGI KKKEEKGSPE WASTSISDSD STEMSLDDEN
NVPSMSDKRA QGKRKEGPRR TKKHVQSSSE DENQTELKET TIERKRRRIP SSALSSVSSP
SDSPRSSKQK EQSLQPSIEV EIPIAVDRMS QVPSLYLPSL RPRLLAYQPK LKHEAPVNIR
LNLSHNFNRN HNLNRNLSLS IKPNLCPEHS RSLRHRTHSG ASQNSVAPID QQQKQTTTGS
VESTLRGTVN RFSEGSTVWQ PVPSDIRLFI ARTY
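
Both sequences above are laid out in space-separated blocks of 10 residues, which most tools will not accept directly. The following is a minimal sketch of how the blocks might be joined into one contiguous string and summarized; `join_blocks` and `gc_percent` are hypothetical helper names, and only the first and last lines of the gene sequence are used here for illustration.

```python
def join_blocks(lines):
    """Strip all whitespace from block-formatted lines and join them into one string."""
    return "".join("".join(line.split()) for line in lines).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGGACAGGA AGGGCAGCTC GTCAACGCCG TGCAAAAAAT CAGGACAAAC GCTCAGCCCG"
last_line = "ACCTGTCCCA TCAGATATCC GTCTCTTCAT TGCAAGAACC TATTGA"

head = join_blocks([first_line])
tail = join_blocks([last_line])

print(len(head), head[:3])   # block length and the start codon
print(len(tail), tail[-3:])  # block length and the final codon
```

Passing every line of the gene sequence to `join_blocks` should give a string whose length can be compared against the Gene Length field, and `gc_percent` on that string can be compared against the GC content field.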