Gene CNB00520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB00520 
Symbol 
ID3255625 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp151427 
End bp154357 
Gene Length2931 bp 
Protein Length795 aa 
Translation table 
GC content50% 
IMG OID638254705 
Producthypothetical protein 
Protein accessionXP_569062 
Protein GI58263304 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0129615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTTCCCCCT TCTTTTCTCC CAAAAGAATC ACATAGAATA CGACACAGCC GCAAATACAA 
CACTCTGCCA AGTTCTGGTA ATGCTGGGCG CCTCCCATCA ATAACAACTA CCTTTCGAGA
CCTCAAAAGC CTCACACGTT CGTGGAGTGG CAGCGGATGA TCGGCACAAT GTCAACCTCC
CATTTGAATT CTCCTTCAGC TTCCTTTCCA CACCAAGAGC AAGAGCTATT CTCTTTGGAC
TTTCTTGCTC TAACTGGGTT GGACGGCTCC ATCTCAGATA CTAACTCGCC ACAAGCCAAT
TCAAGTCAAT CGAGTCACCG CCAACACGAT CAGAGCGGCG ATGCCGAGGG TCAGACGAGC
AATGAAAGAC AAAACTCAAT TTTTTTCCGA GAACAAAGAA GATCTTCCAA GGATCTCTTG
AACTCAATGG AAGTAGATGA ACATACGGGG AATGCTTTGC GGGGCTTGGG GCACAGTAAT
GATGGGGAAA ATCAACTCCA AGATTTTGAT TCTTTGCAAG CCGCATTGTT ACAACAGCAA
GTGAGTTGTC AAAGAATTAT ATGGTAATAT ACCAAGGCTA ATAACAATTC CAGCTTCAAG
CTATTCACAT GCAGTCTCCT TTAGGCTTTG ATATCCAAAA TCCGACCTAC CCGCTTGGGC
AATTGCTGGC TTCGCCTGCG TTTGATGAAC TGCATTCATC GCCCAATGGG CACTCTGAGC
AACAAACCCT CTCTGTCCAC AATGCTCACC CGAGCTCCCG ATCCGTATAC GACTCGCCTT
TATCACATCT TGCGTTCAAT GCTCACGGCC ACCGTAACTC ATTCTCTTCT ATGTCGACGA
GGAGTCCCTT AGAGCAGTTG CAAAGGCAGC AGCAGCAGTT TCAAGAGCAA CTTGGATTAC
TGCAACGGCA ACAGCTCAAG ATGCAGGCAA CGGCTGCTGC GGTTATGGCA GCCTCCACCT
CACCATACAT TGGGCTAAAT GGTCCATCAT CGACAGGTCC TCGACCTTCG GTGACGCCCG
GCATGACTCC TTCATCCTCG AACACTGGCA TGTTTTCACC CCTCACTTCT CCAGCTCTTG
AAGCCACCAA CTACTCTCAC CAGTCCCATG TTAGCCGTCA CAGCCAACAG TTTTCTCCTG
CCTATGGCTC GCAGCATATT GGCACATCCG GTATCCTCAA CACTGCTCTA TCTTCCCCCG
CCCTCAATCC CATTGGTTCT ACAGGAGGTG CCAATCAAAC CCTTTCGCCT GCTCTCAACC
CTCAAAATGA AGTGAACAGG GGTGACTCCG AATATCTTCA TGCCTTCATG GGTATGCTCG
ATAGCACCAA CAGTGGGAAC AGCACACCTG GTGGTGAACC TCCACAACCG AGCTATCAGT
CACCTTCCAT GACAAGCGCT TCCACAGCTG GCAATTCTAC CATAATATCA TCCCCAGCTC
TCTATCCTCA GGGTGCCGGT ACCGGTCCTC ACAGACAATC CCTTCCTTTC AAATCACGCC
CTTCGCCGAT GCTCAAACCC ACGCATCACC GATCGCACCA CCGCAACTCT GGCTCTGGCA
ATGTCTCCAT TCCTTCCTCA CCAGCAATCC AAAAGTATCA TCCTGACGCA TCTATGCCAC
CTGCTGCTAT GAACTCAGGT CTGCCTCCGC CGGCAATCGA ACACCGACAG ATACAATCCA
ATCTTTCTGT CTCATCGACC TCTACTCCTT CCCCTGTCGA TCTCAGCCAT ATTATGCCAC
CACCACCGGT GCCGACTGGT AAACCCAAGG CACGGAAGGG TGTCTTACCC ATGACTCCAG
CTAGTCTAAT GAACCTTGGT TCCGTGGAGA AGCATGGATC TCAGTCTGTA CCGCTACCAA
AGTCTCAGAC TTCGAGCGAG TCAAACTCAT CGATTGGTAC AGTCACAGCT GCTACATCTT
CTGGAAGTAC AAGCAAGCCG GCTGCCGGGA AGAAAAAAAC GGGTGGTCAA GTGGGGAAGA
AGACGGCAGG AAGTAAGCTT GTACCGGTGG GAACCACTAA AAGAACTTTG GCTATGCGAC
CTCAGACAAC TGTTGGTGTA CGATCAGGTA AGTCACTTCA ATCCTCCGTC AACTTCCTGC
ATCTGACTGA CATACATATT ATAGCTACTA AAGCAGCAGC CGCCGCTGCT GCTGCTGCCG
CCATCGCCCC GGCCGAACCC GAAAACCGCA AAATATCTCA CAAAGCCGCG GAACAAAAGC
GCCGAGATTC TCTCAAAGCC GGTTTCGACG AACTCCGTCT CTTACTTCCA CCCATTAACA
CTGAAGCTCT AGACCCATTA TCCGGCGAGC CTATCCCAGG CTCTTCAGCA CCGAGGTTAT
TACCCAAGTC TTCTCTTGTA CCAGATGATA ACCCTAATCG GGGCGTAAGC AAAGTCGCGC
TTTTGAGGTT TGGGAATGAA TATATCGGTA AACTGCAAGA AAGGGTGGAT AGGAGGGATT
TGTACATCGA GAAGCTGAGA GAGGAAGTTA AGCGGTTAAG AGAAGGAGGG GAAGAAGAAG
ACGTGACGTT GGATAATGGC GAGGATCTTT TGGAGTACGA CTGGAGAGAA GGCGAAGAGG
ATGAGTTTGG AGAATGCAAT GGCGATGACT ATAATGAAGA TGAGAAGGAA GCGGGGGAGG
GGGATGAGGG ATGATATGGA TTTGGAGGAT GATGGCGGAT AGCAGGTCAA GACGAAGGGG
GCTAAATGCT TTTCAACAGA GCTCGGCGCT GAAGACGATC GAGTCCAACT TGACGAAAGT
CAATGGTTTG GAGGCGGGGA ATATCATTCC GAGGATGAAG GACAACCAGG AGTCAAGGAC
TAGGACAATC ACAACATTTT TTTGGAGCGG ATTTGCTTTA GAATGTTGAA AATATATATA
ATTAGTACAC GTTGGATTAG AGAAGGCATC GCTGTACGTA TAGCAATTAG A
 
Protein sequence
MIGTMSTSHL NSPSASFPHQ EQELFSLDFL ALTGLDGSIS DTNSPQANSS QSSHRQHDQS 
GDAEGQTSNE RQNSIFFREQ RRSSKDLLNS MEVDEHTGNA LRGLGHSNDG ENQLQDFDSL
QAALLQQQLQ AIHMQSPLGF DIQNPTYPLG QLLASPAFDE LHSSPNGHSE QQTLSVHNAH
PSSRSVYDSP LSHLAFNAHG HRNSFSSMST RSPLEQLQRQ QQQFQEQLGL LQRQQLKMQA
TAAAVMAAST SPYIGLNGPS STGPRPSVTP GMTPSSSNTG MFSPLTSPAL EATNYSHQSH
VSRHSQQFSP AYGSQHIGTS GILNTALSSP ALNPIGSTGG ANQTLSPALN PQNEVNRGDS
EYLHAFMGML DSTNSGNSTP GGEPPQPSYQ SPSMTSASTA GNSTIISSPA LYPQGAGTGP
HRQSLPFKSR PSPMLKPTHH RSHHRNSGSG NVSIPSSPAI QKYHPDASMP PAAMNSGLPP
PAIEHRQIQS NLSVSSTSTP SPVDLSHIMP PPPVPTGKPK ARKGVLPMTP ASLMNLGSVE
KHGSQSVPLP KSQTSSESNS SIGTVTAATS SGSTSKPAAG KKKTGGQVGK KTAGSKLVPV
GTTKRTLAMR PQTTVGVRSA TKAAAAAAAA AAIAPAEPEN RKISHKAAEQ KRRDSLKAGF
DELRLLLPPI NTEALDPLSG EPIPGSSAPR LLPKSSLVPD DNPNRGVSKV ALLRFGNEYI
GKLQERVDRR DLYIEKLREE VKRLREGGEE EDVTLDNGED LLEYDWREGE EDEFGECNGD
DYNEDEKEAG EGDEG