Gene CNA01520 details

Gene Information

Locus tag: CNA01520
Symbol:
ID: 3253644
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 410561
End bp: 412849
Gene Length: 2289 bp
Protein Length: 472 aa
Translation table:
GC content: 47%
IMG OID: 638252486
Product: conserved expressed protein
Protein accession: XP_566624
Protein GI: 58258423
COG category:
COG ID:
TIGRFAM ID:
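
The reported gene length can be checked against the start and end coordinates. The sketch below assumes both coordinates are inclusive, which matches the 2289 bp figure in the record; the function name `inclusive_length` is illustrative and not part of any database API. It also compares the gene length with the nucleotides needed to encode a 472 aa protein plus one stop codon. The difference would be made up of introns (the record marks the gene as spliced) and any flanking untranslated sequence.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both counted."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values taken from the record above.
gene_length = inclusive_length(410561, 412849)

# 472 codons plus one stop codon, three nucleotides each.
coding_nt = 3 * (472 + 1)

print(gene_length)              # 2289
print(gene_length - coding_nt)  # 870 bp not accounted for by coding sequence
```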


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CATGTCCAAT CTGTCCACTT GCCGGGTCCC GGAAACCTGG ACTTCTTCCC GCTTGGCATG 
CCTCATTTCT GCTTCTAAGA TCGAGCCTTG CATTTTGAGG ACTGAAAACC TGACGTCTGA
GGAGATAGCA GCAGCAGCAG ACACGGTTAA TAAAGACGTC AGAATAGCGG AATTTCGATC
AAGGCTCGAG AAGTCGTATT CCTGTTACAC CTTACATCAA GAAAGCTGTA TCTTATAACA
GAAGTTATTA TCCCACTTCC ATCTTATTCT CACTGTGGTC TCTACAGGGC ATTTTAATTT
CTGGACAGTG CTAATTTTTG AGCAGTGCTT TCTTTACCGT ATCTTACGCT CTCTCATTTC
GGCCTTCTTC CTTCTCTTCT TCGATGGTCA TGTTCTCGCT CGACGCGTTC ATATTTTCTA
CGATTTGCCT GCTTTCACAT CTTCCCGCGG CTTTCGCTTT TGGTGTACAG AGCTACCCCA
ATCTTTTCTT TGACGCGTCC ACAATTGTGA ACAACGAATG GTTCAATGGC ATTGATTCCC
GATGGGCAAG GGCGATGAGT GAATACTGGG CTCAGACATT GATACAATAT GGTCCTTGGT
GTAAGTGTTG TAGTCTAGGA CGCGTGACTC GTAGCTGCTG ACTGAGGAAC GTCCAGCTGT
GACGAACAAG TCAATCGCTG CCAAATCCGG AGACAAGAAG TAAGTATAAT CATATTTGTT
AAGCTGTATA TTGACAACGA TGCGCAGAGA TTACCTTAGT TGGGCTGTCT ACCATTGGCC
AGATTGCTCA AACGTACAAA ATACCACCGA GTTGACTACA GAAGAAGGTG AGCAATGATT
TAGCTCCACT AATTCAAGGT CATACCTGAC ACTCGTGCCA CAGTGTGGGA TGAATGTGAT
TATGTTGTGC GGGACGGTCA GGTGAACCCT GATACAGAGC TAGGTAAGAT CCATATGGAT
TTGCTGGTCG TCAATAATGT ATCTGATGGT ATTCCTTTTC AGTCAAAGAT AAGGAGGCTT
TGCAAAATAT GAGCAACAGC ATCTACCTCT CTGCCATGGC CTACCTCCAC ACCGGTGACT
CGCAGTACAC AACGCACATC AATCACGCGC TTCACACTTT CTTCATCAAC AATGAAACTC
GTATGAATCC CAATCTCAAT TATGCCCAAG TCATCCGCGG GCCAGGCGAT CAAATGGGTC
GACACACCGG TGTGCTTGAT TTGGCGTGTA TGGCGAAGAT TGTCAGTGGC GTGCAAGTCA
TGCGGTCAAT TAAGCCAGTG GAATGGCAGC AAGAGACCGA GACTGGGATG ATGCAGTGGG
CTTCGGAGCA GTTAGATTGG CTTTACACTA GCGAGCTGGG TCTCGCAGAG TTGAACGCTA
CAAAGTGAGC CATTTTCTTT GTATATAATA TCACATGTAC TGATCAGTGT TTCAGTAACC
ACGGTACTTT TGCCGTGAAC CAGGTATGCG CCCTTAACGT TCTTCTGAAC CAGAGTGACG
CATGTGCCAC TGCTCTTGAT GATTTCTACA ATGGCATCTA CTTGGACCAA ATTGATGCCA
ACGGTGATCA ACCGCTTGAA TCTGCGCGAA CCCGTCCATA CCATTATCGA GCGTACAATC
TTATGGCTCT TGTTACGAAC GCGCAGTTTG GTGACTTTGT TGGCTTATCA CCATCTGCAT
GGGAGCGCCG AAGTGCAGCG AACGCCACCA TAGTTGATGC TGTCCACTAC GCGATGCTTC
AGAATTACAC GACCTCGGAT GAATTAAATC AAGAAAAAGC CCTTAACCCT GTTGTCGCTG
CTATATCTGC CAAATACGGT GATCCGGATA ACAAATACGC CGACTTCTTG AAGAGTACCG
ATCCGTATTT CCCAGGTCAA CCTTGGTTTG CATTGAACGA AGGTCTGAGC GATGCGGGCA
TCAAACAAGG GCTCCTGGAG ACCACATATG GGCCAGTGCC TCCTCAGCCA ACTTTAGGGG
CTGGGTCACC GGACGACCTG GAGTATGGGA AGAAAAGAGA CTGGAAGCCA AGAGGGGCAG
GATGGCCAAC AGGGGCACCT AGAGCCCCGT AGATTACGGG GAAATAAGGA GGTTTACTTG
GGGTGTGACG GGATGGATGG AAGGCTATGG AATTCATATC TTGAAACGTC TAATGATGGA
AGGAGTAGTA TTATCTTGGA TCATGGACTT TTCAGGTATC TCTGTCATAC AAGGTCAGCG
ATAACATATT GTTGTATATT TTATATCCCT AGCACATGAG ATGCACTCGC GATGTACGTC
ACCTTTAGT
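
The sequence above is printed in groups of ten bases, so it has to be joined before any length or composition check. A minimal sketch of that step, assuming GC content is defined as the share of G and C among all bases; the helper names `clean_sequence` and `gc_percent` are illustrative. Only the first line of the gene sequence is used here, so the printed GC value applies to that line and not to the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as printed in the record.
first_line = "CATGTCCAAT CTGTCCACTT GCCGGGTCCC GGAAACCTGG ACTTCTTCCC GCTTGGCATG"
seq = clean_sequence(first_line)

print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC share of this line only
```

Passing the full sequence block to `clean_sequence` gives a string whose length can be compared with the reported gene length, and whose `gc_percent` can be compared with the reported GC content.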
 
Protein sequence
MVMFSLDAFI FSTICLLSHL PAAFAFGVQS YPNLFFDAST IVNNEWFNGI DSRWARAMSE 
YWAQTLIQYG PWSVTNKSIA AKSGDKKDYL SWAVYHWPDC SNVQNTTELT TEEVWDECDY
VVRDGQVNPD TELVKDKEAL QNMSNSIYLS AMAYLHTGDS QYTTHINHAL HTFFINNETR
MNPNLNYAQV IRGPGDQMGR HTGVLDLACM AKIVSGVQVM RSIKPVEWQQ ETETGMMQWA
SEQLDWLYTS ELGLAELNAT NNHGTFAVNQ VCALNVLLNQ SDACATALDD FYNGIYLDQI
DANGDQPLES ARTRPYHYRA YNLMALVTNA QFGDFVGLSP SAWERRSAAN ATIVDAVHYA
MLQNYTTSDE LNQEKALNPV VAAISAKYGD PDNKYADFLK STDPYFPGQP WFALNEGLSD
AGIKQGLLET TYGPVPPQPT LGAGSPDDLE YGKKRDWKPR GAGWPTGAPR AP