Gene CND02020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCND02020 
Symbol 
ID3256991 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006686 
Strand
Start bp544222 
End bp546524 
Gene Length2303 bp 
Protein Length481 aa 
Translation table 
GC content45% 
IMG OID638256136 
Productconserved hypothetical protein 
Protein accessionXP_570228 
Protein GI58266144 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTTGATTGAC CAATATATGA GAGAATTACT CCCCCTTCCA TCTCCTTTTC AATCACAGAC 
AAATCAATTA AACAATCAAC AATGTCGCAA GATCCGTACA CCACCAAAGC ATCCGATGTT
AATATGGATG ACAACACGAG TACCGACAAG AATGTCAGCC TTTCAAGAGT AGTCACCGCT
CTTGAGCCGG AGCAGGTACA GGTAGTAGAT GGTGTTTGGG GTACTCTCGA TGAATCTGCA
CCTAACTATC GAAGTCTTGG CTGGTAAGTA TGCTCTACAT GGGCGGTTTG ATTTGCAGAT
GTCAACAAAT GAGCTGACCA CTGAGTAGGA TAAGAGCCAG TGTTCTCATG ATCAAAGTCC
AAATAGGACT AGGCGTCCTT GCTATTGTAG GCTGACTTCA TCTCGTCCAA CGGTGTCACA
TAGACCTAAT TGCCCCAAAC ATTTTTAGCC AGCTGTCCTG CATACTTTTG GTCTTATCCC
TGCTATTCTT ATCATATTCG CTGTTGCTGT TGCCACCACT TGTATATCTC AATCATTTCT
TCATCCATCT CCTCCCCTCT TCATTGACTG ATCCATAATT AGGGGCGGAT TACGTTGTAG
GCGTGTTCAA GCAGAATCAT CCCGAAGTTT ATACTCTGGC AGAGTAAGTA CACGTCTTCT
CTTTTCAAGA ACCAACATTC TCACTGTTTT ACACCCTTTA CTTCGCAATG GCTGACATTG
AGCTCATAGT GTTGGCTATA TCATGTGGGG ACCTATTGGA CGAGAAGTGT TTGGCGCTGT
CTACTGGGTA AGTATCATTA CGTCTAATAT TTTGACGAGG AAGAGTGAGA AGGAATTAAG
GCTTATTTTT ATGCCAAGAT TCAACTTACT GCTGTCGCTG GGGCCGGCCT CCTCAGTATA
TCTGTAGCTC TAAACGCAAT GTCAGGTCAT GGGACTTGCA CTATCGTCTT CGTCGTTGCC
GCGGCCATCA TCAACATCCT TGTATCTTCC ATTCAAACAC TCGACAGAAT TTCCTGGATT
GGATGGATAG GCCTTGTTGG TATTATGTCG TCCGTTATCG CCCTTGCCAT TGCTGTAAGC
GTTCAAGATC GTCCCAGTGC AGCTCCTGCC ACCGGTGATT GGTCTCCCGA TATCGTTCTT
GTTGGCAATC CCTCCTTTTC TGCTGCTATT GGTGCCCTAT CCAACATCAT TTTCTCCTTC
GCCGGTGCCC CCAACTTCTT CAATATCGTT GCTGAAATGA AAAATCCCAG AGATTTTAAC
AAAGCTCTCA TCTCCTGTCA GACCTTCGTA ACGGCGGCAT ATCTTGTACG TTCTCAGCTT
GACAATCTTT AGAAACTGGC TCACCGTACT AAAAGATTAT TGGCTGCGTT GTTTACCATT
ACTGTGGCCA ATATATTACT TCTCCAGCTC TAGGATCGGC AGGTATTCTT ATGAAGAAGG
TGTGTTACGG TCTCGCATTT CCTGGCCTTG TGGTCGGATG CGTCCTGAAT ACACACCTCC
CCGCCAAATA CAGTAAGCAT TGTCTCAACA TCGCAACAGA TCTCTAGGCT CATTTCGATC
CCAGTTTTTG TTCGCTTGAT GAGGAACAGC AAGCATTTGA GTGCGAACAC TATCCAGCAT
CGAGTTATTT GGATGTGAGT GAATATATTC AACAAGTAAC GACAAATAGT ATCGTACAAA
ACTAATGAAG ACGTTCAGTT CCTGTGTCGT ATTTAATTGT ACCGTCTCAT TTGTAATCGC
CGAAGGTATT CCAATTTTCA ACGACCTCAT CGGACTCATT GGGGCACTCT TTGCGACTCC
CAATGCGATG TAAGCGGTTT CGCTTTCTAG CACCTGGGGG CCTCTAGCTA ATTCCATTCA
GCATCTTTGA GTGTATGATG TATATCTGGG ATGTCTATTA CTGCGCCGAC AAGTATCCTA
GCCAGCATAC CTGGAAACAG CGATCAATTC AAGCATTCAA CGTCATTATT ATGCTCCTCT
CGATATTTGC CATGGTAGCC GGGACGTATG CCGCAGCCGT CACCATACGA GACGACGTGG
CGTCTAACGC CACCTCAAAG CCCTTCTCTT GTGATGATAA CTCGGGATAG CCATGAGAAG
TTGAGTATAG CAAATGTCCT ACCGTGTGAT TATTGGAATT GGTATTTTGC CCTCTTTTAC
CAGCATTTCA AATTAGCATT TCATTCCCCT TAACTTTCAA TTTGTGTCTC CCATACGTAT
GGCTGTATAT GATGTAACTT AGGAGCGTTG TTTGGATAAA AAGGTACTCG CTCGGTTGTA
TGCAAACTCC TTGGTCCTAT CAC
 
Protein sequence
MSQDPYTTKA SDVNMDDNTS TDKNVSLSRV VTALEPEQVQ VVDGVWGTLD ESAPNYRSLG 
WIRASVLMIK VQIGLGVLAI PAVLHTFGLI PAILIIFAVA VATTWADYVV GVFKQNHPEV
YTLADVGYIM WGPIGREVFG AVYWIQLTAV AGAGLLSISV ALNAMSGHGT CTIVFVVAAA
IINILVSSIQ TLDRISWIGW IGLVGIMSSV IALAIAVSVQ DRPSAAPATG DWSPDIVLVG
NPSFSAAIGA LSNIIFSFAG APNFFNIVAE MKNPRDFNKA LISCQTFVTA AYLIIGCVVY
HYCGQYITSP ALGSAGILMK KVCYGLAFPG LVVGCVLNTH LPAKYIFVRL MRNSKHLSAN
TIQHRVIWIS CVVFNCTVSF VIAEGIPIFN DLIGLIGALF ATPNAIIFEC MMYIWDVYYC
ADKYPSQHTW KQRSIQAFNV IIMLLSIFAM VAGTYAAAVT IRDDVASNAT SKPFSCDDNS
G