Gene CNB04750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB04750 
Symbol 
ID3256062 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp1352904 
End bp1355133 
Gene Length2230 bp 
Protein Length511 aa 
Translation table 
GC content47% 
IMG OID638255119 
Productconserved hypothetical protein 
Protein accessionXP_568965 
Protein GI58263110 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0726968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATATTTCGA GTCCACTTCT ACAAGCATAC AATCACAAGA AATGGCTCTC GTGGATGTAG 
AGGAACAGAG CACAGCAGGC CCGAGTCACT CGAGATCCTC CAGCGTGCTG GCAGCAACGG
CGGCAGAGTC GGTATCGTCG TCAAGAAGAG AATCGAGGAG AGGTTCAAAG GATAGAGAAG
TCGGTGATGG GGGAAGGGGT GGAGATGACC ATGTTTGTGA GTTTGATCCA TGGATGTCTA
TCTGCGTATC GCTGACAACA ACAATGCGTA AAGCATTCAA AACGGCTGAA GAGGACATCG
AGCTGCAAGA CAGCGCGACA GCACCTTTAC TTGCGGGAGC TGCAGCCAGC CCTCGTCTGC
TCGATTCAGA AGAGCGTCAT CTTCTAGAGG CTGAAGGACA CAATTCAGTT TCCAGGGGAT
CGATATTAGA CGCGGTGACG AATGTGAGTA TTGAGTCCAG CCGATCAGCC TCTGAGATGT
TGACGAATGA ATCGCGCGAC ATGGTAGATG GCCAACTCCA TTATAGGAGC TGGTATAGTG
GGGTAAGCTG AGAGTGCGCA TATGGCAGAG CTATAGCTGA CCATCCTTTT TGAAGATTGC
CATATGCAGT ATCCGAAGCA GGCTTTGTCA TGGGCGTGTT CCTCTTGATC GCGCTGGCAG
CCATATCAGA TTGGACTATC CGGCTAGTCA TCCTCACCAG TAAATTGAGT GGAAGAGAAT
CTTACACCGA GGTGTGCTAT GGCATATGGT CTGTTTTATC AATGCTGACT TAGGAGAAGA
CAATGTACCA TTGCTTCGGA CCACTAGGAG CGATGGCAGT ATCCTTCTTC CAGTTCTCAT
TCGCGTTTGG CGGGTAGGTC TGTAGTTGCC TATGTGACAT AAGTTAACCT CCATGTAGGA
CGGCTGCATT CCATGTCATC GTGTAAGGTT CCTTGTTCAT TTCCTCCGCG ATGAAAGCTG
ACACTCTACA GTGGTGATAC GATCCCTCGA GTCGTCTCTT ACATCTTTCC CTCCTTCGCC
GAAAACGTCT TCCTTCGTCT ATTTGTCAAT CGCCAAGCAG TCATTATCAT GTGCACCTTG
TTCATTTCTT TCCCCCTAAG CCTGCATCGG GACATTGTGA AGCTCTCAAA ATCATCCAGC
TTCGGTGGGT ACTTTATCGA AGATACAAGT AGATATCCCG GAATCGCTGA CTCGAGCGAT
AGCTTTAGTA TCCATGGTCA TCATTATTGT CTCTGTGCTC TTTCGAAGCG TCGCGGTTGA
TCCATCATTA CGCGGCTCTT CAACTGATGT GTTTTCAATC GTGAAACCTG GTGTCTTCCA
AGCCATAGGG GTGATCTCTT TCGCGTACGC GTGTCATCAC AACAGTAACT ATATCTACAA
AAGTATCAAT GTACCTACTC TCGATCGGTA TGTTCGTCTT GACTCAGAAT GGAATCGATG
CTAATGATGT TTAGCTTTGA CATGGTCACT CATATTTCAA CTGGCATAAG TTTAATAGCC
TGTTTACTAG TTGCTGTTTG TGGATATGTC GTCTTCACCG ATAAAACAGA GGTTTGCGAC
TTGCTTAGCT TGTAGGGCAT CAGTGCTAAC GTTTCATTTA GGGGAACATT CTCAATAATT
TCAGCTCTGA AGATTGGCTT ATCAACATTG CCCGCTTTTG CTTTGGCGCC AATATGTCAA
CAACAAGCGA GTCCATGCTT CACCGTCGTA GATCATATAC TCATCAAAAT CTCAGTCCCA
TTGGAAGTTT TCGTCTGTCG AGAAGTGATT GAAGAGACGT TTTACAAGTC TAAGCCCTTC
AGTAAGCTGC GTCACGTAAT CATAACTTCC TCTGTCATCT TTATCGCTAT GGGTCGTAAG
TCCTTTCCTA TTTGATTTCG CGCCTTACTG ATGCAATGGG CTTATAGTCG CGCTTACAAC
ATGTGATCTC GGTGTCGTCT TGGAGTTGGC CGGTGGTCTT TCAGCTTCCG CTTTAGCTTT
CATTCTGCCA GCCTCCGCAT ACTTTGTAAT GCTCTCTGGC CCTTGGTCTT CCAGAAGGAA
ATTACCGGCA CTTTTGGTTG CCAGTTTTGG AGTGATTGTG TTGGTGTTGA GTTGCGGGTT
GAGCCTCAAG AAAGCCTGGA GCGGAGAGGG TGGGAAGTCA GTGTGCTAAG TTCACAGTGC
TACTAGATAT ATAGATTGAT TGGGTGTATG TCGTATAGTT CGTTGTGTTG TTTGTACGTA
CATATGCACA
 
Protein sequence
MALVDVEEQS TAGPSHSRSS SVLAATAAES VSSSRRESRR GSKDREVGDG GRGGDDHVSF 
KTAEEDIELQ DSATAPLLAG AAASPRLLDS EERHLLEAEG HNSVSRGSIL DAVTNMANSI
IGAGIVGLPY AVSEAGFVMG VFLLIALAAI SDWTIRLVIL TSKLSGRESY TETMYHCFGP
LGAMAVSFFQ FSFAFGGTAA FHVIVGDTIP RVVSYIFPSF AENVFLRLFV NRQAVIIMCT
LFISFPLSLH RDIVKLSKSS SFALVSMVII IVSVLFRSVA VDPSLRGSST DVFSIVKPGV
FQAIGVISFA YACHHNSNYI YKSINVPTLD RFDMVTHIST GISLIACLLV AVCGYVVFTD
KTEGNILNNF SSEDWLINIA RFCFGANMST TIPLEVFVCR EVIEETFYKS KPFSKLRHVI
ITSSVIFIAM GLALTTCDLG VVLELAGGLS ASALAFILPA SAYFVMLSGP WSSRRKLPAL
LVASFGVIVL VLSCGLSLKK AWSGEGGKSV C