Gene CNC04850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC04850 
Symbol 
ID3256119 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1468599 
End bp1470080 
Gene Length1482 bp 
Protein Length423 aa 
Translation table 
GC content48% 
IMG OID638255704 
Productconserved hypothetical protein 
Protein accessionXP_570018 
Protein GI58265724 
COG category[S] Function unknown 
COG ID[COG5542] Predicted integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCTTACCAT CATTGCCCCT TCCCGCAAAA CATGCCTCAG CAATGGCGAA GCTGTCACCT 
CTCACTCTGA TCTTCATCGC TGCCTGTCTT TCACGCATAC TTCAACTCAC AATCCTCTCT
GGCCTCAGCA AGGCTCTGCC TCTGTTCGAT ACATCACCAT CTCTCCTGCT CTCTTCCCCA
CCACCCGCCT TACGATGGGA TGCTATTCAT TTTGCGTCTG TAGCTTACAA TGGGTACGAA
TATGAGCAAC AGGTCGCTTT TCAGCCAGGA TGGCTTGCCG TGATGCGACT GGCGGGCGAG
GGTGTGAGGT TTATAAGGGC AGCATCAGTC GTAGAACTGA AGGATGTGAT ACTGGGAGGC
ACAATCGTGG CTAATGTTGC CTTCGTGGCT GCGACCTTGG TGCTTTACAA GTAAGCATGG
ATTTCAAGGG GTACGGCGTA GCTTATGCAC ACCGTGTAGA CTGACGAAAC ACATCTTCAA
CCCAACGTTC GCATTTCTTA CATCCCTACT CTATCTCCTA CCGCCCACGG CCACTCCTTC
AGCACCCTAT ACAGAACCTA TCTACTCTCT TCTGACATTC TCAGGCATCT ATCTTCTGTC
TATCAAGCGA CAAATGGTAC TTGCTGGTCT TTGTTTTGCA GGGGCAACCA CCATCAGGTC
CACTGGCATT TTCAACTCAA TCACGCTCAT GTGTTTCGCT GTTTTCGGTG ATGCACACAT
ATTCGATCTC GACCCTAAGG ATTACTGTAA GGTGAGGGGT GTCTTGTTGA GCTTTGGTGC
AATATTTACT GACTCTTATT AGATTCGTAA AAAATTGAAG CCTTTTCTGT CGGCAATCCT
CGTGGTCGCG CCATTCTTCA TGTTCCAGCA TTACACTGAG ACTGTATTCT GTACGAGAGA
ATTGAAGCGG GCAAGTACTG CTCGTCCATG GTGCAGTAAC AGCCCACCAG TGTCTTATGG
TTTCGTTCAA AAGCTGTACT GGTAAGTGTA CATATTTTCA TTCAATTTTG ATATGCTAGC
TGAGGATGGG AATTTTAGGA ATGTTGGACC GTTTGAATAT TGGACAGTGT CTCAACTTCC
AAACTTTGCA CTGGCAATGC CTATCCTTTT TTTCTCCTTG GCCGGCGTCG TCAAGTTCTT
CTCCCACTTG GTATCTTCCT CTCAAGTTCT TAATCACGGC ACTGAAGAAA TCCCACCGCC
TCCTATACTA TTCGAGCTCT ATTCTGTCCA TGTTCTGACC ATGGCGCTGC TGTTATTCAC
CAGTCATACT CAGATAACCC TACGGGTCTG CCTAGGTGAT CCCGTGGTTT GGTGGAATGC
GGTCAAATTA GGATTTGACA ATGTTCAAAT TGGCGAGGCC CCCACGGGGC AAGTCAAGGT
GAATAAGTTT GGAAGATACT GGATAGGCTG GACTGTGGTT TGGGGCGCAG TAGCTGCCGT
ATTATGGGCA GGACACTACC CACCTGCATA GAAGTGTACC AA
 
Protein sequence
MAKLSPLTLI FIAACLSRIL QLTILSGLSK ALPLFDTSPS LLLSSPPPAL RWDAIHFASV 
AYNGYEYEQQ VAFQPGWLAV MRLAGEGVRF IRAASVVELK DVILGGTIVA NVAFVAATLV
LYKLTKHIFN PTFAFLTSLL YLLPPTATPS APYTEPIYSL LTFSGIYLLS IKRQMVLAGL
CFAGATTIRS TGIFNSITLM CFAVFGDAHI FDLDPKDYCK IRKKLKPFLS AILVVAPFFM
FQHYTETVFC TRELKRASTA RPWCSNSPPV SYGFVQKLYW NVGPFEYWTV SQLPNFALAM
PILFFSLAGV VKFFSHLVSS SQVLNHGTEE IPPPPILFEL YSVHVLTMAL LLFTSHTQIT
LRVCLGDPVV WWNAVKLGFD NVQIGEAPTG QVKVNKFGRY WIGWTVVWGA VAAVLWAGHY
PPA