Gene CNI03950 details

Gene Information

Locus tag: CNI03950
Symbol:
ID: 3259743
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 1061919
End bp: 1063205
Gene Length: 1287 bp
Protein Length: 376 aa
Translation table:
GC content: 54%
IMG OID: 638258890
Product: expressed protein
Protein accession: XP_572949
Protein GI: 58271586
COG category:
COG ID:
TIGRFAM ID:
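
The gene length listed above is consistent with the start and end coordinates if both are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal sketch of the arithmetic, with `span_length` as a hypothetical helper name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(1061919, 1063205)
print(gene_length)  # 1287
```

The result matches the reported gene length of 1287 bp.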


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAAATATAAA ACATGTCCGA CAACAGGCAA GACGCTCAGC GAGAACTCCA CACCCAGGTG 
GAAACTAACA TCGAGGACAG CTTGGGCGTG CGTCTCTCTC CACGAGCCCA CTCTCACTCA
TGAATTCACA CTTACCAAAT ACATGCAGTC TGCTCTCGAG TCATCTTTCA ATATCCCTCC
TCCTGCTCCC AAGCCCAAGG TTCCCGTCCC TTCTCAGGAA AAGGAATCCG CTCCTGCAGC
CACGGCCTCT GCACCTGTCC CTGGTCTTGA CGAGTGGCCC CAAACTTTGC AAGGCTACCT
CGACGAATGG CAGGCCGAAT CCGCCACTGC TCGTGCCAAA GCTGAAGCTA CTCGTAAGAG
GTTTGAAGAA GAGCGAGCTG CCGAAGCCAA AGCTCTTGAA GACGCCAAAA AGGCGGAGAA
AACGAATAAG GAGGAAGAGG AGAAGAGGAA GAGGGATGCG GAGAGGTTGA GGCAGGAGTT
GGAAGGAGAG GAGGATGAGG TACAGGGCGG AAAGGGTCAC GGACATGGAG ACAAGAGCAG
GGTCAAGGAA GCTTGGGAAC TTGTTGCCAA GAAGGAAGGG CAGAGCAAGG ACACTCCTGT
GGTTGAGACT GATGTCCGGG GCGTTACTGG CGAAGACGTG TTCGCCGGCC AGGCTGGTGA
GAAGAAGGAG GTCAAGGCTG TGAGTACATT TTGTTTCACT GGCAGTATAG CTCTAACTAG
TATTTAAGCC CGCATACGAC CCCACTACTT CCACCGACCC TATCCCTCCT ATTTTCCAAG
ACCCCAAGCC CGTCGCTCCA GCTCCTGCGC CTACAGAATC GGCTACTCTG TCCCGGCACT
CTGCCACCTC TCAAGCGTGG GAAGAAATCT CTGGCCAGTC TTCCGGCAGT GGAGAGCAAG
TTTCTCCTCC CCGATCCTCT GGCTCCGACG ACATTGTCCA AGTCCCTTCT AACCCGGAAA
AGGCTCCCGA AGCCCCCCGT CCTCCTACGC AACCCCCCTC ACTCACCCTT ACTCTCTTCA
CCAACGCTTC ATCCTTGTCA ATCCCTAGGA TCTTTGCCGT CATCGGTATC AACCTTGTGT
TGCCTTTCAT CAACGGTGTA ATGCTCGGCT TTGGTGAGAT CTTTGCACGG GAAGTCGTGA
AGGTCGGCAA GGCTGTCTGG AGGGGTGAGA GGAGTTTGTT CAACTGGAAT CGGGGTTCAG
GTCTTGGAGG CAGAGGAACA ACGGGTGTCG GATTGAGTGG CGCTGGCTTC TAGAGTATTT
TGCGGATTTT TGACATGCAT CTACAAC
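
The gene sequence is listed in blocks of ten bases, so it has to be joined into one string before its length or GC content can be checked. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and it is an assumption that the GC content reported in the record is computed over this same sequence.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    # str.split() with no argument drops spaces, tabs and line breaks alike.
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the listing above, used only as a short demonstration.
example = "AAAATATAAA ACATGTCCGA"
seq = clean_sequence(example)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing to `clean_sequence` and taking `len()` of the result gives a quick check against the reported gene length.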
 
Protein sequence
MSDNRQDAQR ELHTQVETNI EDSLGSALES SFNIPPPAPK PKVPVPSQEK ESAPAATASA 
PVPGLDEWPQ TLQGYLDEWQ AESATARAKA EATRKRFEEE RAAEAKALED AKKAEKTNKE
EEEKRKRDAE RLRQELEGEE DEVQGGKGHG HGDKSRVKEA WELVAKKEGQ SKDTPVVETD
VRGVTGEDVF AGQAGEKKEV KAPAYDPTTS TDPIPPIFQD PKPVAPAPAP TESATLSRHS
ATSQAWEEIS GQSSGSGEQV SPPRSSGSDD IVQVPSNPEK APEAPRPPTQ PPSLTLTLFT
NASSLSIPRI FAVIGINLVL PFINGVMLGF GEIFAREVVK VGKAVWRGER SLFNWNRGSG
LGGRGTTGVG LSGAGF