Gene CNH00800 details

Gene Information

Locus tag: CNH00800
Symbol:
ID: 3259090
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006693
Strand:
Start bp: 958768
End bp: 961768
Gene Length: 3001 bp
Protein Length: 521 aa
Translation table:
GC content: 47%
IMG OID: 638258402
Product: GPI anchor biosynthesis-related protein, putative
Protein accession: XP_572273
Protein GI: 58270234
COG category:
COG ID:
TIGRFAM ID:
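
The reported gene length can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the 3001 bp figure listed above; the function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given start and end coordinates.

    Assumption: both coordinates are 1-based and inclusive.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record for CNH00800.
length = gene_length(958768, 961768)
print(length)  # 3001
```

Because the gene is spliced, this length covers the whole annotated span and is therefore larger than three times the protein length.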


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.512246
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ACTCCATGGA CCAGCAAAGC ATCAGAGTAT TCTGGCCTAT CACTGGGGTC GATATGTCAG 
AGGGGAAAGT AGTCGGTTGG AGATTGAGAG ATACTCTGTG TGTTGTGGGG ATCGTTCAAG
ATCGGGTATG CCTTTCCATT CTTATTAATA CTGTAAGTTA AATATTGAGT GCAGCTATGG
AACAAGGTCC TTGCCCAAAT TGGTGAGGAA GAAGATTTGG TGGGGCTGGA GAGCATCGGT
CGGGCAGTAT TGGATGCGAC CAAGAACACC GATGTCAATG AGAAGCAGTA TATTTTTTGG
GTCAATAAGG AGCGTATACC GCTTTCATGT TCGTACGTAT GCTTCGAGAA GGGAAAATTA
CCAGCTTGAT ACTGACCTTA TGACCATAGG GTTCCCACAA TTCTTATACT GTATAAACCA
TTGGATTCCT CTCGCCTGCA GTACCTAACC CCATCATCCT CGTCTCCTGA TCTGCACGTA
TCTGGGCAAG ATAGGCATGA TCCTAGGTAC CAGTTAACTA TTGGAGATGA TCAACTATCT
GCCATCGTCG ACCTTGTAAG TCGAGTCAAG CCCTCCATCA TACGGTCTTA GGAAAAGGAC
ACCTAATGCA AGAATAGATT AATAAGACAA GGCATGTTCA ACAGGTTCTT CGTTCGTTAC
AAATGGAGAG CGCTGCAGAG GGAAAGAAAA AGAGAAGAAA GCAAGCATCC TTGCCATCTG
CTCCTCGTTT CCTTTTCGCT CTCGACACTT GCGCACAGAT CACCATCTTT CTCTTCTCCA
TTTCTATCCC CTGTTCAAGC TCCTTCCGCG CTATTTCCAC ATGTAAGTTC AACGGACTTC
TCTACCTACC TCACGATTAA AGAAGCTCAT CATAGGTGAC CTCTTCTATT TGTCAGCGGC
AGATCAATTG TGCACAAGGA TGGAGCAGTC AATCAGAGGA CCTATTCGGT ATTTGACGAC
TCGAAACGAT GGGGGAATCA ATGACAGGGC AGCGCGATAT AATGTGTGAG TCCCATCAAA
CATCAAGTTT ATCCCCCAGC ATTGCTTGGT TTTGCGAAGG GCTACTAGGC ATCGTAAAAA
GGTCCTAACA AGGCATGCTG AGAATGTGCT GACTTTTGGG ACAAGGTTTT GGAATACGGT
TTGGCTCGTT GTAGTGCGTT CCTGCCTCCA ACTGCCGTAC TCAGAGGGTT CAAGAACTGA
TGAAACCTTC TCATAGAACG ACTTGGTCTT GGGATATGTA GCCCACAACC TCATTCGTCG
ACATTCCGAA TGGATTTCCA CCACCACGAG CACCTTCTTC TCGGTAAGCA TTAAATGAAT
CCTATGGCGA AAGTTGTTAA AATACCCGCT GCAAAGTCCT ACGTAATAGA TATGCCCATA
CATGCTTTGA AGTGGTTGAA CGACTGGCCT GTGGGGCTGA AACTCAATAC TCCACTCAGC
CAATTTTTTT GTTCGACGTT TACGTTCCTC ATCCAACGTT GGGGTGGTCC GTTCTTCCTC
TCGTTCTTCC CCAGACTGTT TAGCTTACCA AGATGTTTCA TCTACATAAT TAAACTCCAG
ATTGCGTCAC ACCCTCGCTC CATTCACTTT TACCCCAGCT CATGTACTTG TTATCCATTC
TCTCCCTAAC AGGTTTTACC ACCCTCCTCG CAGCGTCACA CGACATACTC AACCTATTAA
CGCTTCATCT TCTCTTCGGG TATAATGTCA TGAGAGCCGT GTGTGTCTGG CAAATTGACA
GTTTGGGGGG CCTATGGAAT CTTTTCCGTG GTATGGTAAC TCTCCCTTCC TCCCTTCTCT
CCTTTCGCCA TATCCCTGTC ACTGTTTTGT TTTGCTAATC GGAGGTATAG GTAAACGATG
GAATGTCTTA CGGCGGCGGA CGGATTCATA CGAATACGAT ATCGATCAGC TCTTTCTCGG
TACGCTCCTT TTCACAGTAT CAGCATTCCT TTTCCCTACC GTCCTCAGCT ACACAGCTCT
CTTCTGCCTC GTGAGTCTTA TCGAGACAAG CATTTCCTTT GCAGTTTTGT TAGCATAAGT
AAGCTGACAG GAAGATGGAA GACGAGAGGA ATGATATTCA TAATATGTCG CGTTTCGGAA
GTGACCCGAC AGGCGATGAA CAGGTTTCCC ATATTTGAAC TTATATTATG GATAAAGGAA
CCTTCAAGAG TGCCGGGTAA GTGGACATCT TTAATCAAAT TGGGCGATAG CAATGCTTAT
CATCTGAAAG GGGGCTTGAA CATTACTGTG CAAACGGTAC CCTTGGGTGA CGAGGGGGAG
AAAACTGAAG GTAGATTCAT CATGAGGAGA GCATTGGTTT TGAAGGTAAG TATACTTCGT
ATCCGTGAAG GACAGTATTG AAAAGAGGAT GATAGAGTAC ACCCAAAGCA CTCTCGGACA
TTTTATTCCA CCAGTGAGAT TTTGATTGCG GACGACAGGC CGGTACCATC GCCACGTATA
TGCATCAACC GTTTACCAAA GCCATCTCTC TCCGAAATGT ACAAATTTCC ATCTCAAACT
CAAAACAGTC AGGCAACGAA AAACACCTCT ATCCGTCGCA AAGAGGATTG CCAAGGGGGC
GGCAGGTCGG TGACTTAGCA AGCGCAAGAA GTGCAATTGC AGCTATAAAG TGCGTCATTA
AAACGACCTT CGCAAGAAAG AAGGACAAGG GAGAACCGTG AATGGACGTA CGTAGGAGAG
CCGCAAGTGC TGCAATTGCC ATTTGCACCG CTTGAGTTAC ACTCGTTCAC GAGGGTCCTG
CCGTTAGAGG CGCAAGACCC GCAGGAAGAA CCGTGAGAAC AGTCGGAAGA GTTGCCGCTA
TCATTGAAGG TCGAAAGGAA GCCGGGGGCA CTAATGTTTA GTATTGGGGG GGGAATGGGG
AGAAGAAGCG TACTTGGAGC AAGAACAGCT GCTGGAGCCG CAAGCGGAGT TGAGAGCCTA
TGTGTCAAGG TTAGGCGATG AGCGAAGATG GGGACGTACT CAAAGTAGAT AAGATACGTA
C
 
Protein sequence
MDQQSIRVFW PITGVDMSEG KVVGWRLRDT LCVVGIVQDR LWNKVLAQIG EEEDLVGLES 
IGRAVLDATK NTDVNEKQYI FWVNKERIPL SCSVPTILIL YKPLDSSRLQ YLTPSSSSPD
LHVSGQDRHD PRYQLTIGDD QLSAIVDLIN KTRHVQQVLR SLQMESAAEG KKKRRKQASL
PSAPRFLFAL DTCAQITIFL FSISIPCSSS FRAISTSADQ LCTRMEQSIR GPIRYLTTRN
DGGINDRAAR YNVFWNTVWL VNDLVLGYVA HNLIRRHSEW ISTTTSTFFS WLNDWPVGLK
LNTPLSQFFC STFTFLIQRW GDCVTPSLHS LLPQLMYLLS ILSLTGFTTL LAASHDILNL
LTLHLLFGYN VMRAVCVWQI DSLGGLWNLF RGKRWNVLRR RTDSYEYDID QLFLGTLLFT
VSAFLFPTVL SYTALFCLTR GMIFIICRVS EVTRQAMNRF PIFELILWIK EPSRVPGGLN
ITVQTVPLGD EGEKTEGRFI MRRALVLKST PKALSDILFH Q
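
The sequences above are printed in blocks of ten residues separated by spaces. A minimal sketch of how such text could be turned into a contiguous string and used to check the reported length and GC content follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first 20 bases of the gene sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, as printed in the record.
sample = "ACTCCATGGA CCAGCAAAGC"
seq = clean_sequence(sample)
print(len(seq), gc_fraction(seq))  # 20 0.55
```

Applied to the full gene sequence, `len(clean_sequence(...))` should match the Gene Length field, and the same cleaning step applied to the protein sequence should match the Protein Length field.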