Gene CNH00030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH00030 
Symbol 
ID3259220 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp1185634 
End bp1188578 
Gene Length2945 bp 
Protein Length543 aa 
Translation table 
GC content50% 
IMG OID638258482 
ProductAlpha-L-arabinofuranosidase, putative 
Protein accessionXP_572197 
Protein GI58270082 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACGATA CCAAAGAACA TAGCAAGAAG CTCAAAAGTA GGCATATACG TGATTCATTG 
AGGGCTCTGG GAGGAGACAA GACGAGAATA TGGGAAAGCA GGGACACCGC TCAAGAGGGA
ACGGAAACTC AGGCACACTA CAGCTATAGG TGAGCTTCGT GTGAAGTCGC GAGCCAAATC
GCCTTTCATC TGATGATCAT CATTCATAAT AAACCGCTCA TCAATTGTGC TACCAGATAA
GTACCGGCCC GCCAGCGTTA GTGTACGGAT TGTGGTCGGG CCGTCGGGTG CCTGTTAGGG
ATACGTACCG ATACACGGCG CCCCTGCTTT GCCAGTGGAA GAAGGCGGTT GCAATTTGGA
CTCTTATCGG CTCAATATTG TCACGCCGCC GCCTGAGCCT ATCTGGCCGA GACAACGCAT
GATTGCGCTT AGCTTATGAT CCGATAACCG TATGAATGCC GCCGACCTCG AGGTTTAGTG
GTTGCAGAAT ACAGCTGTAT GAGGGCATGT AGGGGCCTTG CGACTTACAG CTTATAGCTT
ATACGTCGAC ACTTCTATTC TCACGCTTCA TCCTCCAGCG ATCACCATGG CCGTCTACAG
CCAACACGTT ACAGTGAATC AGTCTCTTCT CCAGCAACAG GCTCCACGCG ACTTTCCCAA
CACTCCCTCC CATCCCAAAA CACTGGCATC AACTCCTATC CTCCTCGATG TCACCCGTCT
CCTCTATCCC TCCGGACCTC CCGTCACAGA CAAGCTCTTC AGCGGCTTCC TCGAACATCT
CGGGAGGTGT ATCTACGGAG GTATTGTGGA TGACCCGAAG GATCCAAGTC CGAAATGGCT
TTTGGAGGTG CAGGATGAAG GAAAGACGTT TCAGAAAGAT AGATTGGGGT GGAGGAAAGA
TGTGATGGGG TGCTTGGCGA AGGATGGGGA GCTCGAGATT CCAATGCTGA GGTGGCCTGG
TGGTGAGTAT CTGCTATTAT CTTCGTCCAT TTGCAAATCG ACTAACCTTT GTGATAGGTA
ACTTCGTCTC CAATTACCAC TGGCAAGACG GTATCGGCCC CATCTCTGAG CGTCCAAAGC
GTGTCGAGCT CGCTTGGCTC AGCGACGAGT CCAACAGGTT TGGCACCGAC GAGTTTATCG
ACTACTGCCG AGCGACGGGT GCTGAGCCTT ACATCTGTTT GAATAGTGCG TGTCTTTCTT
GACCTTGTTG CGCAGTATTG ACGTCTGCCG GTAGTGGGCA CAGGAAGTCT TGAGGAAGCT
TTGGCATGGG TTGAGTATTG CAACGGGACT GGCGATACCC AGTGAGCTTT CTCGCCTTCC
TATCTATCTT ACATCGCGCT TATCGTCGAT CAAGCTGGGC AAACCTTCGA CGCAAAAATA
CTGGAAGGGA CGAGCCTCAT GCTGTCAAAT ACTGGGGTCT CGGTAATGAG AGTAAGTCTG
CTTTTCTTCC CAAGATGAGA TTTGAGTCTG AATTTCTGCT GCAGTGTGGG GTCCATGGCA
AGTTGGCAAC CTTCTCCCTT CCGAGTACGT GAGGAAAGCA AGGCAATGGG CCCATGCTCT
CAAGCTCGTT GACCCTAGCA TCGTATTGGT ATCTTGTGGC GAGACTGTGA GTATATTCAA
TTTGGCAGCA GGGTTGAAAC TGACAATAGG TGTAGGGTGC GTCTGATTGG GATAGAGAGG
TTATCCAGGG TCTTCTTCCA TGGGCTGATA TGCACTCCAT TCACTTCTAG TGAGCAAAAT
CCCTACTCCA TCGCAATCCC CAACTTACAA AGATCTCAGT ACTATGCTCG GCCACGACAA
GATGTCCAGC GTATCTGGTT ATGACTACGA GAAGAACGTC TTCGGGCCAG CCGTAAGTAT
CACCTCTTCC TGCGCTGAAA GCTGCCTCGT TAATACTGAT ACGATGATAT ATAGGCTGCC
GAGAAGCACA TCGACATCTG CAAGTCACTC ATCGACCTCG CTAACATCGG CCGAACCTGG
GAGCGCATGC CAGCGAAGGA TATGAAGATC TGCTTTGATG AGGTGCGCTT CCTATCCTTC
CATCGTTATC CGTAAGTGTA AGAGTTGACG ATGTTGGTAG TGGAACGTCT GGGACGACGT
CAAAGCGCCC GGGTCCAACG GGTTGGAGCA ATGGTACGAC TACACCGACA TGCTTGGCTT
TTGTGCATGG TTGAATGTGC TTGTGAGGAA GCATAAGGAT ATAGGCATTG CATGTCTTGC
TCAGTCTGTC AACGTCGTAC GTCCCATCTC ATGAGCTTCT ACCTTATTCT ATTGAATCAA
AGCTGACAGA TGGAGCTTAG ATCTCACCTC TCATGACCAA GCCAGATGGC ATTGTTCGCC
AGACTTTGTA TTACCCCCTT CAGCTTTTCA GCAAGTACAT GAAGAACGGA CACCTTCTCC
AACTTCCTGT CTTCCCTGAC GTCTAGTATG TATCCGTCCG TACCCATGTC TCACTCGTTA
CTGACAAGCA TGGCAGCACC GGTCCCACGT TCCCTGTCTA CATCCAACAA GGCAACTACA
AACCCTCCTA CGTCGACTCT GTCGCTATAC TCGTAGAAAC GGAGAAGGGA GCAAGCATCA
GGGTTAGCAT TCTGAATAGG CATCCGGAGG TGGATTGGAG CTCGAAGATA GGGTTCTCGG
GTTTCGGTAA GTACATTCTG GAGACGTTAT ATCTGAGGCG TATTGGAACT GACGATATGG
GATAGAGGTC GAGAGTGTCG AGGTTCACGA GATCTACTCG GACGACCTTG CTGCTGCGGT
CAGTTCCCTT TAACATGTAT TTGTACACTG CATCAACTGA AACCTCCTTA CTACAGAATA
CGTTCGAAAA CCCTGACACT ATTATTCCTA AGATTACAAA GAAGAACGCG AAAGAGTGGG
ATGGAGATGT TTTGGTTAAG AAGCATTCGT GGTCCTTCAT CATTTTCAAT GGTCGTTTTC
ATTGA
 
Protein sequence
MAVYSQHVTV NQSLLQQQAP RDFPNTPSHP KTLASTPILL DVTRLLYPSG PPVTDKLFSG 
FLEHLGRCIY GGIVDDPKDP SPKWLLEVQD EGKTFQKDRL GWRKDVMGCL AKDGELEIPM
LRWPGGNFVS NYHWQDGIGP ISERPKRVEL AWLSDESNRF GTDEFIDYCR ATGAEPYICL
NMGTGSLEEA LAWVEYCNGT GDTHWANLRR KNTGRDEPHA VKYWGLGNEM WGPWQVGNLL
PSEYVRKARQ WAHALKLVDP SIVLVSCGET GASDWDREVI QGLLPWADMH SIHFYTMLGH
DKMSSVSGYD YEKNVFGPAA AEKHIDICKS LIDLANIGRT WERMPAKDMK ICFDEWNVWD
DVKAPGSNGL EQWYDYTDML GFCAWLNVLV RKHKDIGIAC LAQSVNVISP LMTKPDGIVR
QTLYYPLQLF SKYMKNGHLL QLPVFPDVYT GPTFPVYIQQ GNYKPSYVDS VAILVETEKG
ASIRVESVEV HEIYSDDLAA ANTFENPDTI IPKITKKNAK EWDGDVLVKK HSWSFIIFNG
RFH