Gene CNG00310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG00310 
Symbol 
ID3258690 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp73174 
End bp76265 
Gene Length3092 bp 
Protein Length858 aa 
Translation table 
GC content50% 
IMG OID638257645 
Productalfa-L-rhamnosidase, putative 
Protein accessionXP_571766 
Protein GI58269220 
COG category 
COG ID 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGTGA CCATCGTCAG CTTACAAGCT GAACGTCACG AATCCGGCTT CGGTATCGCC 
CATCCTACTC CTCGCCTGAC ATGGCGATTC GGTTCGACAA CGCTCAAAGA CTGGAAACAA
GCTTCTTACG AGCTTATCAT CACCCATCCT GGGAATCATC AGGCCGAGCA CTACATTGTC
AAGTCTGAGC AGTCCGTCCT CGTTTCTTGG CCGTCGAAAT CTATTCAATC GAGAGAGATC
GTGGAGGTCA AAGTTCGTTC AACAGGTACC GATGGGTCGA CAACAAACTG GGCTGGGATC
ACACTCGAAG CTGCTCTGCT TGACCGAGGA GAGTGGAAAG CTAAATTTAT CTCCGGTCCT
CCTCAAGAGG TTGACGCCCC CAAACCGCCT TTCCGTTTAC GCAAGACCTT TGTTCTCAAG
TCTGCTCCTA TTCAAGCTCG ACTCTATGCG TCTGCTCTCG GAGTGTACGA ATGCGAGATC
AACGGGAAGA GGGTAGGCGA CCAGATTCTG GCCCCCGGGT GGACATCGTA CAAGTACCAT
CTCCGTTATC AGATATACGA TATCACCTCC CTCCTCCAGC AAGGCGAGAA TACGATCACC
GCGTACGTCG GAGAAGGATG GTACGCTACC CGTCTTGGTA GACCTGGGAA ACGCAATAAT
TGGGGCAGTC GCTTGGGATT CTTGGGACAG CTCGAAACAG ATGGTGAGGC TGAAGTGGTG
ACGGATGAGA CGTGGGAATG CGTCGATGGG CCTATCAAGA ACTCCGAGAT TTATAACGGC
GAGGTGTACG ATTTGACATA CGACGAGTCC AAGGCGAAGA TCTCCCCTGT CGAAGTCCTC
TCCTTCCCGG AAGCCCAACT CATCGCTTCC GATGCTCCAC CAATTAGGCG AGTCAAGGAA
GTCAAAGCCG TGGAACTTAT CACGACACCT TCCGGCAAAT CCATTCTCGA TTTCGGACAG
AACCTTGTAG GATTCTTGAG GATTGAGACG GATCTGAAAG GGAAGGAGCT ATTATTGAGG
CATGCAGAGG TATTGGAGGA TGGGGAGCTT GGAACAAGGC CGTTGAGAAC GGCGGAGCCG
AATGATAAGA TTATTCTGGG TGGGAAGACG AAAGGATGGG AACCCAAGTT CACTTTCCAC
GGCTTCAGGT GGGTGCATAA TGTGTTTGGA TTTCCGCGCT CTAACAATAT GTTAGGTACG
TTGAGATAGA GGGCATCAGA CCAACCCTCG AGGACTTTAC CGCCATTGTC ATTTTCTCCG
ATATGCGTCG TACAGGGACA TTCACATCGA GTCATGACAT GGTCAATAGA TTGCACGAGA
ATGTTGTATG GGGAATGATG TCCAACTTCG TCTCTGGTAA CTCGGATTGT CATTCGTTGA
GATTTGATCG CTGATGGATT TGCAGTCCCG ACTGATTGTC CGCAGAGGGA CGAACGATTA
GGATGGACGG GGGATATTCA GGTATTCGCA CCGACTGCAA ATTACCTCTT TGACACTTCA
GGTAGGTTTC ATTTTTGCCT TATCTCTTAT CCCACATTTA TGAAAATAAC TAGATCCCAC
TAGGGTTCCT TGAAGGTTGG CTCCAAGACG TGGCTGCCGA ACAGATTGAA TGGAAAGGCG
TGCCGCCTAC CGTCGTACCC TATGTTCCTC CCAACAAATT CAACGACCAA TACCCCAAAC
CCCAATCCAT CTGGGCTGAT GTGGTAGCTA TCGCCCCTTG GGATTTGTAC AACACCTTTG
GTGATGAAAG GATTATGGAG AAGCAATGGG GTAGCATGCG CATGTGGCTG GATGAGGGTG
TGCCGAGAGG CAAGGATGGG CTTTGGTCAG AGATAGCCCC TCAGTATGGT GACTGGTTGG
ACCCGAATGC TCCTCGCAAG TGCTACTTCT TTGGGGATAT GATCAACCAA GCTGACTTTT
GTGATCAGCT CAATATCCTG CGCATGGGCG TACAGATACA CACTTTGTGG CCAATGCCTA
CCTTGTCCAC GTCACGTCCC TCGTTGCGAA AATCGGTAAA CTGCTGAAGA AGGATCCCGA
GGTAGTGAAG AAGTACGAAG ATGATGCCAC CCGATTGCAT AAGCTTTTCC TTGAAGAATA
TACAACATCC ACGGGGCGAG TCGTTTCGGA TACCCAGACA GCTCTTGCTC TTGTTCTCAA
GTTCAATTTG CTCAAAGCAG AACAGATTCC GCGAGCCCGG GAGAGGCTTG AGTTTCTGAC
AAGGTGGGCT TACTTCAAGG TATCAACGGG CTTTGCGGGG ACGCCCATTT TGTTACCTGT
CTTAGCCGAT AATGGGCTAG AGCATATTGC GTACAGAATG TTGCAAGAAA AAGATAATCC
TTCGTGGCTG TACTCTGTGG GTATGGGTGC AACTACTATT GTAAGCATCT TTTTTGTCGC
AACACAGTCA AATTGCTAAT GCGTCAGTAG TGGGAGAGAT GGGATTCGAT GCTCCCCAAC
GGTCGAATCA ATCCTGGTCA AATGACTTCG TTCAACCACT ACGCCCTTGG CGCTGTCGCT
AAATTCATGC ATACCTACAT TGGTGGTCTC TCCCCTTCTT CTCCAGGTTG GAAGTCTGCC
CTCATCAAGC CCTTGCCCGG CGGCACGATC ACCTCTGCTC AAACATCCTT CGACTCGCCT
TATGGACCTT ACGTGTGTAA GTGGAAGATT GAGGGGGATA CAATGTTGGT TGATACGGAA
GTACCGCCCA ACGGAAGCGC GAGGGTTGTT TTGAACGGGA TTGATGAGGT TATTGGGAGC
GGGAAAAAGA GGTTCAAGGT GCCGTATGAA AAAGACAAGA GATGGCCACC CAAGGGTATC
CGAGGGCCGC AAAGTGTGTT CATGCCTGAT GAGTTTGTGC CCTAGACGGA ATTTTCAGTT
TGCAATTGTG TTCTGGACAA TGAAAGGTTG TTAGTGATTA CAGTTCTTAT ACACTATACA
GATAGAATTC TAATCGTCGT CTTTCTTACA CAGCCAGATA CTTGCCAGAC TTGTTTACGT
ACGCGTAGCC TCTGTTTCGT CGACAATAAT GATGAAACCG TAGCCAAACT AATCATCCAC
CGACGTATCA GGTTCCTGCG AGCCAATTAA TA
 
Protein sequence
MSVTIVSLQA ERHESGFGIA HPTPRLTWRF GSTTLKDWKQ ASYELIITHP GNHQAEHYIV 
KSEQSVLVSW PSKSIQSREI VEVKVRSTGT DGSTTNWAGI TLEAALLDRG EWKAKFISGP
PQEVDAPKPP FRLRKTFVLK SAPIQARLYA SALGVYECEI NGKRVGDQIL APGWTSYKYH
LRYQIYDITS LLQQGENTIT AYVGEGWYAT RLGRPGKRNN WGSRLGFLGQ LETDGEAEVV
TDETWECVDG PIKNSEIYNG EVYDLTYDES KAKISPVEVL SFPEAQLIAS DAPPIRRVKE
VKAVELITTP SGKSILDFGQ NLVGFLRIET DLKGKELLLR HAEVLEDGEL GTRPLRTAEP
NDKIILGGKT KGWEPKFTFH GFRYVEIEGI RPTLEDFTAI VIFSDMRRTG TFTSSHDMVN
RLHENVVWGM MSNFVSVPTD CPQRDERLGW TGDIQVFAPT ANYLFDTSGF LEGWLQDVAA
EQIEWKGVPP TVVPYVPPNK FNDQYPKPQS IWADVVAIAP WDLYNTFGDE RIMEKQWGSM
RMWLDEGVPR GKDGLWSEIA PQYAQYPAHG RTDTHFVANA YLVHVTSLVA KIGKLLKKDP
EVVKKYEDDA TRLHKLFLEE YTTSTGRVVS DTQTALALVL KFNLLKAEQI PRARERLEFL
TRWAYFKVST GFAGTPILLP VLADNGLEHI AYRMLQEKDN PSWLYSVGMG ATTIWERWDS
MLPNGRINPG QMTSFNHYAL GAVAKFMHTY IGGLSPSSPG WKSALIKPLP GGTITSAQTS
FDSPYGPYVC KWKIEGDTML VDTEVPPNGS ARVVLNGIDE VIGSGKKRFK VPYEKDKRWP
PKGIRGPQSV FMPDEFVP