Gene CNA03810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA03810 
Symbol 
ID3253422 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1025170 
End bp1028240 
Gene Length3071 bp 
Protein Length682 aa 
Translation table 
GC content45% 
IMG OID638252700 
Product1,4-alpha-glucan branching enzyme, putative 
Protein accessionXP_566719 
Protein GI58258613 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTCTCATCT CTCAAGTCTT CAGTTAATCT TACATTCCAT AGATTTATTC ACTTCAATCA 
CCATGACAGC TGTTTCGTTA TCAGATGGTG AGCACTCTCC GACGCGAGCG CAGTACCCTC
ATTTGCTCAC TTGAGACTCG TTCTCCTCTG AAAGGCACAG CCGTGTTGAA GACTGATCCT
TGGTAAGTCC CGCCTTTTCC TTTCGCAGAG CTTTGCCAAG TTGTCTAATT GCATACCTAG
GTTGGAACCG TTTTCTGGCG CCCTCCGTGA ACGATATGCC GCTTATCAAA AGCAACGTAC
CATTATTGAA GAGCACGAAG GCGGTCTCGC CGAATTCTCA AAAGGCTATA AATCTATGGG
CTTCCAGATT GATAAAAATG GGGGTGTAAG GTATCGGGAA TGGGCCTCTA ACGCGACAGA
AGCGAGACTC ATTGGCGAAT TCAGTGAGTC TAGCTGGTCT AGCTCCGTAT GGTGTCTGAA
ACCAGGTATC GACAATATAA TTTACTCTAT ATCGCAGACA ACTGGTCCCA TACGGCCAAT
CCTATGACAA AGTCTCCTTT CGGTGTATGG GAATGTTACG TACCTCCAGT TTCACCCGGC
GTCTGCGCCA TTCCCCATGA TTCCATGGTC AAGATATCAA TGACACTCCC AGGAGGTGAA
TCTATTGACA GGATTCCTAC CTGGATTACT CGAGTCACCC AAGATCTTAA CATATCTCCT
ATATATGACG GACGCTTCTG GAACCCGCCA AAGGAGCAAC AGTACCAATT CAAACATGGG
CATTCTACTC GGCCGGTAGA GGGATTGAAA ATTTACGAGG CCCACGGTAT GGCCTACCTT
GCTAGACTCA TTGTAATATT GGCTAACTGG TCATCTAGTG GGGATTTCTA GCCCCAATAT
GAGAGTTACC ACATACAAGG AGTTCGAGGT GGATGTCCTA CCGAAGATAA AACAGCTTGG
CTATAATTGT ATTCAGATGT GAGTTACAGC GTTGATTTTG TTCTTGAATC CAAATAATGA
CTTGGCGCAG GATGGCTATT ATGGAGCACG CATACTACGC CTGTAAGTTC TTTTGTGCCC
GGTATGCTGT AGTTTGGACT GACATGCTTT CCTACAGCAT TCGGCTATCA AGTCACCAAT
TTCTTTGCTG CTTCGTCTCG CTTCGGTATG TCTTTGCCCA CCTCTGCTTC TGATTGAAAG
TGCGCTGATC GTGTTTGCAC AGGTACACCG GAAGAACTGA AATCTCTCGT TGACAAGGCA
CACGAATTGG GTCTTACCGT ACTTCTTGAT GTGGTTCATT CCCATGCTAG TAAAAACATT
CTTGATGGGT AAATCATGCT TTCGAGTGCT GTGCTTACCT ATGGCTGACC TTTTCGGCAT
AGTATCAATA TGTATGATGG TTCTGACCAC CTTTACTTCC ATGAAGGTGG CAGAGGCAGA
CATGATCAAT GGGATTCTCG CCTCTTCAAT TATGGCCAAC ATGAAGTGCT CCGCTTTTTG
CTTTCTAATC TCCGATTCTG GATGGACATA TACATGTTTG ATGGCTTCAG GTTCGATGGT
GTCACCAGTA TGATGTACAA ACATCATGGT ATTGGTTCAG GTTTCTCAGG TGAATTCTTT
CCATTTCTTC ACCTGGTTTT GCTGATTCAC TATGCTAACA ACTTTTGATC CAGGAGGATA
TCATGAATAC TTTGGGGATT CAGTAGACCT TGAGGCCATG GTATACCTCA TGCTGGTGTG
TTTTCATTAC AAATTTGCAA ATCATACGCC GTCCGCTGAC TTTTCTAGGC AAATGCCATG
CTGCACGAGA CTTATCCTCA TGTTGTCACC ATAGCGGAGG ACGTCTCCGG GATGCCCACC
CTTTGCCGTC CAGTTGCAGA GGGTGGTGTT GGATTTGATT ATCGACTTTC CATGGCCATC
CCTGACATGT GGATCAAGCT TCTCAAAGAA TACACCGATG ATCAATGGGA GATGGGCCAG
ATTGTCCACA ACCTCACTAA TCGAAGGCAC TTGGAGAAAA GTGTTGCATA CGCTGAAAGT
CATGATCAGG CTTTGGTTGG AGACAAGACT TTAGCCTTCT GGTTGATGGA TAAGGAGATG
TGTAGGTTAT ATCCCGCCCA ATTGATATTT ATATCCAGGA TAATGCTTAC ATCGACTCAT
TAGATGACTT TATGTCTGAT CTTTCCCCTT TGACTCCCAT TATCGACAGG GGCTTAGCTC
TTCATAAAAT GATAAGGTAA ACCACACTTC TTCGGTTTCC TTGTAACAGG AGCTGACGGC
CTGTCCACGA AGATTCATTG TCCATACACT TGGAGGAGAG GCGTATCTCA ATTTTGAAGG
GAATGAGTTT GGACACCCTG AGTGAGTGCA GACCTGTTTT TCACATGTTC ACCATTTTTG
ACGCTTTTGG AACAGATGGA TGGATTTCCC ACGAGAAGGC AATGGCAACT CCTTTGCCCA
TGCTCGTCGC CAGTTCAACC TTGTGGATGA CAAGTTGTTG CGTTACAAAT ATCTGTATGA
GTTTGATGTC GCTATGAACT GGCTGGAGGA CAAATACAAG TGGCTCAACT CCCCTCAAGT
ACGTTCTTTT CATTCTGAGC TCTGTCCGAT GTCTGAGCTA ACTATCTGCT TCCTGAAAAG
GCTTATGTTT CTCTCAAACA TGAAGGAGAC AAGATGATTG TGTTTGAGAG AGCCGGACTG
CTATTCATTT TCAATTGTAA GCATTTCACG CTTTCGCCTT GGCCATGTGC TAACACGTCA
TAGTCCATCC CACACAATCA TTCACGGACT ATCGAGTTGG TGTAGATACT GCAGGAGAGT
ACAAGGTCAT CTTAACAAGT GATGAGACTA GATTCGGCGG ACACAATCGC ATTGATATGG
GTGGGAGGTA TTTCACGACA CCCATGGAAT GGAATGGGCG GAAGAATTGG CTTCAAGTCT
ATTCGCCTTC GAGGACTGTA CTCGTTCTTG GGCTTTAATT GGATACTAGC CAGAAAAAAA
TGTTACCACG AAGACGCATA ATCATCTGTA TGTTACCGAA GTTTGAAAGC AATGAAATAA
TTCTGTCTCT G
 
Protein sequence
MTAVSLSDGT AVLKTDPWLE PFSGALRERY AAYQKQRTII EEHEGGLAEF SKGYKSMGFQ 
IDKNGGVRYR EWASNATEAR LIGEFNNWSH TANPMTKSPF GVWECYVPPV SPGVCAIPHD
SMVKISMTLP GGESIDRIPT WITRVTQDLN ISPIYDGRFW NPPKEQQYQF KHGHSTRPVE
GLKIYEAHVG ISSPNMRVTT YKEFEVDVLP KIKQLGYNCI QMMAIMEHAY YASFGYQVTN
FFAASSRFGT PEELKSLVDK AHELGLTVLL DVVHSHASKN ILDGINMYDG SDHLYFHEGG
RGRHDQWDSR LFNYGQHEVL RFLLSNLRFW MDIYMFDGFR FDGVTSMMYK HHGIGSGFSG
GYHEYFGDSV DLEAMVYLML ANAMLHETYP HVVTIAEDVS GMPTLCRPVA EGGVGFDYRL
SMAIPDMWIK LLKEYTDDQW EMGQIVHNLT NRRHLEKSVA YAESHDQALV GDKTLAFWLM
DKEMYDFMSD LSPLTPIIDR GLALHKMIRF IVHTLGGEAY LNFEGNEFGH PEWMDFPREG
NGNSFAHARR QFNLVDDKLL RYKYLYEFDV AMNWLEDKYK WLNSPQAYVS LKHEGDKMIV
FERAGLLFIF NFHPTQSFTD YRVGVDTAGE YKVILTSDET RFGGHNRIDM GGRYFTTPME
WNGRKNWLQV YSPSRTVLVL GL