Gene CNA07770 details


Gene Information

Locus tag: CNA07770
Symbol:
ID: 3253596
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 2143858
End bp: 2147495
Gene Length: 3638 bp
Protein Length: 847 aa
Translation table:
GC content: 51%
IMG OID: 638253100
Product: conserved hypothetical protein
Protein accession: XP_567124
Protein GI: 58259423
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
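
The start and end coordinates agree with the reported gene length if both are read as 1-based, inclusive positions. That numbering convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic follows; the function name `span_length` is illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are both 1-based and inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(2143858, 2147495)
print(gene_length)  # 3638, matching the reported Gene Length

# The 847 aa protein corresponds to 847 * 3 = 2541 coding bases.
# The gene is marked as spliced, so the remaining bases of the gene
# span are non-coding sequence (for example introns).
coding_bp = 847 * 3
print(gene_length - coding_bp)  # 1097
```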


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.622601
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATCCATCCTT CACGCCGTCT CCTCATTCCC CTCCACCTCC CGCCCGACAC TTCCACACCG 
AACGCCGATT CTCTACAGAC AGTCCTTTAA CACCCGACTA GTTGCACTGC GTGCACGAGC
CTCTGCCTTT GGCCCTTTAC CATCAAATTC CTCAAGGACC CCTGTAACTT GCTATAAACA
GGTGAGTCCT CTTTTCTGTT TGCCTCTTTT TATTCTTCCT CATTCTTTCT TCTTCCTTCC
TTCTCCATCC TCTTTCGCGC CGCCCCCGTC TCCTGTGCCA TCATCGCGCC TTTCTCCCAA
CAGCGCAGAA TTCTTCCTTC CTCTTGGCAT CATCCACTTG GCACTATCCA CTATTACGAC
CCATCATTAC GAAAAAGTTT CGAATTGTTC CCCTTATCCA GATCGCTCAC GCCTTTAACG
TCTGATAGTT ACGACCCACA TTCATTCTCT ATCACGATTC GTTCAAGCGG AAATCACAAA
TCAACTTCTT AACCCTATAA ACGAGGCTGG TACTCGTCTG TCTATCATCG ATCATGTCTG
CCTTTACCTA TACCGCCGCT CTCTTTTCCC TTATCTCCCT CCTCCCTTCC GCGCTCGCCG
GTCCGACTGC TGGAACTACC TACTCCGTTT CTCCTAACCA ACACCCTTCC ATGTGTCTCG
CTCCTGCCAA TAATTGGGAA GGAGCAGATG TTGTGCTCAA GGACTGTGAC GAGGATGACA
CCACCTGGTT GTGGACCGGC CAATCGTTTC AGAACACGGC GACCGACCTC TGCATTGATA
TCCGAGACTA CGGGGCGTGG TCAGGAAACA AGGCTCAGGT TTGGGGCTGC TTCTCTTACA
ACACCAACCA GCAGTTCACT GTTGAGGAAT CTATGATTCA TTGGGACAAC TTTTGCTGGG
ATTTGACAGA TGGAAGCTCT TCGGCTGGTA CGATGCTTCA GATCTGGAGT TGTTACAGCT
ACAATGACAA TCAGCAATGG ACGCTTACTG AGATAGAAGA GGTGGATGAG TGCGATGCCA
GTAAGTTATT CCCTCTTGAC ATTCAAGCGT TTTATCAAGT CTAATGGCCG TCGCAGCATC
GGTCACTGAA ACTGCCACCA TCATGTCGAC TGCTACTGCT TCCGTCTCTG ATCTTTCTAC
CGCCACTGCT TCTGTGTCTG CATCAAACAT CACCGAAGCT GTCACCGCCA CCGAATCGCT
CACCGCGTCA GTCAACGCTA CGGACCCTTT CTTCACCGCG TCAGCCACCG ACTCGGGTTA
TCAGGTCAAC GCCACTGCCT CTGCGACTTA CTCTAGCTAC GACGTTAATG CTACCGCCTC
TGCCACTGAC TCTGGCTACG AGTCCATCAA TGTTACTGCT TCTGCTACCG AATCTGGCTA
CGAGTTTGTC AACGCGACGG CCTCTGCTAC TCTCTCCGCC GAGACTTCTA CAGCCACCAA
TAGCAGCATC GGTGAAGGTC TTTGGTCTCC TCACAAATCT TCTTCCGTTT CCTCCGATGA
CTGGTCATCC GAGACTGCTA CCCGTTCCAA CACCGAGTGG TGGGCTACTT CCACTGGTTC
CGACTCTTGG GCGTCTGCCA CCGCCTCTGC TTCCAACCCT GGGCAGAATG CTTCCCAGTC
TGACTCCTGG AACGCCACCA GCACAGCGTC CAACCCCTGG GAGACTGCTT CTTCTCAGGC
CTCGAATGAG ACCTCCACCG ACTCTTGGGG TGCCTCTGCT ACTGCCACTG CTTCCCAGTC
TGACTCCTGG GACGCCACCA GCACAGCGTC CAACCCCTGG GAGACTGCTT CTTCTTCTCA
AGCCTGGAAT GAGACCTCCA CCGACTCTTG GGGTGCCTCT GCTACTGCCA CTGCTACCGA
ATCTGGCTCT TACGGGAATG CCACTTCAAC TTCCACGTCT TCCGCCATCA CTGCCACCGC
TACTGTTGGC ACCATCTCTT CTGGCTACCT CCAGACTAGC GGCACCAAAA TTGTCGACTC
TGACGGCAAC GAGGTGATCC TCCGCGGTAC CAACATTGGT GGCTGGCTCG TCCTCGAAGA
CTGGATGTGT GGTATTACTG ACACATCTGG ATCTTCCGAC CGATTCTCTC TTAGTACTCT
CGAGAATCGG TTTGGTACTG ACCAGGCCAG GACTCTTGTT GAGGCTTGGG CTGAGAACTG
GTTGACTACT TCTGACTTTG ATGAGCTTGC CGCCATTGGT TTCAACGTCA TCCGTCTTCC
CTTCTCTTTC CGAACTGTCC AGAACGCCGA TGGCTCCTGG AGAGACGACG CCTTCACCCG
TATGGACTGG GCAATCAGTC AGGCCAAGGC TCGTGGTATC TACACCATTG TCGACTTCCA
CATGTGGTCC GGCCAGGAGG CTGACTACTC TGCCATCTCT GAAAACACCG ATGAAGGACA
GAGCCAGCGA GATGCTGCTG GCGAAATCTG GAAGAAGGTT GCTACTCATT ATCTCGGCGA
GAGCAGCATC TGTGCTTTTG ATGTTATCAA TGAACCTACT GGTTCTTACG GCGATTATCT
CCAGCAGGAT CTTTACAATG CTGTAAGGTC TGTTGATGCT AACCGTATCA TCATCGTGAG
TGCTTCATTG ACTAGATACG AGTATTCACT AACATTGTCC CAGCATGAAT CAATCTCTAC
CGACCCCTCT ACCTACGGCT GGACCAATGT CATCTACTCT CTTCATGAGT ACGACATGAT
GGGCTCTGAC CTCTCGTCCA ACCAGGCCAC CTGGACTAAT GGTGTTCAAG CTTACATTGA
CTTGTGGCAC GGCTATAACA TCCCCTTCAT GCTCGCCGAG TTCATGGCCG ACGGGTAAGT
TGAGAAACAA TAATTAGACA CATGGACGTT GCTGACATCA TAACAGTGAA ACCCTTGACT
ACATGCTAAA CTCTATGAAC TCTCAAGGCA TTTCTTGGCT CACTTGGGCT CACTCTACCG
TCAACATGGG GCGATGGGGT ATTTGGAACC ACGAGGCTTT CAACGTTGAT GTTTCTTCTG
ACTCTTACGA CACCATCTAT AGCGCCTGGA CCAACATGCC CAGCACTTTC CACACCAGTA
TTTACGACCA GATGAAAACT GCCGCTACTG GCTCTACCAA CGTCAGCAGC AGGAAGCGAG
ATCTCGCCTC TGCTGCGAGG GCTACCAAGC GCTTCCATGG TAGCCATGGT GGTAGGTCAA
GAAGAAATGG TATGGCCCAC GCTGTTAGGG GTGCCGCTGG TGTCTCAATA TAGGCGAGAG
GGAGTCGCTT CTTATTTTCA TCCATCTTTG AGAGAGCATT TTCTTCATTC ACGACATGAT
CTGTTTTATT ATCTAGGGTT TATAGAAACG CATTGTTTAG CATTTTTTGT TTGTTACTTA
GGTTTTCATA CCTTTCTCCC ATTCGCTCCA CTCATAACGC TTGTTTTGCA CTGCGTGCAA
AGCAAAGCGG TATCGAGGAG GAGATGGGAT CGTGTAGAAC ACATTGGATG GACCTCGGGG
GTATCTTTTT TATAAATACT TCGATTACTC TTAATACGTA CGGGCTACGG CGACAGAGGT
CTCTTTGATA GGACTGTTGG GCCCTGCTGC GCAGGTGCTA CGTATTTGTA AGACGTAGAC
GTCCATAGAT CTGGATCTAT GCACCACCCT CTCCTGTG
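
The gene sequence above is printed in blocks of ten bases separated by spaces, so it has to be joined before it can be measured. The following Python sketch shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the bases of seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used as a small sample.
sample = "ATCCATCCTT CACGCCGTCT CCTCATTCCC CTCCACCTCC CGCCCGACAC TTCCACACCG"
seq = clean_sequence(sample)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing every line of the gene sequence to `clean_sequence` instead of the sample is expected to give a string of 3638 bases, whose GC fraction can then be compared with the reported GC content of 51%.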
 
Protein sequence
MSAFTYTAAL FSLISLLPSA LAGPTAGTTY SVSPNQHPSM CLAPANNWEG ADVVLKDCDE 
DDTTWLWTGQ SFQNTATDLC IDIRDYGAWS GNKAQVWGCF SYNTNQQFTV EESMIHWDNF
CWDLTDGSSS AGTMLQIWSC YSYNDNQQWT LTEIEEVDEC DATSVTETAT IMSTATASVS
DLSTATASVS ASNITEAVTA TESLTASVNA TDPFFTASAT DSGYQVNATA SATYSSYDVN
ATASATDSGY ESINVTASAT ESGYEFVNAT ASATLSAETS TATNSSIGEG LWSPHKSSSV
SSDDWSSETA TRSNTEWWAT STGSDSWASA TASASNPGQN ASQSDSWNAT STASNPWETA
SSQASNETST DSWGASATAT ASQSDSWDAT STASNPWETA SSSQAWNETS TDSWGASATA
TATESGSYGN ATSTSTSSAI TATATVGTIS SGYLQTSGTK IVDSDGNEVI LRGTNIGGWL
VLEDWMCGIT DTSGSSDRFS LSTLENRFGT DQARTLVEAW AENWLTTSDF DELAAIGFNV
IRLPFSFRTV QNADGSWRDD AFTRMDWAIS QAKARGIYTI VDFHMWSGQE ADYSAISENT
DEGQSQRDAA GEIWKKVATH YLGESSICAF DVINEPTGSY GDYLQQDLYN AVRSVDANRI
IIHESISTDP STYGWTNVIY SLHEYDMMGS DLSSNQATWT NGVQAYIDLW HGYNIPFMLA
EFMADGETLD YMLNSMNSQG ISWLTWAHST VNMGRWGIWN HEAFNVDVSS DSYDTIYSAW
TNMPSTFHTS IYDQMKTAAT GSTNVSSRKR DLASAARATK RFHGSHGGRS RRNGMAHAVR
GAAGVSI