Gene CNL04780 details


Gene Information

Locus tag: CNL04780
Symbol:
ID: 3254901
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006681
Strand:
Start bp: 340424
End bp: 341861
Gene Length: 1438 bp
Protein Length: 236 aa
Translation table:
GC content: 50%
IMG OID: 638253949
Product: carbonic anhydrase protein, putative
Protein accession: XP_568020
Protein GI: 58261220
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0288] Carbonic anhydrase
TIGRFAM ID:
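
The gene length listed above can be reproduced from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive; the record does not state this, but the assumption matches the listed values. The function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the gene information for CNL04780.
print(span_length(340424, 341861))  # 1438
```

With 0-based, half-open coordinates the same span would be computed as `end - start` without the `+1`, so the convention matters when comparing this record with other annotation sources.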


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.360959
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TGCATATATC CACAATACCC CGCTCCGCGA GCCTTCGTTG CCCGCTATCC ACCGCGAAAA 
TACCCACATC ACACGTTAGA GCCATCTCTC CCATTACACA GTCGCTATAT AGAACTCGAC
TGGCCTCCCA CAGAGCCCAC TTTTCATTCA GCACACTTCC CAGACCCACA TTCGGTCTCC
TTCAGCTTCC CACCATTTGT TCCTCCAGAT CTATAAACAC CACCAGCATC GCCAACATGC
CTTTCCACGC CGAACCCCTC AAGCCCTCCG AGGAGATTGA CATGGATCTT GGGCACTCTG
TGGCTGCCCA GAAGTTCAAG GAGATTAGGG AAGTCCTCGA AGGCAACAGG TACTGGGCCA
GAAAAGTCAC TTCTGAGGAG CCCGAGTTCA TGGCCGAGCA GGTCATGGGT CAGGTGAGCA
GCTCCTGTCT CTGGAACGTT CGAAGCACAT CGTAAAAGCT AATTCATGAT AATGCAGGCG
CCCAACTTTC TTTGGATCGG ATGCGCCGAC TCTCGAGTTC CAGAGGTTAC AATCATGGCT
CGTAAACCCG GAGACGTGTT TGTCCAGGTA TGTTTCTACC CTCCCGACTT TCGGTCATCG
TTGCCTGTGC ACAAGATGCG CTGCGAAATG AAGGGGCTAA CAAAAGCTTG GTTGCCTGTA
GAGGAACGTT GCCAACCAGT TCAAGCCCGA GGACGACTCT TCCCAGGCTC TTCTCAACTA
CGCCATCATG AACGTTGGTG TCACTCACGG TAAGAATCTT CCGCCCCTAT GGTATTGCCA
AGTCTCTATA GACTGACTTG GGACGCCATT AGTCATGGTT GTTGGTCACA CCGGTTGCGG
TGGCTGTATT GCTGCGTTTG ACCAGCCTAT CCCGACTGTG GAAAACCCCG GCGCGACTCC
TTTGGTGCGA TATCTCGAAC CCATCATCAG GCTGAAGCAT TCTTTACCCG AGGGAAGCGA
TGTGAACGAC TTGATCAAGG AGAACGTCAA GATGGCCGTA AAGAACGTTG TTAACAGCCC
TGTAAGCTTG CCCCTCATTG ATTAGGGCTA TGGATCACTC ATAGCAAATG TAGACTATTC
AGGAAGCTTG GGAAAAGGCC AGGAAGGGCG AGTTCCGGGA AGTTTTTGTC CACGGCTGGG
TGAGTTGCAT TCTGTGACTA TGGCATGGCC CATTGACTGA TGCCCATATA GCTCTATGAC
CTTTCTACCG GCAACATTGT TGACCTCAAC GTCACCCAGG GTCCTCATCC TTTCGTTGAC
GACCGAGTGC CTCGAACGTA GATAGGAGTA GTTGTAGAAG TAGAGGATCA TACTGATTAT
TATGGGTGTA TCAATAGATA GGAATTACAG GCTACAGCAT AGATGAGATG GAGGTTTCTT
TCGTTATCAT TTATCAGTAT GTGAATAGAA TAGAATGCAT ACAACGCATA TATTGCTT
 
Protein sequence
MPFHAEPLKP SEEIDMDLGH SVAAQKFKEI REVLEGNRYW ARKVTSEEPE FMAEQVMGQA 
PNFLWIGCAD SRVPEVTIMA RKPGDVFVQR NVANQFKPED DSSQALLNYA IMNVGVTHVM
VVGHTGCGGC IAAFDQPIPT VENPGATPLV RYLEPIIRLK HSLPEGSDVN DLIKENVKMA
VKNVVNSPEA WEKARKGEFR EVFVHGWLYD LSTGNIVDLN VTQGPHPFVD DRVPRT
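
The sequences above are printed in blocks of 10 residues, 60 per line, so spaces and line breaks have to be removed before counting residues or computing base composition. The following is a minimal sketch of that cleanup. The helper names `normalize`, `gc_percent`, and `coding_length` are hypothetical and are not part of any database tool.

```python
def normalize(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace ignored)."""
    seq = normalize(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def coding_length(protein_len: int) -> int:
    """Nucleotides needed to encode protein_len residues plus one stop codon."""
    return 3 * (protein_len + 1)


# First line of the protein sequence, copied from the listing above.
first_protein_line = "MPFHAEPLKP SEEIDMDLGH SVAAQKFKEI REVLEGNRYW ARKVTSEEPE FMAEQVMGQA"
print(len(normalize(first_protein_line)))  # 60
print(coding_length(236))                  # 711
```

Passing the full gene sequence to `normalize` and `gc_percent` gives values that can be compared with the gene length and GC content listed in the gene information. The 711 coding nucleotides implied by a 236 aa protein are well short of the 1438 bp gene length; the difference would be accounted for by introns and any untranslated sequence, which is consistent with the gene being marked as spliced.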