Gene CNC01170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC01170 
Symbol 
ID3256420 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp326794 
End bp327965 
Gene Length1172 bp 
Protein Length287 aa 
Translation table 
GC content46% 
IMG OID638255335 
Productconserved hypothetical protein 
Protein accessionXP_569963 
Protein GI58265614 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0647] Predicted sugar phosphatases of the HAD superfamily 
TIGRFAM ID[TIGR01458] HAD-superfamily subfamily IIA hydrolase, TIGR01458
[TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTAGTATCTA CTGTCTCAAT CATTAGCACG ATCAGCTAGT GAACATCGGC TCATTCTTTG 
TCGTCGGGCC AGCTCCATTC ATTGTCTGCA TCCCGCATGC CCTAGTAAGC TCGCCTACAC
TAGCCAATAC CAAGCCAGCA CCATTCTGCA TTCCACTGAC ACTAAATCTC TTGGCAGCTC
ACAGAAACTC AAAGCTCTTC TCATTGACCT CAATGGGACT CTTCATATCG GCTCGGAGAG
CACGCCTTCG GCGGTCAAAG CCATTGAGCG ATTACGATCC GTCCGTATAC CCTTCATATT
CTGTTCCAAT TCAACCAAAG AATCTTCAGC AAGCTTGTTA GACAAACTGA AGAAGATTGG
GTTTGATGTT AAGAAGGAAG AATTGATGAC AAGCCTGAGT GCATGTCGAA TGATTGTTGA
GGAGAAAGGG TTAAAGTTAG TTCACCTTTT TCAATCCTCG TTGCCCGTGA TAAGAATCTA
ACAGTGTTCT AGACATCCTT TGCTCTTAAT GTCACCATCA GCCAAAGAAG AGTTTTCAAC
CCTTCCGCCT TCACAAGGTA TAAATCATGA TGCTGTCATT CTTGGGCTCC ATCCCGACTC
CCTTTCATAC GAGCACCTTA ACAAAGCCTT TCGCGTCTTA AAAGGCGAGC CACTTTCTTC
TCAAGAGAAG AACAATTCGA CAGGAGAACA CAGACCAGTC TTGATAGCCC CCCACGCCTC
CATGTTTATG CAAGATCCGG GGTCATCGTC ATTTCCACCT GGCTTATCGC TTGGTATAGG
TCCATTTGTA AGGGCCTTGG AGGAGGCAGC CAGCGTGAAG GCTGAGATCG TAGGAAAGCC
AACCAAAGGC TTCTTTGAGT TGGCTTTGGA GAAGTTGAAA GAGTTGAGTG GAGAGGATTT
TGAACGGAAT GAAGTAGCGG TAGTAGGTGA CGACGTGGAT AATGATCTCG GAGAAGGCGC
CAGAGAGCTT GGACTAAAAA GGATACTAGG TGAGCATGTA GCTTGAGGGA CGATTCCAGA
CATATGAAGT GTCTGATAAA AAGAGATCAA TTGCTGATGA ATTGTTTACA GTGAAGACAG
GAAAATATCG GACTGATGTA GAGAAGAAAA TCGAGCATCC TCCCGATACG GTGTATGATA
CGTTTGCGGC GTTCGTGGAC GACCTGTTAT GA
 
Protein sequence
MPYSQKLKAL LIDLNGTLHI GSESTPSAVK AIERLRSVRI PFIFCSNSTK ESSASLLDKL 
KKIGFDVKKE ELMTSLSACR MIVEEKGLKH PLLLMSPSAK EEFSTLPPSQ GINHDAVILG
LHPDSLSYEH LNKAFRVLKG EPLSSQEKNN STGEHRPVLI APHASMFMQD PGSSSFPPGL
SLGIGPFVRA LEEAASVKAE IVGKPTKGFF ELALEKLKEL SGEDFERNEV AVVGDDVDND
LGEGARELGL KRILVKTGKY RTDVEKKIEH PPDTVYDTFA AFVDDLL