Gene PICST_80185 details


Gene Information

Locus tag: PICST_80185
Symbol: CAD2
ID: 4851458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1859869
End bp: 1861161
Gene Length: 1293 bp
Protein Length: 331 aa
Translation table:
GC content: 43%
IMG OID: 640393166
Product: CAD family protein
Protein accession: XP_001387596
Protein GI: 126274586
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
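
The reported gene length can be cross-checked against the coordinates and the protein length. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding region is a single unspliced open reading frame ending in one stop codon (the record lists the gene as not spliced). The function names are hypothetical and not part of any database API.

```python
# Coordinates and lengths copied from the record above.
START_BP = 1859869
END_BP = 1861161
PROTEIN_LENGTH_AA = 331


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Nucleotides needed to encode a protein plus one stop codon."""
    return 3 * (protein_length_aa + 1)


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
print(gene_len)            # 1293, matches the reported Gene Length
print(cds_len)             # 996
print(gene_len - cds_len)  # 297
```

Under these assumptions, a 331 aa protein needs only 996 bp, so the 1293 bp span appears to include about 297 bp outside the coding region.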


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.101414
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GGTTTGGTAT CAATTGATAT TCTCTTGTCT GGCTTTTTCG AATTCTAATT GACACCATTC 
TAACTTCGAT TTCGCAATTA TAATCCGGAG TCGTTTCCCT AATTGGAATT TTTCAAGTAC
TTATAATCGC TGATTTCCAG TTTTTCAGAA TCCCCATTCG GAGTATTTCG CTTTTCATTG
AACTATTCTG CTTTCTGGAT ATTCAATTAA CCTCAATTGC TACTCAATAC ATATTACGTC
ATGTCTGAAG TCTACTTAGT CACCGGAGGT ACTGGTTACG TTGCCGGATT TGTACTTCTC
CAGTTGTTGG AACAGGGTGC AAAGGTCAAA ACCTCGATCA GAAGTTTGGC CAAAGAAGCC
CAATTGAGAG AGTCTCTCTA CTCCTCAAGT GACAAGCTCA CGAAGGAAAT CGTGGATGCC
AACTTGAAGG TCTATCAAGC TGATTTAACC TCTGACGCTA ATTGGCCAGA GATCTTTGAA
GACGTCACCT ACGTACTCCA TGTAGCATCT CCATTTCCCT CTTCTCCACC AAAAGATCCT
AACGATTTGA TTATTCCTGC TAGAGAAGGT ACCTTGAGAA TCCTCGGCTA TGCTGCTGAA
ACCAACACTG TAAAGCACGT AGTCGTGACT TCGTCTTTCG CAGCCATTGG CTTTGGTCAT
GCTGAAGTCA AGCCACTCTA CACTGAAAAG GACTGGACTG AAACGGAAAA CTTGGACCGT
CCTTACACCG TCTCCAAGAC ATTGGCTGAA AAGGCTGCTT GGGAATACGT TGAAGCTAAA
CCAGTCCAAT ATGGCTTGAC TGTGATCAAC CCAGTCTTGG TTATCGGACC TTCTTTGAAG
AAGCAAGTTA CCAACTCTAC CTCCTTGAAC ATCATCCAGG GCTTGATCGA TGGCTCGAAG
AAAAATGGTG TAGATCCATC TTCTGTCCAC CTTGTTGACG TTAGAGATGT TGCTAAGTTG
CACATCTTGG CTTTGACCAC AGAAGAAGCT CTTGGTGAGA GATTCTTGGC TGCTACTGGT
AGCACCCTTA CGTGGGTAGA TGCAGCTAAC ATCTTGAGAT CTAGAATCCC AGAGAAGTAT
GTAGCTAACT TGCCTACAAA GGAAACTGGC CCTAGTGAAA CTCCTAAGTT GATTTCTGTT
GAAAAGGCCA AGAAGACCTT CAACTGGACC CCAATCTCTG ACGAAGAGTC CTTGGTTGCC
ACTGTCGAAG GCATTATCCA AGAAGGAAAG GTCTAGTAGG ACGTTTATAA CAATTTAAAT
ATGCATATAA TGAACAATGT ATGTCATATA TTT
 
Protein sequence
MSEVYLVTGG TGYVAGFVLL QLLEQGAKVK TSIRSLAKEA QLRESLYSSS DKLTKEIVDA 
NLKVYQADLT SDANWPEIFE DVTYVLHVAS PFPSSPPKDP NDLIIPAREG TLRILGYAAE
TNTVKHVVVT SSFAAIGFGH AEVKPLYTEK DWTETENLDR PYTVSKTLAE KAAWEYVEAK
PVQYGLTVIN PVLVIGPSLK KQVTNSTSLN IIQGLIDGSK KNGVDPSSVH LVDVRDVAKL
HILALTTEEA LGERFLAATG STLTWVDAAN ILRSRIPEKY VANLPTKETG PSETPKLISV
EKAKKTFNWT PISDEESLVA TVEGIIQEGK V