Gene PICST_67940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_67940 
SymbolCDB4 
ID4839620 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1482492 
End bp1483911 
Gene Length1420 bp 
Protein Length383 aa 
Translation table12 
GC content44% 
IMG OID640390935 
ProductCurved DNA-binding protein (42 kDa protein) 
Protein accessionXP_001385287 
Protein GI126137527 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0024] Methionine aminopeptidase 
TIGRFAM ID[TIGR00495] 42K curved DNA binding protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACCA AGACTGGTAA GTAGCGAACG GAATATTCCG AGAGAAATGG CTATTTTAGA 
GTGTTGTGAA ATTGGAGAGA AATAGTGAAA CTTGTCCAAT ACCTCACATA CTGAAAATTA
TGTCCAACTT TTCATTACTG TTCATTGATA CTAACATTCA CAGCTCCCGC TGCTACCCCA
GACTACACCA TTGCCAACTC CGACGTTGTA TCCAAATACA AGACTGCCGG AGAAATCACC
AACCGTGTGT TGGCTCAAGT CATTGCTTTG CTTGTTGATG GCGCTACCAC CTACGAAGTC
TCTTCCAAGG GTGATGAGTT ATTGAACGAA GAATTGTCTA AGATCTACAA CTCCAAGAAG
GCTTCCAAGA CTCCAAAGGG CATTGCATTC CCTACCTGTG TGAATCCTAA CCACATCCCA
GCCCACTTGG CTCCTGTGAG TGAAGATGAT GCTGGTAACA TTACCTTGAA AAACGGCGAT
GTAGTTAACG TGATGCTCGG TGTCCAGCTT GATGGGTTTC CATCCATTGT AGCTCAGACT
ATTGTCATTG GAGCTACTAA GGAATCTCCT GCTGAAGGTA ACAAGGCTGA CTTACTCCAC
GCTGCCTGGA CTGCTTCTGA GGCTGCTATC AGAACTTTGA GACCCAAGAA CAAGAACTGG
GACACCACCA ACGTTGTAGC CAAGGTCGCC AAGGAATTCG ACACTACCCC AGTCGAAAGC
ATGTTGTCTC ATAACCAAGA AAGAAACGTG TTGTACGGTC CTAAAGAAAT CATCATCAAC
CCTACCAAGC AGAACAAGAG CCAGATGGAA ACCTTCAAGT TTGAAGAAAA CGAAGTCTAT
GGCTTGGACA TCTTAATCTC TACTTCTAAG GATGGAAAAG TCAAGCCTTC TGACTACAGA
ACCTCCTTGT ACAAGTTGAC AGGTAACAAC TACTCTCTCA AGATGAAGTT GTCGCACAAG
GTTTTGGCCG AATTCAAAGC TAAGTGCAAC AACCAGCCTT TCCCTTTCAA CATCAGAAAC
TTGGACGAAC CTAAGAAGTC TAGAGGTGGT TTGGCTGAAC CTTCAAACCA CAAGGTCATC
TTGCCATACG ATATTGTCAC CGAAAAGGAA GGCGAATATG TTGCCCAGTT CTTTACGACA
GTTGCTATCA CCAAGAACGG TCTTGTCAAG TACACCCAAC CAGAGTTTGA CCCTGAGCTC
TACAAGACCG AGAAGAAGGT CGAGGACGAG GAAATTGTGC AATTGTTGAC TGAGCCTTTG
AGAATCAAGA AGCAGTCTAA GAAGGAAGAA GCTAAGTAGC TCTGTAGCCT TCCAAAATTG
ACATGACATG ATAGTATCAA AACATCGAAT ACCAAGATAT CTCCAGTATC CAACATTCAT
ATATCCAAAA TATCAGAATC CAAAATACGT ATCTAGCCAG
 
Protein sequence
MSTKTAPAAT PDYTIANSDV VSKYKTAGEI TNRVLAQVIA LLVDGATTYE VSSKGDELLN 
EELSKIYNSK KASKTPKGIA FPTCVNPNHI PAHLAPVSED DAGNITLKNG DVVNVMLGVQ
LDGFPSIVAQ TIVIGATKES PAEGNKADLL HAAWTASEAA IRTLRPKNKN WDTTNVVAKV
AKEFDTTPVE SMLSHNQERN VLYGPKEIII NPTKQNKSQM ETFKFEENEV YGLDILISTS
KDGKVKPSDY RTSLYKLTGN NYSLKMKLSH KVLAEFKAKC NNQPFPFNIR NLDEPKKSRG
GLAEPSNHKV ILPYDIVTEK EGEYVAQFFT TVAITKNGLV KYTQPEFDPE LYKTEKKVED
EEIVQLLTEP LRIKKQSKKE EAK