Gene PICST_67866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_67866 
SymbolAZF2 
ID4839292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp950442 
End bp951870 
Gene Length1429 bp 
Protein Length465 aa 
Translation table12 
GC content46% 
IMG OID640390607 
Productasparagine-rich zinc finger protein 
Protein accessionXP_001385190 
Protein GI126137333 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.12306 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGTA AATTGAATAT GATTAACAAC CTCAAGAAAC GGGCCGAGAT CCGTGCAGAG 
ATCCGATCCA GTATAAAGCC TGCCGACAGC TTGGAAGAAG GAGAAGTTTC CAATTCTGAA
AACCACGGCA AAAATGATGA ATTAGATCTT GATGAAGGTG CCAGCTTGTC TCAGCTAGCC
GAGGAAGGTG CAGAGGAAGA TCTAAAGAAC AAAATCAAGC GTGAGCAGAT TGAACAACAG
TTGCTAGGGG TGCACACAGA CGAAGAGGGA GACAGAAGTG GTTCCGAACT AGAAATATAC
TCTGATGCTG GTGATGACGA TTTCATCCCT GAAGTCAATG GGTCAGGCGA CAGAGTCCAC
GGTGGGCAAC ATGAACGAGA TGTTCAGCAC GAATTAAGTG CTCAACAGAT GCGTCAGGTT
GAAGAATTCA GTCGGGCAGT AGATGGGCAA GTTCACACTA TAGACCCTGA TCTCATGCCT
GAGTCTACTT CCAAAAAGAG AAATTTGGAT ATTACCTTGC CAGGGACTGC TGGTCGATTG
GCAGGATCTG TAAATCCATC CAGCGAAGCT GCTGTTGCTG TAGATGCCGT GGCTTCAGCT
GTAGCCTCCG CTGTAGCAGG TGATATAGGC GCTACCATGT CACATGAAGG AAAGGTCAAG
CGCAGACAGA CTACTGCTGT TCGGGAGGAC GAGAAAGTTT GTCCCTATTG TAAGCAGGAG
TTTGACTCGG CAGTAGATTG TAGGAACCAT CGTCGCACTC ATCCCAAGCC CAAGGTTTAT
AAGTGTGGAT TGTGTGACAA GACGTTTAGT CAGATTCCGA ACTTGAGTTA CCACCGAACG
ATCGTCCACA AAGACTTGAG AGTAGTCAAT GGAATTGATA CTACTAGTGT AAATGCGGCA
ACTGGTTCTA GCAACCCTAC TGTTGCAAAC TTAGCTGCTG TAGCTAGTTC CGCAGTGGCG
GCCAGTGTAC CACTTGTGGA TCTTCAGAAT GTGCGAGTTT TCCATTGTGA CGAAGTTGAT
TGCACTTTTA CATATTTGAC ATACCAGGCT CTATTGGCAC ATAAAGAAAA TGACCATAGT
GGAGTTAATG TTAAGCGACC ATATCGTGTT TCGAAGGCTA CAAAAAAACA TGCGTGTACG
TTTGACGGCT GTAACAAGGT GTTCGCAAAG TTTTCTGATT TGACCAGACA CTCACGAGTT
CATTCAGGCG AAAGGCCGTT TGAGTGTACT CATTGCGGAG CTACTTTCAA CCAGAAGTAC
CGCTTGACCA CACATTTACG TTCACATACT GGCGAAAAGC CGTTCTCCTG CAAGTACTGT
GGAAAGACAT TTGCTCGAGG TGATGCTGTG CAATCTCATA TCTTTGCTAT ACATAGAGCC
AAAGGCTCAG CTTTTTAGAG ATATATAAGA TGAATTGAGA ACTACGGGT
 
Protein sequence
MASKLNMINN LKKRAEIRAE IRSSIKPADS LEEGEVSNSE NHGKNDELDL DEGASLSQLA 
EEGAEEDLKN KIKREQIEQQ LLGVHTDEEG DRSGSELEIY SDAGDDDFIP EVNGSGDRVH
GGQHERDVQH ELSAQQMRQV EEFSRAVDGQ VHTIDPDLMP ESTSKKRNLD ITLPGTAGRL
AGSVNPSSEA AVAVDAVASA VASAVAGDIG ATMSHEGKVK RRQTTAVRED EKVCPYCKQE
FDSAVDCRNH RRTHPKPKVY KCGLCDKTFS QIPNLSYHRT IVHKDLRVVN GIDTTSVNAA
TGSSNPTVAN LAAVASSAVA ASVPLVDLQN VRVFHCDEVD CTFTYLTYQA LLAHKENDHS
GVNVKRPYRV SKATKKHACT FDGCNKVFAK FSDLTRHSRV HSGERPFECT HCGATFNQKY
RLTTHLRSHT GEKPFSCKYC GKTFARGDAV QSHIFAIHRA KGSAF