Gene PICST_30617 details

Gene Information

Locus tag: PICST_30617
Symbol: ZIC3
ID: 4838057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 519625
End bp: 520875
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 12
GC content: 45%
IMG OID: 640389372
Product: zf-C2H2 Zinc finger, C2H2 type
Protein accession: XP_001383735
Protein GI: 150864766
COG category:
COG ID:
TIGRFAM ID:
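
The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption here, since the record does not state it. A minimal sketch of the check, with illustrative variable names:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 519625
end_bp = 520875

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final codon is the stop codon
# and does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1251
print(protein_length)  # 416
```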


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.995284
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCTCTCT CTGAAAAAGA AGAAGAACTC CCAACAGAAG ACAAACCAGG ATTCCCATGT 
AAATGGCAAG ACTGTGAAGT AGACATCTTT CCCCTGTTGA CGGGATTAGT GGACCATCTC
AACACAAATC ATTTGGCCCA TATGGCACAC TTGACACCAA CGACTCCTAT CAGATACACC
TGCCAATGGC AAGGCTGTCC TCGTTTTGGC ATTGAACAAC CTTCACGTTT CGCTCTCATT
TCCCACTGTA GAACCCACAC CGGTGAGAAA CCGTATTTTT GTCCTATTCC CGAATGCGAG
AAACACTTTA CAAGGTCAGA TGCTCTTGCC AAGCACGTCA AGGGAGTTCA TGATTTGCAC
ACTATAAAAG ACGCTGTGAA CTCTATCAGA GACAAGTATG CCAAAGGAAC TTTGTCAGCT
GTAGCTTATG ACTGGCTCGA ACTAGATGAA TTCAATGAAG ATATGTATCT CCGCTTAGTT
GAAGAGGATT ATGAGTACAA GAACCCATGG TGGTATTCTC AGAAGTTCTT GGACGTCTTG
AAGGAAGGAG GAGTTCATCT GGTAGGGTAT GAAGATGAAG ATGAAGAAGA AGAAGAAGAA
GACGAAGACG ACGACGACGA CGATGACGAC GAGGAAGAAG ATGGAGATGA TGACGAAGAC
GAGTATGATG GTTCGGTTCA CAAACACAGT AAGAAAACTC TCACTGCTGG AGGCAGCATA
ACTGCCGATG TTTTCTTCAA CTTGCCTTAC AATTTCACCC AGCACAAGAT AGCCGCCATA
CGGTACCAAA ACTATTTTGC AGAAGATGAT GGCGAAGATT TACTTACGGG TGAAGACGAT
AACAACAAGA CAATTAATCT CGTCAAGAGA CAACAACACC ACGACAGCCC AAGTAGAAAA
CACAAGGACG TCAATTCCAA TTCTTTACAC AAATTGCTCA AACTGAAAGC ACGAGTGTTG
AAGACAGGTT ACCCAGCTAT AGAGACCCCA GATGTAGAAG ACATCGATGA TCTTGCCGAG
TTGAAGTCGC TCCATGCCAA GTTGACTAGC CAGCTCAACA CTGCCTCTAA GATTAACAAG
GTCGTGGGAA AGCAGCTTTC TGTCTCCATA AAGCAGAAGC GTAAGCTCTG GTTGATCAAC
CAGTTGTTGA TAGATGGCAA CTTAGAAGCT GGGCTTCCAC TCAAAAGGCT GACTGAGCCC
CAGCGCGTTG CCAGAGACGC TGTAGATAGA GAGTTATTGA GAACCACCTG A
 
Protein sequence
MSLSEKEEEL PTEDKPGFPC KWQDCEVDIF PSLTGLVDHL NTNHLAHMAH LTPTTPIRYT 
CQWQGCPRFG IEQPSRFALI SHCRTHTGEK PYFCPIPECE KHFTRSDALA KHVKGVHDLH
TIKDAVNSIR DKYAKGTLSA VAYDWLELDE FNEDMYLRLV EEDYEYKNPW WYSQKFLDVL
KEGGVHSVGY EDEDEEEEEE DEDDDDDDDD EEEDGDDDED EYDGSVHKHS KKTLTAGGSI
TADVFFNLPY NFTQHKIAAI RYQNYFAEDD GEDLLTGEDD NNKTINLVKR QQHHDSPSRK
HKDVNSNSLH KLLKSKARVL KTGYPAIETP DVEDIDDLAE LKSLHAKLTS QLNTASKINK
VVGKQLSVSI KQKRKLWLIN QLLIDGNLEA GLPLKRSTEP QRVARDAVDR ELLRTT
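
Translation table 12 in the NCBI scheme is the Alternative Yeast Nuclear Code, in which the codon CTG is read as serine rather than leucine. The sketch below uses only the Python standard library to show how the first two lines of the gene sequence map onto the first 40 residues of the protein sequence under that table. The function name `translate_table12` and the constants are illustrative and are not part of the record.

```python
from itertools import product

# Codon order used by the NCBI genetic-code listings: T, C, A, G,
# with the third base varying fastest.
BASES = "TCAG"

# Amino acids for NCBI translation table 12. It differs from the standard
# code at a single codon: CTG encodes S (serine) instead of L (leucine).
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna: str) -> str:
    """Translate a DNA string with table 12, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence in blocks of 10.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE_12[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two lines of the gene sequence from this record.
first_120_nt = """
ATGTCTCTCT CTGAAAAAGA AGAAGAACTC CCAACAGAAG ACAAACCAGG ATTCCCATGT
AAATGGCAAG ACTGTGAAGT AGACATCTTT CCCCTGTTGA CGGGATTAGT GGACCATCTC
"""

print(translate_table12(first_120_nt))
# MSLSEKEEELPTEDKPGFPCKWQDCEVDIFPSLTGLVDHL
```

Residue 32 is the first place where the table matters: the codon CTG is translated as S, matching the protein sequence listed above, whereas the standard code would give L.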