Gene PICST_28111 details


Gene Information

Locus tag: PICST_28111
Symbol:
ID: 4850893
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 347837
End bp: 351421
Gene Length: 3585 bp
Protein Length: 924 aa
Translation table:
GC content: 41%
IMG OID: 640392601
Product: Zn-finger protein
Protein accession: XP_001387714
Protein GI: 126273853
COG category:
COG ID:
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
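
The coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed 3585 bp) and that the coding sequence ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 347837
end_bp = 351421
protein_length_aa = 924

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Bases needed to encode the protein plus one stop codon.
coding_bp = 3 * (protein_length_aa + 1)

# Whatever remains lies inside the gene span but is not translated
# (consistent with "Is gene spliced: Yes", e.g. intron sequence).
noncoding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, noncoding_bp)  # 3585 2775 810
```

Under these assumptions, about 810 bp of the gene span is not accounted for by the 924 aa product, which fits a spliced gene.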


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.689755
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGATCTCAC CGTCCAAGAA GTCGACACTC AAGGTGTTGG CTGACGGAAA GGTCATGAAG 
GTGCAGAAAA CCCGACAGAG AAAGATTCTC TCGTGCATCT ACTGCCATTC GAAAAAGATC
AAGTGTCTGA GAGTACAACC CGTGTGTAAC AATTGTGAGA AGCTAGGTGT CGAGTGCAAA
TACTTCATCA ACGAAAGAGT AAGTCGAGGA GGTAAAGAAT CGGCCAGACT CTCAGATAAG
GAGAAGGAGT CGCGGGGCTT GAATTCTACT AGAGATAAAA ACTACACCTT AAACAAGAAA
TCCAAGAGCA GAAGCTCTAG TATAGCTGAT GAAAAGGAAA ATAACGATAA CATAAACGAC
AATAACATTC ATAATGGCAA CCTCAATGTC GACCTCAATG GCAACAATAT CGATGGTCGC
AGTTCTTTCT CGTCTGATGA TCAGGACATG GAACTTGACC TAGAGAAGGA CTCCTCCCCG
GAGTCGCACA TGATGGTCTC AGGCACAGAC TCTTCAGCAA CTGTCACCAA TGCCAGTTCT
CTCAGTGATG ACAAAAATGG CAGTGTGGAT GGTAGTGTCA ACGGACTTGG CAAAAGACGT
AGTGTGAACG CCAACTCGTC TCACAATAGT AGCAATAACA ACAGTGGCAA TGGCATCAAT
TCTAATAATG GTGCTAATGC TAGCGTTAAT AGTAATGGTA ATATATCTGG AGCTGAGTTC
AAGACAGCTA GTCCAAATAC GATTATCCCG GATCTTTCGC TCAACTACTT GAACTTAAAC
TTCAATCTTC ACAATGCCAA CCCTAACGAC GATGCTAGCT CAAACGGCAT TCCAAATGTC
ACGAGTAAGA TGCCTATGAT CTCCAATAGC GTTCTCCAAA CCCCGATCAT AAACAGTGGC
TCTAACAACA TCACCAACAA CTACTTCAAC ACCAACTTTC TGAGCCAGGC TCCTAACATA
ACCTCTCTTA CTGGCTCTAC TGCTACTACT ACTGGCCCTG CCACAGGACC AGAGGACCAA
TATTCGTTCA ATTTGAGCTC ATTTACGTTT GCTCATCAAG AAGGCTTGAT TGCTTCCAAC
ATGCCCTCTG TCGTAAATTC GCCCATGACA CAGCCTAATG CTCAGCCATC TTTGTCTCAG
ACCAACTCGA CTACCAACAT CAACTCGTTT TTCACAAGAA ACTTGAATAA CGAGGAACCA
ACTCCAAGAG TCGATTCTCC TTCAGGTATG GCAAAGAACA ATAAGGCAAA ACCCCAGCCA
AACAACCCAG GTTTACATAA TTCGTTGAAT GCCTATTCTT CAAACCCAGC TACCACCATG
AACTACCTCT ATGGAACAAA CACATACTAC GACAACGATC ATCTTTTGGA TGATCTATTG
AACCACTTGC CAAGCGGAAA AGAACGTTCC TTTGAGCTTA TTGACCGTTA CATCAACTCT
GTCCATTTGC TCTTGCCCAT TGTAGTCAAT CTCAGTGACT TCTTAAAAGA GCATGAACGG
TACTGGGATC TTGCTACGGG GGCCGCAAGC AACAACACTA GTAACAGTAA CAACAATAAC
AATGCTAATA GCAATAATAG CAATCATGTG AATAACAAAT CGAGCCCTGA AAGTATTTCC
TCGGCAAGCT CTAAAGGCGA TGACGTCGAT TTCAATTACT TGCAGTTCTA CACGTTGTAC
TTCCCCATTT TGTACGCTTC GACAATCTCT GAATTTGAAG AGTATGACAA CTTATTGTTG
AACCAGGACA TCAATCGTTA CTTGAAGGCC TTCAACAAGA TCTGCCAGTA TTATAATTAC
CCACATGGAA TCAAGACGAT TCCACTCTTG TTGGGTAATG TGATTATCCA GTCTACATCG
CCCAACCCTT CGACAATGGA GATGTCCCAG ATTATCAGGT ACGCTAAGTT CTTACAGATG
CACAAGGACC CTTTGATCAG TTTGCGCATC AACGACTGGG AAGTCATCAA GTTCAGAAGA
TTGCTCTGGT GGGTAATTTT CGGCTTGGAT GCCTTGACGT CTCACAACTT CTGCTTACCT
CCAGTGTGCA AATTCGAAGA TTTTAATGTC TTGATGCCCG AAGAAGAAGA GCCTATTTTC
GACAAGTTGG GAATCATCAA AGAAAAGAAG TTGAACGTCA GTATCCTTTC CATGAATGTC
AAATTCAAAT ATGATCGCAT TTTGAGTGAG TTAGTGTATC ATTTGCACAA CGGCTTGTCT
TCCGATATCA CGCCTAATCA AATCAATGAA ATCAAGGGCA TGATCATAGA CTACTTTAAA
TATATCCATA GATCAATCTT TAGAATGAAC CAGTTCTACA AGTTGAATCC TCCTACTACT
GTTCAAGAAA TGAACTTGAT CAATTTCATC AAGAATCACT CCTGGAGTTT TGTTGACAGA
GCTCTTATGT TGTTACACAA GAAGATCTTG TTCGGTGACA ACACTGTTAA CGACTATGGA
TCTGACGATG GAAGAGGTGG CAAGTCAAGA GGAAAAGGCG CTGGTGAGTT GGAAAACTTG
ACTAAGACTA GAGGTGGCAT TCTATCCTTG AGTCAGTACG AGGACACATT TGGGAGAATT
CAAGAAGCCA ATATCATCAA GAACTTCAAT AACTCGTCGA TTTCATTATT AAGATTCAAC
CAATTCGAAA ACTTCTCCTA TGAAAACATG CACAACAATT TGATACCTTC CATTTTGCAT
AACTTGAACG ATTTCTTGAA GTACAATGAT TTCATCAAGT TTGGAAAGTA CAATTGGTAT
ATCAAGAGAA CAATTCCCTT GGATTCAATA ATCTTGATGT TCATCCTCAT TACTGTCAAG
TTGAAGTACG AGTTTATGAC GATGAATGAA TTATGCATCT ATGTCAAATT GATCAACAAG
GCATTATTTA TTTTGAACAG AAAGTGGTTC AAAAATGAGA AGTATAAGAG AATGTTGTCG
TTAACCAATT TAACCTGGGA GTTCCTCTTG AAGAGGTACA ACATAATTGA GTTGATCAAC
CAATATAACG AATCCACTCT CAGTGGAGGA TCCAGTGGTG GAAGGGTAGA GTTTTTTGAC
TACCAAGTTA CCAGCTACAT GAACATGAAT GATCTTTTTA ATGTCATGGA TGTTCCTCAG
CCGATGTTCA ACGATTTGGA ATTCTTGGAA GCTGCCGAAG AGGAGCAGAA GAAACGAGAA
ATGCAACATC AAAAACAACA TCAATTGCTT CATCCTCATT CCAACCACCG CCACCACCGT
CGTGGTAGCC ATGAATCCCG TAACTCAAGT AGCCCTACAC ACAACTCAAA TGGCAGTAGT
ACTTCTAATA AGGATAAGGA GTCAAAGAAG GATAGAGCGG ACATGTTCTC CAATACTTTG
ATCCACAGCT CCACTTTGGA TACCTCTCGC AGCCATGCGA ACAATAAGTT GACAATAGTG
AACCACAATT TGAGTGCGGA TAGAAAGACG GAGTTGATTC AGTTGAACGA AAAGATCTAC
TACGACTTGA GAAACAATTT TGTAGACATC AACGACTACT GTGCTTTCTA CTCCAGTTTA
GAAAACATTC TTCACGAATT AATGGACTAC ATCCACAAGG GTTGA
 
Protein sequence
MISPSKKSTL KVLADGKVMK VQKTRQRKIL SCIYCHSKKI KCLRVQPVCN NCEKLGVECK 
YFINERTYNN NSGNGINSNN GANASVNSNG NISGAEFKTA SPNTIIPDLS LNYLNLNFNL
HNANPNDDAS SNGIPNRPED QYSFNLSSFT FAHQEGLIAS NMPSVVNSPM TQPNAQPSLS
QTNSTTNINS FFTRNLNNEE PTPRVDSPSG MAKNNKAKPQ PNNPGLHNSL NAYSSNPATT
MNYLYGTNTY YDNDHLLDDL LNHLPSGKER SFELIDRYIN SVHLLLPIVV NLSDFLKEHE
RYWDLATGAA SNNTSNSDDV DFNYLQFYTL YFPILYASTI SEFEEYDNLL LNQDINRYLK
AFNKICQYYN YPHGIKTIPL LLGNVIIQST SPNPSTMEMS QIIRYAKFLQ MHKDPLISLR
INDWEVIKFR RLLWWVIFGL DALTSHNFCL PPVCKFEDFN VLMPEEEEPI FDKLGIIKEK
KLNVSILSMN VKFKYDRILS ELVYHLHNGL SSDITPNQIN EIKGMIIDYF KYIHRSIFRM
NQFYKLNPPT TVQEMNLINF IKNHSWSFVD RALMLLHKKI LFGDNTVNDY GSDDGRGGKS
RGKGAGELEN LTKTRGGILS LSQYEDTFGR IQEANIIKNF NNSSISLLRF NQFENFSYEN
MHNNLIPSIL HNLNDFLKYN DFIKFGKYNW YIKRTIPLDS IILMFILITV KLKYEFMTMN
ELCIYVKLIN KALFILNRKW FKNEKYKRML SLTNLTWEFL LKRYNIIELI NQYNESTLSG
GSSGGRVEFF DYQVTSYMNM NDLFNVMDVP QPIHESRNSS SPTHNSNGSS TSNKDKESKK
DRADMFSNTL IHSSTLDTSR SHANNKLTIV NHNLSADRKT ELIQLNEKIY YDLRNNFVDI
NDYCAFYSSL ENILHELMDY IHKG
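
The sequences above are displayed in blocks of 10 characters, 60 per line, so they need to be cleaned before any length or composition check. The following is a minimal sketch of one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tool, and only the first displayed line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence shown in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (empty input gives 0.0)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First displayed line of the gene sequence, copied from the listing above.
first_line = "ATGATCTCAC CGTCCAAGAA GTCGACACTC AAGGTGTTGG CTGACGGAAA GGTCATGAAG"
dna = clean_sequence(first_line)
print(len(dna))  # 60
```

Applying `clean_sequence` to the full gene listing and passing the result to `gc_percent` would give a value to compare against the GC content field reported for this gene.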