Gene PICST_31007 details

Gene Information

Locus tag: PICST_31007
Symbol:
ID: 4837982
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1499283
End bp: 1500326
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 12
GC content: 45%
IMG OID: 640389297
Product: predicted protein
Protein accession: XP_001383577
Protein GI: 150864655
COG category: [L] Replication, recombination and repair
COG ID: [COG0164] Ribonuclease HII
TIGRFAM ID: [TIGR00729] ribonuclease H, mammalian HI/archaeal HII subfamily
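
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in one stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1499283
end_bp = 1500326

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends in a single stop codon,
# which is not counted in the protein length.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1044 347
```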


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.718365
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.389801
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCGAC CACTGTCCGT GGAAGCTCCT GAGCTGGAAA GTGCAAGAAA AAGGAGGAAA 
TTGGAAACTG GCACGGCTTC AGCAGAAGAA GCTGGTGATT CAAGACCATA TCCTTTGTCA
GTAACAAGCA TAGAGAACCA TTTTGAATTC AAGTCATCAA CATATCATTC GGCAATCCCC
GTAGAAGTTC TCGAGAATCC AGACGAACCA GTTGTCTTGG GAGTAGATGA AGCTGGCAGA
GGTCCAGTTT TGGGTCCAAT GGTGTATGGC ATTGCATTTG CATTGGAGAA GTATCTGACA
AGATTGCAGA AGGAATATGG GTTTGCCGAT TCCAAGACTT TAAAGGAAGA GAAAAGAGAT
GAACTATTCT ACAGCATAGA GGATGAAGCG AATGAGCTTA ACAGAAATGT TGGCTGGGCT
ACTACTACGA TGACAGCTAG AGACATTTCT TCAGGGATGT TACGTTCAGT TTTGGGAATA
GGCAACTACA ACTTAAACGA ACAAGCCCAC GACACTACCA TTCAGCTTAT CAAGGAAGTT
ATTGCCAAAG GAGTTAATGT GAAGAAAATC TACGTAGACA CAGTAGGTCC CCCTGTGACG
TACCAGGCCA AATTGCAGAA GATATTTCCA GAAACGGAAG TTACGGTTGC GAAAAAGGCA
GACAGTATAT ATCCCATAGT AAGTACTGCT TCCGTGATGG CCAAGGTGAC AAGAGACGCC
AATATCAGGT GGTATAACCA CAATTTGGAT GTGTTGAAGG GCCACAAATT GGGTTCAGGC
TATCCCAGTG ACCCCAATAC CAGCAAGTGG CTCAATGGTA ATGTCGACAA GGTTTTTGGC
TGGTGCTACG GGTTTATTCG ATTCTCATGG CAGACAGCCA AGGACTCGTT GGTGAAACAC
GACGGGGTAG AGGTGATTTA CGAAGATGAA TGTGTAAAGC AGGACAATGG ATATGGCAAT
GTCAGCGAGT ATTTCAGCCA TAAGGACGAG CCTGTGAGAG GGAGCATCGA TAAGTTGTAT
TATAGTAGCG GAGTGAAACT TTGA
 
Protein sequence
MSRPSSVEAP ESESARKRRK LETGTASAEE AGDSRPYPLS VTSIENHFEF KSSTYHSAIP 
VEVLENPDEP VVLGVDEAGR GPVLGPMVYG IAFALEKYST RLQKEYGFAD SKTLKEEKRD
ELFYSIEDEA NELNRNVGWA TTTMTARDIS SGMLRSVLGI GNYNLNEQAH DTTIQLIKEV
IAKGVNVKKI YVDTVGPPVT YQAKLQKIFP ETEVTVAKKA DSIYPIVSTA SVMAKVTRDA
NIRWYNHNLD VLKGHKLGSG YPSDPNTSKW LNGNVDKVFG WCYGFIRFSW QTAKDSLVKH
DGVEVIYEDE CVKQDNGYGN VSEYFSHKDE PVRGSIDKLY YSSGVKL
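
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. This explains why the CTG codons in the gene sequence appear as S in the protein sequence. The sketch below translates the first row of the gene sequence under that table. The helper name `translate_table12` and the compact table layout are illustrative assumptions, not part of the database, and the code does no input validation.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 12, in TCAG codon order.
# It differs from the standard code only at CTG (Ser instead of Leu).
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna):
    """Translate a DNA string under table 12, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE_12[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from the record.
first_row = "ATGAGTCGAC CACTGTCCGT GGAAGCTCCT GAGCTGGAAA GTGCAAGAAA AAGGAGGAAA"
print(translate_table12(first_row))  # prints: MSRPSSVEAPESESARKRRK
```

The output matches the first 20 residues of the protein sequence. Under the standard code, positions 5 and 12 would be L instead of S.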