Gene PICST_35815 details


Gene Information

Locus tag: PICST_35815
Symbol:
ID: 4838820
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 403279
End bp: 405573
Gene Length: 2295 bp
Protein Length: 764 aa
Translation table: 12
GC content: 45%
IMG OID: 640390135
Product: predicted protein
Protein accession: XP_001384033
Protein GI: 150864993
COG category: [R] General function prediction only
COG ID: [COG5275] BRCT domain type II
TIGRFAM ID:
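
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the record states neither convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(403279, 405573)
print(length)                  # 2295, matching "Gene Length"
print(protein_length(length))  # 764, matching "Protein Length"
```

Under these assumptions both values agree with the record, which is consistent with the gene being unspliced.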


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.756935
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCAG AACAAGTTTT GTCCACAATT CCGGATGCAG ACTTGCCAGA GGATGTAGAA 
GATAAGAAGT TCAACTTCTT TGCCAAAAAG GCAGCCGACG CGTCAACTGG CACTGGCGAA
CTGGTTCAAA TTCCAGAAGC TCAACCAAAC TGCTTATCTG GTTTAACTAT TGTGTTCACT
GGTGTGTTGC CCAGATTAGA TAGAGATACT TCCGAAAATT TGGCCAAGAG GTACGGCGCC
AAAGTTACAA AGTCTATTTC TGGTAAGACG AGTTTAGTAG TTATAGGAGA TGAAGCTGGT
CCATCCAAGA TCAAAAAGAT AAAACTGTTG CATATAAAAG CCATAAACGA AGATGGGTTC
ATACAGCTTT TGCAGAGTAT GCCAATGGAA GGAGGAGATG GAGCAGCCGC TCAGAAGGCT
AAATTGAAAA GAGAAGAAGA GGAACGTAAA ATCGTAGAAG AAGCCGAAGA AGAAGAACGT
AGGGCCCAAG AGGAGGAAGC CAGACAGAAA AGGCTTTTGG AAGCACGAAA AGCTGCCTCA
ATTGAAAAGG CCTCGCATTC ACAATCTTCT CACAGAGAGC CAGCAGAAGT TCCCAGAATC
ATACCTGACA GCGAAAAGTT GTGGACTGTT AAGTATGCTC CCAGCAGGAT TGACCAGCTT
TGTGGAAACA AAGGACAAGT TCAGAAATTG CAAAACTGGC TTTCCAACTG GTTCGACAAC
GCTAAGAAGG ATTTCAAGGT TCCTGGAAGA GATGGCAGTG GTATTTACCG AGCTTGCTTA
ATAAGTGGTC CACCGGGTAT TGGTAAGACT AGTGCAGCGC ATTTGGTTGC AAAATCGTTG
GGATTTGACA TCTTGGAGAA GAATGCCTCA GATGTCAGAT CCAAGTCTTT ATTGAACTCC
AATCTCAAGT CCGTGTTAAC AAACACTTCA GTTGTGGGGT TTTTCAAGCA TCGCGATGAA
AACATTCAAC ACACTCAGAA TGATCGTAGA TTCTGCCTTA TTATGGATGA AGTGGATGGA
ATGTCTAGTG GTGATCACGG AGGTGCTGGG GCGTTATCGG CTTTTTGTAA AATCACACAT
ATGCCTATGA TCTTGATTTG TAACGATAAG TCGTTGCCCA AGATGAGGAC ATTTGATCGT
GTTACGTTGG ACTTACCTTT CAGAAGACCT TCCGAAGCTG AGATGAAGTC CAGATTGATG
TCTATTGCTT TACGAGAAAA GATCAAATTG GATCCCACAG TTATTGGGCA GTTAGTCCAG
GCTACAGGGA ACGATATCAG ACAGATCATT AACTTGTTGT CTACAGTTTC CAAGACTCAG
TCGAGTATAG GCGGAGAACA AGCCAAGGAC GCAGCCAATA GCTGGAAAAA GCAAACAGTT
TTGAAGCCGT TTGATATCAC AGCAAGATTG TTGAATGCTC AAATCTACTC TCCTAATGCA
AAGCATTCCT TGAACGACAA GATTGATTTG TATTTCAACG ATATTGATTT CGCTCCGCTT
ATGATTCAAG AAAACTACTT GCTGACACGG CCATCCGACG CTAGAACCCC ACAGGACCAT
CTAAGAAGAG TTGCAAATGC TGCCGACGAC ATTTCTGAGT CTGACCGGAT CAACTCTCTC
ATCAGATCGA GTGAGCAGCA GTGGAGTTTG TTACCATTCC ATGGTGTTAT GTCATCTGTC
AAGCCTTCTT CGCAAGTGGC AGGCCAGATT TCGCAACGTA TCAACTTTGC TGGATGGTTG
GGACAAAATT CGAAAGCCAT GAAGTACCAG CGGATGTTGC AGGACTTGCA GTACCATACT
CGCTTAAGAA CTTCTACAGA CAAGAAGGAG TTACGCTTGG ACTACTTGCC TACACTTCGT
CAACGGTTGA CTGAGCCATT GTTACAAGGC GAAGAGCAAG GGATCCAGCC AGTGATCGAC
ATCATGGACT ACTACTACTT GACTAAAGAA GATTGGGACA ATATTGTAGA TTTGGGTGTA
GGCAGGTACA AGGGAGAGTT GGTGTTGAAG GGTATCAGCA CCAAAACCAA GTCCGCGTTC
ACTAGAAGCT ACAATAGCAC GACGCATCCT ATTGCTATCT ACAAGACGGG TAATTCTGTG
GGAGCCATGG TATCGTCCAA GAAGAATGTG GACTACGAAG ATGTGATTGA AGATGACACA
AACACTCCCG ATAAGGACGA TGAAGAAGTT GATCCCGATA AGATCGACAG CAAAAAGGAC
AAATTGATCA AGGAGGTGAA GCCCAAGAAG ACGGCCAGGA AAGCCAAGGC GGCACCCAAA
GGTCGCAAGA AGTGA
 
Protein sequence
MTAEQVLSTI PDADLPEDVE DKKFNFFAKK AADASTGTGE SVQIPEAQPN CLSGLTIVFT 
GVLPRLDRDT SENLAKRYGA KVTKSISGKT SLVVIGDEAG PSKIKKIKSL HIKAINEDGF
IQLLQSMPME GGDGAAAQKA KLKREEEERK IVEEAEEEER RAQEEEARQK RLLEARKAAS
IEKASHSQSS HREPAEVPRI IPDSEKLWTV KYAPSRIDQL CGNKGQVQKL QNWLSNWFDN
AKKDFKVPGR DGSGIYRACL ISGPPGIGKT SAAHLVAKSL GFDILEKNAS DVRSKSLLNS
NLKSVLTNTS VVGFFKHRDE NIQHTQNDRR FCLIMDEVDG MSSGDHGGAG ALSAFCKITH
MPMILICNDK SLPKMRTFDR VTLDLPFRRP SEAEMKSRLM SIALREKIKL DPTVIGQLVQ
ATGNDIRQII NLLSTVSKTQ SSIGGEQAKD AANSWKKQTV LKPFDITARL LNAQIYSPNA
KHSLNDKIDL YFNDIDFAPL MIQENYLSTR PSDARTPQDH LRRVANAADD ISESDRINSL
IRSSEQQWSL LPFHGVMSSV KPSSQVAGQI SQRINFAGWL GQNSKAMKYQ RMLQDLQYHT
RLRTSTDKKE LRLDYLPTLR QRLTEPLLQG EEQGIQPVID IMDYYYLTKE DWDNIVDLGV
GRYKGELVLK GISTKTKSAF TRSYNSTTHP IAIYKTGNSV GAMVSSKKNV DYEDVIEDDT
NTPDKDDEEV DPDKIDSKKD KLIKEVKPKK TARKAKAAPK GRKK
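
The protein sequence follows from the gene sequence only under translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The following is a minimal sketch of that translation in plain Python. The names `build_codon_table` and `translate` are hypothetical, and only tables 1 and 12 are handled.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_codon_table(table_id: int = 12) -> dict:
    """Return a codon -> amino acid map for table 1 or table 12."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AA))
    if table_id == 12:
        table["CTG"] = "S"  # the only difference from the standard code
    elif table_id != 1:
        raise ValueError("only tables 1 and 12 are supported in this sketch")
    return table


def translate(dna: str, table_id: int = 12) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    table = build_codon_table(table_id)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above
print(translate("ATGACCGCAG AACAAGTTTT GTCCACAATT"))  # MTAEQVLSTI
# Bases 55-69, which contain a CTG codon
print(translate("GGCGAACTGGTTCAA"))              # GESVQ (table 12)
print(translate("GGCGAACTGGTTCAA", table_id=1))  # GELVQ (standard code)
```

The second fragment shows why the table matters: the listed protein reads GESVQ at this position, which the standard code would render as GELVQ.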