Gene PICST_83723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_83723 
Symbol 
ID4839230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp814833 
End bp816902 
Gene Length2070 bp 
Protein Length557 aa 
Translation table12 
GC content41% 
IMG OID640390545 
Productpredicted protein 
Protein accessionXP_001384821 
Protein GI150865554 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000478126 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.12365 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AGACCCTATA AATTGCCATT GGAAATATTT GTCTAGTCTA ATATTTGTAT ATTTAGGTGA 
TTTCAACCTC TTTAAGCCGC CTTAGTACAG CCTATTTTCA CAAATAGTAG CACTGAAGAT
TTATTAAACT ACGAAGTATT CACTTTCTGT TTTCACTACG ATTTTCATTC GGCAATACTT
AATCCGCATA TTCCTACGAA GTTTGCCAAT TTAACCAAGC ACCTTCATTC TAATTGTGGA
GACTACTTCA GTTGCTAGTC TTATCTAAGG AACGACTATA GACTACTGTG AAATTTCTAT
ATATTCAACC CAGGGCAAAC CCACTAGACC AAACACGCAT AATGAAGTCA TGTCGTACAT
AGTAAAGAAA ATATTCATTC CCAAATCGTA CCGTAAGGTG CTCGTCACCA TATTCCTGGC
CCTTACTATG GTATTGATAT ATAGCTACCT AAGAGCATCG ACCGCGGTGT ATCCAGATGT
CAAGCTATTA CAGAACAATT TCGACCAGCC ACATACCAAA CTAGGAACGG TGATCACCGA
TATCAAGATC ATCAAATGCT ACTACCGGAA TTGCAAAGCG CCCGTGGGCT ATTCCAAGAT
CCAGCCTCCC ATGAATTACT ACGAAACAAC TGTAGACTCT TCTGCTACCA CCAAATTGAC
ATCTCCCTTG TATTATATCG TCATTAAGGA GCAGCCCGTA GACACTGCCA CCAACATCCT
TTTAGATCTC ACGTTTGACA AGCCTAAGGA TGACGCTGGG TTCGAACTCA TCAAAAGCGA
CAACTACCAG TTGTACAAGA AGTTTTTCAG CACGAACATC AAGAACCCCA TTCCAAGTGA
TTTGCCTTTG GTGCATTCAC TTGATTTGCT CTTTGGTTCA AACGACCTCA TCGACTCGCG
TGTGAACCAT TTCTCTGTTC ACTCCTCAAT GACGGAAGAA AAGATTCATC CCATCATCTC
GATGTTCAAA TTGCCTCAAT CGAAACAAGA CATCTGGACG CAAGATACTT CGCAATTTGT
GATGATGCAA GAGACAAGCA TTCTTCGTAT TGATGAATCC GTGACCAAGT TCAAAGTGAT
TCAGATGAGT GACTTGCACT TTGGACAAAG TCTTGGTAGA AAATGCGGTA AGGATCAGGA
GTTGTGTACT TCAGACTTGA AAACCTTGAA ATTCATGGAG GACAGTATAC ACAAAGAGAA
CCCAGATTTG GTGGTAATCA CAGGTGACTT GATAGACGTA GACCGTAGTG TCGACTATAA
GTCGATAATT CTCAAGTCGT TACAGCCTAT ACTACAGACA AATACCAAGT TCATATTCAC
TTTTGGTGAT GAGTTTGACG GCCAGGAAAA CCTCAGAGAG ATCAAATTGT CCTTGATTAA
GTTTTTGCAA ACGTTGCCCA ATTGCTACAA CACTATAGAA GGAATTGACG ATAGTTTGCA
TGGAGTTACT AACTACAACT TGAAAGTGAT AAGAGGCGAA AAGGAAGTAG CTCATGTTAC
TGTTTTTGAC TCCGAAGATA AATATCTTGA TGAAACACAG ACCAATTTCT TGTACCGTAT
CCATGCCGAG GACCCTGAGA AATTGTTTAA GTTGTTATTC TTCCATTTTC CCATTCCTCA
GTTCCGTCCC ACTGGAAAGT TCAAGATAAT TGGAAGTTAC AATGAGAAGC ATCCTCTCAA
TTCCAAGACG AAGCCGCAGG TGCTTGATGA TATCCGCAAC TGCGGTTATC AAGTAGTCAG
TGTAGGACAC GAACATGAGA ATGATGCCTG TCTCCTAAAC GAAAAATCTA GTGCTTCAGG
AGAACAGTCT ATCTGGCTCT GCTACAGTAG CGTTGCTGGA GATTCTGGAG TCACTGCTCT
TGATGCCAAC TACGATCGTA AGCTCAGAGT GTATGAAATC GACTTTGAGA AGAGTATATT
GTTGAGTTGG AAGAGAAGTG AAATGAAAAA GAAGGGATTT GACTACCAGC TGGTCTACAA
GTTCCCATCC TTACCTGAGG CACCAAAGGA ACCCAAACCA TAGACATAAG TCGTTGAATC
ATATAGGCAT ATGCAATCTA ATTATTATTT
 
Protein sequence
MSYIVKKIFI PKSYRKVLVT IFSALTMVLI YSYLRASTAV YPDVKLLQNN FDQPHTKLGT 
VITDIKIIKC YYRNCKAPVG YSKIQPPMNY YETTVDSSAT TKLTSPLYYI VIKEQPVDTA
TNILLDLTFD KPKDDAGFEL IKSDNYQLYK KFFSTNIKNP IPSDLPLVHS LDLLFGSNDL
IDSRVNHFSV HSSMTEEKIH PIISMFKLPQ SKQDIWTQDT SQFVMMQETS ILRIDESVTK
FKVIQMSDLH FGQSLGRKCG KDQELCTSDL KTLKFMEDSI HKENPDLVVI TGDLIDVDRS
VDYKSIILKS LQPILQTNTK FIFTFGDEFD GQENLREIKL SLIKFLQTLP NCYNTIEGID
DSLHGVTNYN LKVIRGEKEV AHVTVFDSED KYLDETQTNF LYRIHAEDPE KLFKLLFFHF
PIPQFRPTGK FKIIGSYNEK HPLNSKTKPQ VLDDIRNCGY QVVSVGHEHE NDACLLNEKS
SASGEQSIWL CYSSVAGDSG VTALDANYDR KLRVYEIDFE KSILLSWKRS EMKKKGFDYQ
SVYKFPSLPE APKEPKP