Gene PICST_61628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_61628 
SymbolMGS1 
ID4840046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp741127 
End bp743370 
Gene Length2244 bp 
Protein Length747 aa 
Translation table12 
GC content39% 
IMG OID640391361 
ProductDNA-dependent ATPase MGS1 (Maintenance of genome stability protein 1) DNA-directed DNA polymerase ATPase 
Protein accessionXP_001385499 
Protein GI150866032 
COG category[L] Replication, recombination and repair 
COG ID[COG2256] ATPase related to the helicase subunit of the Holliday junction resolvase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.17782 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTCAA CTTGTCCGAT CTGTGGAAAA CAGCTCTCCC TGCACTTGCT AGAAAGACAT 
GTCAATACTT GCCTAGATAA CCAGGAATCA AAAGAAGTAG AAGAAGTGGA GCAACCTAAA
GATGTCCAAG ATTTAGACGA AGTCGTAATA GTGGAAGATG AAATAGGCGA AATCAGGAAA
AGGCGACATG GTAGTGACGC GTTTGCTGCT CTAGGCTTGA AAATGGACTC ATCGCAAAAG
AGACAAAAGC TAGAGCGACT GCAAAAGAGC AAACCAACCT TAACTCGTAT TCTTATAGAA
GAGAAACGAT TAAAGTCTGA ATCACGTAAG ACGTCGGATG AGCTTAAAGA ACAATTGGAA
AAATTGTCTT CAAGCAAATC TTATGATGAA TCATTAATCT CAACACCATC AGATCAGCCA
AAAGTTGTAC CAGAACCTCA GGAAGAACCA AAAGAAGCAA AACCACCACC ACTTCAATCA
GATATACTTA CCAGGGCACA AGAATTGAAC AAACTCAAAC GTGAAGCAAG TATTCCGTTG
GCTCATAGAC TAAGACCCAA ATCACTAGAC GATTTCTTTG GCCAGGAGAA ACTTTTAGGA
CAAGATGGGA TATTACGTAA TATTATTAAT GCCGATAACA TACCATCGTT TATACTCTGG
GGAGTACCAG GAGTAGGAAA AACGTCATTG GCCAGAATAA TTGCACACAC AACAAATTGT
AAATTTGTAG AATTGTCTGG TGCAGAAAGT AATGCCAAAA GACTAAAAGA AGTGTTTCTC
CAGGCAGAAA ATGAAAAGCA TTTGACTGGC AGAAAGACCA TTTTATTCTT GGACGAGATC
CATAGATTCA ATAAAGCTGT ACAAGATTTG CTCCTTCCTG TGATTGAGAA GGGTGTTCTC
ACAGTTATTG GTGCCACTAC AGAGAATCCT TCTTTCACAT TAAACAATGC CTTGCTTTCA
CGAATGCATA CGTTTGTAAT GGAACAATTG ACCACAGCAG CGCTAATTAA GATTCTTGCT
AGGGCACTTT ATCAAGTCAA TAAACTTAGA AAGCATCTCT ATAATCTCCA TTACATCTCA
CTTAAAAGAG ATTCTTTTAA GTACATTGCA GAACTTTCGA TGGGAGATTC TAGAGTGGCA
CTCAATCTCT TGGAGTTCGT CAACGCGTAT TTGTCTACCG ATAAGTTTTC GACTAAAAAT
GGTGAAGAGC AGCCCAAGAA AATGGGAGTT ATCAACGTCT CCGCTGAAAG CCTCAAGAGT
ATACTTAAGT CTAGAGACTT TCATAACATG TACGATAAAC AAGGGGAATC GCACTATGAT
ACAATTAGTG CATTCCATAA ATCAGTAAGA GGTTCTGATG CTGATGCTGC GATGTTTTAC
TTGGTCAAAA TGTTATCTGG TGGAGAAGAC CCTCTCTTCA TTATTCGTAG AATGATAGTA
ATGGCTAGTG AAGATATCGG ATTGAGAGAT AGCTCATGTT TGCCCTTTGC CATTGCAGCC
AAAGAAGCGC TAGAGTTTGT AGGTATGCCT GAAGGAGAAA TCATTCTCGC TCACTGCGCA
ATGAAAATGG CCAGAGCACC AAAATCAACA AAATCATACA GAGCACTTAG AACTGCACAG
CTGCTTCTTC GGGAGAAGCC AGAAATTACT AAGCTTCCTA TTCCCATGCA TTTGAGGAAT
GCTCCAACCA AATTAATGAA AGAACTTGGT TATGGGGATT CCTATAAATA CAATCCAGGC
TACAAGAATG GTTTAGTCAA ACAAACGTAC TTCCCTGAAG GAATGGAAGA CGTTCACTTT
TTAGAAGAAA CGCACTTGGG GGTAGTTGAA GATTCTGATA CTTCCCCTGA AGAACAAGCA
AGGGCACAAG CACAAGTAGT CGACTACGAG AATTTTAAGA AAGACCGCAT AAAGCAGTTG
AAGTTGGAAT ACTTGAATCG TGTGAAAGCG ACAAGACAAG AGACCTTCAA TAAAATGATG
GCTAGATATA AGAAAAACGA AATCACAACA GTACCAGAGG TGCCTCAATA TATTGATACT
AATTCTAACG GTAAAAAAGA ACAAGGGAAA GAGAAAATTA GTGAACAGGA TTCAAGTTCA
GATACCGATT ACAAAGACAA TTTCGAAAAG TCATATGACG AATTTCTTGA TCCAGAATCA
CAACCTGAAT ATTTTGAAGG AGACGACACA GGGTACCATT ACGATCCAGA GTACCCATTG
ATTAGAGACG ACTATGAAAT CTAA
 
Protein sequence
MDSTCPICGK QLSSHLLERH VNTCLDNQES KEVEEVEQPK DVQDLDEVVI VEDEIGEIRK 
RRHGSDAFAA LGLKMDSSQK RQKLERSQKS KPTLTRILIE EKRLKSESRK TSDELKEQLE
KLSSSKSYDE SLISTPSDQP KVVPEPQEEP KEAKPPPLQS DILTRAQELN KLKREASIPL
AHRLRPKSLD DFFGQEKLLG QDGILRNIIN ADNIPSFILW GVPGVGKTSL ARIIAHTTNC
KFVELSGAES NAKRLKEVFL QAENEKHLTG RKTILFLDEI HRFNKAVQDL LLPVIEKGVL
TVIGATTENP SFTLNNALLS RMHTFVMEQL TTAALIKILA RALYQVNKLR KHLYNLHYIS
LKRDSFKYIA ELSMGDSRVA LNLLEFVNAY LSTDKFSTKN GEEQPKKMGV INVSAESLKS
ILKSRDFHNM YDKQGESHYD TISAFHKSVR GSDADAAMFY LVKMLSGGED PLFIIRRMIV
MASEDIGLRD SSCLPFAIAA KEALEFVGMP EGEIILAHCA MKMARAPKST KSYRALRTAQ
SLLREKPEIT KLPIPMHLRN APTKLMKELG YGDSYKYNPG YKNGLVKQTY FPEGMEDVHF
LEETHLGVVE DSDTSPEEQA RAQAQVVDYE NFKKDRIKQL KLEYLNRVKA TRQETFNKMM
ARYKKNEITT VPEVPQYIDT NSNGKKEQGK EKISEQDSSS DTDYKDNFEK SYDEFLDPES
QPEYFEGDDT GYHYDPEYPL IRDDYEI