Gene PICST_31531 details


Gene Information

Locus tag: PICST_31531
Symbol:
ID: 4838374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 1021260
End bp: 1022504
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 12
GC content: 36%
IMG OID: 640389689
Product: predicted protein
Protein accession: XP_001384151
Protein GI: 150865082
COG category: [L] Replication, recombination and repair
COG ID: [COG1697] DNA topoisomerase VI, subunit A
TIGRFAM ID:
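
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption about this record, not something the page states) and that the CDS ends with a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1021260
end_bp = 1022504
gene_length_bp = 1245
protein_length_aa = 414

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# An unspliced CDS holds one codon per residue plus one stop codon.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # 1245 0 414
```

Under these assumptions the span matches the listed gene length, is a whole number of codons, and yields the listed protein length once the stop codon is excluded.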


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.288827
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0593553
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTCG AAGTAGCTTT TGGCTGCAAA TACAGTTATA ACAACACAGT CATGAAGCAC 
CACCTAAAGT TCATGGCAAT AAAGGCCGAG TACTCCTGTG AGGCGTCCTT CAGTTTAGCA
GTATGCTTGA AAAGTGAAAG TAGTGGAGAC GTCCAAATCT TCCATTCACA AGAAAATCAT
AACAAATCAA AAGAGAACAT GATCTTGACT TCCAAGTGGA TGGACATTGT ACAAACAATA
AATTCTGAAC GTGAGTTCGT TCTTTCGTAT GGTAGCAGAA ATGGTTCGAA GATACACTTT
TTAAGGCAGT TTGCAGACCT GGATATATTG GTCCAAAGAT TTACTGCCAC ATTAAAGGTG
TTGAAGATCC TACTTCTACA GGCACAATCA AATTCTAAGA AATCAACAAC AATAAGAGAT
ATTTACTATC AAGATGTTGA AGCCTTTCAC TGGAAACAAA GATATTGCAA TGAAATTTTG
CACCTGATTG TTGTCGATTC CTTGGGTTTG AGTTTGGAAC ACAATTTCAG CATATACCCT
TCTCAAAAGG GTCTTGTGTA TGGTGATTTT GCAATACAGT CTAATGAAGG AACTATATTT
CAAATGAGCT ATTCTGAAGA ACCAGTTTTA ATTCCTCTAC ACACCAAGTT TGAACACATC
TTACCAAATG AGGAAAGTCA CTATGCTATA GTCATTTTAG AAAAAGAAGC TGTTTTCCAA
TCCTTTTGCT ATTACATCAA GACCAGATAT ACGTTGAAAG ACAATTTCGT ACCTGACAAC
TTAATTGTGG TGACTGGAAA GGGGTTTCCA GATAATTTGA CCAAGAAATT TGTCAATATC
CTTGCAAATA CTGCATTCAC CAATTCAGTA ACTCTAGGCT TTTTCGACTC TGACGTTTAT
GGAATAAATA TCTGCAAAAA TTACCAAGAT GAAATTGCTA CGGAATCTAA ATCAAGAGAT
ATCTATGCTG GGGTCTATTT GATGGACTAC ATTGCTGGAT GGAGTGACAT TACGGCTAGA
GAAAGGATAC TAATAATGAG CACAATTACG AAGATAACTA CGGTGTATCC CACTATTCAA
AATAAAAGAT TTCACAGAGA GTTAACGAGA GGATTGTGGT TGTCCAAGAA ATGTGAAATG
AACGTATACC AAGGTGATGC AGACCAATCA GAAGGAATTT CATCAATTGC GATCAATGAA
TACATACTAT CTCAAATCAA TTCCCACAAA AAAGTGATCA AATAA
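
A GC content figure like the one in the Gene Information section can be recomputed from the sequence as displayed. This is a minimal sketch; `gc_content` is a hypothetical helper, and it assumes the 10-base grouping spaces and line breaks are display formatting to be stripped before counting.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring all whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First displayed line of the gene sequence, spaces kept as shown.
first_line = "ATGAAATTCG AAGTAGCTTT TGGCTGCAAA TACAGTTATA ACAACACAGT CATGAAGCAC"
print(f"{gc_content(first_line):.1%}")
```

Passing the full gene sequence (all displayed lines joined) instead of `first_line` gives the value to compare against the listed GC content.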
 
Protein sequence
MKFEVAFGCK YSYNNTVMKH HLKFMAIKAE YSCEASFSLA VCLKSESSGD VQIFHSQENH 
NKSKENMILT SKWMDIVQTI NSEREFVLSY GSRNGSKIHF LRQFADSDIL VQRFTATLKV
LKILLLQAQS NSKKSTTIRD IYYQDVEAFH WKQRYCNEIL HSIVVDSLGL SLEHNFSIYP
SQKGLVYGDF AIQSNEGTIF QMSYSEEPVL IPLHTKFEHI LPNEESHYAI VILEKEAVFQ
SFCYYIKTRY TLKDNFVPDN LIVVTGKGFP DNLTKKFVNI LANTAFTNSV TLGFFDSDVY
GINICKNYQD EIATESKSRD IYAGVYLMDY IAGWSDITAR ERILIMSTIT KITTVYPTIQ
NKRFHRELTR GLWLSKKCEM NVYQGDADQS EGISSIAINE YILSQINSHK KVIK
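
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on codons 101 to 110 of the gene sequence above (bases 301 to 330, with the display spaces removed). `translate`, `STANDARD_TABLE`, and `TABLE_12` are hypothetical names; the tables are built by hand here rather than taken from any library.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon order.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

STANDARD_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AAS)
}

# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = dict(STANDARD_TABLE, CTG="S")


def translate(seq: str, table: dict) -> str:
    """Translate in frame 0, ignoring whitespace, stopping at a stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = table[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Bases 301-330 of the gene sequence above; the seventh codon is CTG.
fragment = "TTAAGGCAGTTTGCAGACCTGGATATATTG"
print(translate(fragment, TABLE_12))        # LRQFADSDIL
print(translate(fragment, STANDARD_TABLE))  # LRQFADLDIL
```

The table 12 result matches residues 101 to 110 of the protein sequence above (LRQFADSDIL), whereas the standard code would place a leucine at residue 107.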