Gene PICST_66043 details


Gene Information

Locus tag: PICST_66043
Symbol:
ID: 4840461
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 224660
End bp: 227901
Gene Length: 3242 bp
Protein Length: 1009 aa
Translation table: 12
GC content: 40%
IMG OID: 640391776
Product: predicted protein
Protein accession: XP_001386255
Protein GI: 150866601
COG category: [R] General function prediction only
COG ID: [COG1524] Uncharacterized proteins of the AP superfamily
TIGRFAM ID:
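
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes 1-based inclusive coordinates and a single stop codon, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 224660
END_BP = 227901
PROTEIN_LENGTH_AA = 1009


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def min_coding_bp(protein_length_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumption)."""
    return 3 * (protein_length_aa + 1)


gene_length = span_length(START_BP, END_BP)
coding_bp = min_coding_bp(PROTEIN_LENGTH_AA)
# Bases in the gene span that the protein and its stop codon do not account for.
unaccounted_bp = gene_length - coding_bp

print(gene_length, coding_bp, unaccounted_bp)
```

Under these assumptions the span length agrees with the listed Gene Length of 3242 bp. The bases left over after the coding requirement are at least consistent with the gene being marked as spliced, although the record does not say how they are distributed.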


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.219798
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAGAA ACCGTCAGCT TCTTTTGTTA GTTGGGCTTG TGTTCCATTT TTTCTACCTT 
TGGTCCATTT TCGACATCTA CTTCGTCCTG CCCTTAGTAC ATGGAATGGA CCACCATGTT
TCCACGACTA CAGCACCAGC CAAACGTCTC TTTCTAATAG TGGGAGATGG ACTTCGTGCT
GACAAGACAT TCCAAAAATT GAAGCATCCA AGAACTGGCG AAACAAAATA CTTGGCTCCG
TACTTGAGAA GCATCGCTCA GAATGAAGGC ACCTGGGGCA TTTCGAACAC GAGAATGCCG
ACTGAGTCTA GACCTGGCCA CGTAGCCATG ATCGCTGGTT TTTACGAAGA TGTCTCTGCT
GTTACCAAGG GATGGAAGGA AAATCCAGTA GATTTCGACT CATTTTTTAA CCAATCGAAA
CACACCTATT CCTTTGGATC GCCCGATATC TTACCCATGT TCGCTTATGG TGAAGGAGTA
GTTCCCGGAA GAATCGACGT TTGTATGTAT GGCCACGAGT TCGAAGATTA TACCCAGAGC
TCGATCGAGT TGGACGCATT TGTGTTCAAA CACTTTGACG AGCTAATGGC CAATTCTGAG
ACCAACCAGA CACTACACGA CGAGTTGCAC GAGGAAGGAA ATGTCTTCTT CTTGCATTTG
TTGGGTCCAG ACACAGCCGG TCATGCCTAC CGTCCGTATT CGGCCGAATA TTACGAGAAT
ATCGAGTATA TCGACATGCA GTTGTCCAAG TTGATTCCTA GAATCCATGA ATTCTTTGGT
GATGATGATT CTGCCTTTGT TTTCACAGCT GACCACGGCA TGTCCGATTT TGGATCGCAC
GGTGATGGCC ATCCTGACAA CACCAGGACA CCGTTGATTG CATGGGGTGC CGGTGTTAAC
AAGCCTAAAC ATATAAAGGA CTTACCTGAT CCGCAAGCCC AAAGAGCAAA ACAAGATCCA
GTCCGCAGCG GATACGAAGA TACATATTTT GAAACGTGGG AACTTGACCA TTTGGTTAGA
AATGACGTCA AGCAGGCTGA TATCGCTTCG TTAATGGCTT ATTTGATTGG TGCAAACTAT
CCTGCCAATT CTGTAGGTGA GTTGCCTCTT GCGTACCTCG ACACCGATGC TGTGACGAAG
ATCAAAGCTC TCTTCGCTAA CGCTTTAGCT ATTATTGAGC AATACTATGT CAAAGAAAAG
GAAGTGTACA ACCACCAATT CAAGTTCAAG CCTTTCCGGC CCTTCGACGA AAAGTCAATC
GATGAATACA GCGGTCAAAT TAACTCATTT ATTTATTCCT TACAAAATGA ACAACTCAGC
CAATCTCAAA AGGAATTATT GGAAAAAGAG GCTGTCATGG TTGTGGAAGA ATTGATGAAG
ACAGCTTTGG ATGGATTGAA TTATTTGCAA ACTTACAACT GGTTATTGTT GAGATCCATC
GTCACATTGG GCTTCATTGG ATGGATCGTT TATGCTTTTG GTCTTTTCTT AAAGTTGTTC
ATTATATCAG AAGAGGATTT ACAAACTTTG AAACCAGGTA ATCTGATATT CTTGTTGCTG
TCTTTCTCTG CATTAGCATT GTCCACCAAC TACTTGCTTT TCTATCAAAA CTCGCCTTTC
AATTACTACA TGTATGCTGC ATTTCCATTA TACTTTTGGT ACACCATCTT CAATGAGCTC
ACCTATCTTG GAGAAGGTTT GAATCAGTTG TTGTACGGTA TCTCCATTCC AACTAGAGTT
TTCATCGCAG TTTCCTTCAT TGGAATGTAT GAAGGTATAG CTTACGGCTT TTTTGAAAGA
TTTGTGTTTT CGATCATCTT CGTCCTTATA GGATTGTACC CATTGTTCGT CTCAGGAAAT
GAAAAAGTAT CTACTTACCA AAAGCTTGTA TGGTTAGCAA GTTGCCTGTT AATGTGTATT
TTCACTAACT TGAACCCGGT GAAAGTTGAA AGCTTACTCC AAATTAATGC TGGTGCCTTG
TTTTCGTTGA TAATTGCTCT GATTGGTATC AGCAAAGTCT TTAAGAGACC TATAGAGTCT
GTACAGAAAA GGTTGGTCAT ACATCAATTG CTCATAATTC CTCTTATCTT GTACGCTACG
AATGTTTCTG TACTTTCATT GCAAGCTAGG AATGGTCTTC CTTTATATTC ACAAGTTCTT
GGTTGGTTGT CTTTTGTTGC ATCCTTATTG TTACCAGTAT TCCATTCGGT CTATCCTTCA
AAGGACTACG CCTTGAGATT ATTGATTATT TTCCTTACTT TTGTTCCTGC CTTTATTATC
TTGACCATTT CATTTGAATT GTTGTTCTAC AATGGATTCT CCTTGATATT GCTTCAATGG
CTCAATATCG AAGAGAATTT AAAATTCCCA AGAGCAGAAA TTATCAAATC ACAAGAAGAA
ACTGGAAGGT TACCGAAAGG GTATTGGTTG CAGTTGATTA GAATATCTAT CATCGGCTTC
TTTTTCCTTC AGTTAGCATT CTTTGGCACT GGTAACATTG CATCGATTTC TTCTTTCTCA
TTGGACTCAG TTTACAGGCT TATTCCTATA TTCGATCCTT TCCCAATGGG GGCGTTGTTG
ATGTTTAAAT TGATTGTTCC ATACGTGCTA CTTTCCACTT GTCTTGGTAT CATGAACCAC
AACTTGGAAA TCAGACAGTT CACAATTTCC ACATTAATTA TTTCTACAAG TGATTTCTTA
TCGTTGAACT TCTTTTTCTT AGTGAGAACG GAAGGTTCTT GGTTAGACAT TGGACTCAGT
ATCTCCAACT ATTGCTTAGC TATCCTATCT TCGCTTTTCA TGTTGATTTT GGAATTGGTC
GGTTCCATAA TTTTGCGCGG AGTCGAATAC AATGACAAGT GGACGGAGAA AGAAGAGTAC
AAAATAGAGG AAATAGAAAC TGAAGAAGAA AAAGCAGAGG AAGTTGAGGT TGTCGAAATT
GATGAAACTG AAGAAATTGA AGTAATTGAA ATTGTCGAAG TTAACGAAGA GGAAATTGAA
GATGAAGATG AAGATGTCGA GGATGAAGAA GAAGATGATG ATCACAACGG TGAAATAGAC
CCCAGAAACA TTATACCAGC CCAATCGGAG ACCGAGGAAT TGCCGATCAG CGCAAGAATA
AGACGGAGAG GCGCCAAGCG CGATTGAAAA GGAGGACAAA AACACTAGAT TTATTGTATA
GTATATATAT ACAAAGGTAT ACTACCAAAA AGTTTAGAAA TTAGACACCG GGAAATTATA
AC
 
Protein sequence
MNRNRQLLLL VGLVFHFFYL WSIFDIYFVS PLVHGMDHHV STTTAPAKRL FLIVGDGLRA 
DKTFQKLKHP RTGETKYLAP YLRSIAQNEG TWGISNTRMP TESRPGHVAM IAGFYEDVSA
VTKGWKENPV DFDSFFNQSK HTYSFGSPDI LPMFAYGEGV VPGRIDVCMY GHEFEDYTQS
SIELDAFVFK HFDELMANSE TNQTLHDELH EEGNVFFLHL LGPDTAGHAY RPYSAEYYEN
IEYIDMQLSK LIPRIHEFFG DDDSAFVFTA DHGMSDFGSH GDGHPDNTRT PLIAWGAGVN
KPKHIKDLPD PQAQRAKQDP VRSGYEDTYF ETWELDHLVR NDVKQADIAS LMAYLIGANY
PANSVGELPL AYLDTDAVTK IKALFANALA IIEQYYVKEK EVYNHQFKFK PFRPFDEKSI
DEYSGQINSF IYSLQNEQLS QSQKELLEKE AVMVVEELMK TALDGLNYLQ TYNWLLLRSI
VTLGFIGWIV YAFGLFLKLF IISEEDLQTL KPGNSIFLLS SFSALALSTN YLLFYQNSPF
NYYMYAAFPL YFWYTIFNEL TYLGEGLNQL LYGISIPTRV FIAVSFIGMY EGIAYGFFER
FVFSIIFVLI GLYPLFVSGN EKVSTYQKLV WLASCSLMCI FTNLNPVKVE SLLQINAGAL
FSLIIASIGI SKVFKRPIES VQKRLVIHQL LIIPLILYAT NVSVLSLQAR NGLPLYSQVL
GWLSFVASLL LPVFHSVYPS KDYALRLLII FLTFVPAFII LTISFELLFY NGFSLILLQW
LNIEENLKFP RAEIIKSQEE TGRLPKGYWL QLIRISIIGF FFLQLAFFGT GNIASISSFS
LDSVYRLIPI FDPFPMGALL MFKLIVPYVL LSTCLGIMNH NLEIRQFTIS TLIISTSDFL
SLNFFFLVRT EGSWLDIGLS ISNYCLAILS SLFMLILELV GSIILRGVEY NDKGINEEEI
EDEDEDVEDE EEDDDHNGEI DPRNIIPAQS ETEELPISAR IRRRGAKRD