Gene PICST_31331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31331 
Symbol 
ID4838844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp533066 
End bp535024 
Gene Length1959 bp 
Protein Length652 aa 
Translation table12 
GC content42% 
IMG OID640390159 
Productpredicted protein 
Protein accessionXP_001384399 
Protein GI150865257 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATGA ACTTCAATCA AACTGCTTCC AGTGATCGAG ACCAGATTTC TGCATTTTAC 
GACCAAACGC ATGTAGAAGC TTACAGCCAT GATCCAGAAT TCACACAGAT ATTCTCGACT
GATGTACTTC CAAACTCTGA AGATGTGGAA TTAGCTAATT ATCTATTATA CCTTGACAAC
GACTACAACT CGTTCATTCA GATCGCAAAC GCAATTCTGA TAAACGACTC CAAGTTTCAG
ATAGAAGACT CCAACTCGCA ATTAGCTGAC TCGACACCCC AGACAGAAGC ATTGTCAATG
ATGAGCTCTA CAGAAATCGA ACAACTTGTG AACTACAAAG TTAGCAATAA CAATACCAAT
AATAATTTCG ATACTAATGG TACTTATGGA ACTAACTTCC AGGATCAAAA TCAAAGCGAA
AATGGATTTC CTCACTTAGT CAGTCACCAA ACTCAAGATA CTCCAAGGAT TACCGTCACG
GATGAGACAG GAAGAGAGAT TTCCCAATGG CAACAACCTA CTTACGAATC AAGTACCTCT
TTGCAGGACT TCTACTCGTA TGATGTGGCA TTGCAAGACC CTATCCCATC TTCAAGTAGT
AGTCTTATTC TTGGGAACTT GGATGAAACT GCAAGGCAAA ATCATGCTGC AAACTACCCG
TTTTCTGAGA GTGATCAGCC CGAATTTCAT ATTCCAAAAA TAGAAGAACT GATTCCAGAT
CCAGACGACT TTGCTCAATA TTCTCTCCTC ATATACCCAC AAGCCACAAC AGTAGCAAAA
AACATACCAA TCTTCAATGT CCCCACTAGC AGTCAAGGAT CTTCAAGAGC AGTTGCTCCT
ACGGTTCATT TCCAGCTCGA AACAAAAGAT CATACTGATG CAAACTTACA ACCCCGTACG
AATAACTCGC TGAGCAGCTC CTTCAGCCTT TCTACTTCCC CTGCAATGGT ATTTGACGAA
AGCAACCCTC TAAGGAGTTC CAGAAGTTCA AGAAGCTCCA GAAGTACACT AAATTCGATG
AATTCTTTGG ATAGCAAAGG TCCCAGTTTT ACTGAGGTAG CTAGCCAAGG CGAAGTTGCC
AGCTACTTCG AAGTAGTCAG ACATAGTGAC CAAGTCAGAG AATCTGAGGT AGACAGAGAA
ACTATGGAAG TCGGAAATTC CAGATTCAGG GTTCTGCCCT TGGCTAAGAT CAGCATTTCC
ATTTCGCCCG AGAGTGTTTC ATGTCTCAAC TGTAATATTG ACTACGGAAA CTACACTGGA
AAAATCACCA ACAAAATCGA AGGTCACTTA GCATTGGCGA ACGTGTATGT TGCCCCTGGT
GCTCATGAAA GTCAGATAAC CGACGCAACA CTTATTAAGC TATACACAGA GACAAAAGAA
TTGTCAGCCT CATCATGTAT CCATCAGAGA ACTAAGTACG ACAGGGAAAT CGATGAGTTA
CACAATTCCA TTAGCAATAT TGTCTACAAA ACTAGTAGCA AATATTCGTT AGATATGCCC
TACGAGCCAC AGTACCTAAG GTTTGAAGTT GGTCCCGATA GTAGGTTGGT TATGGCGAGC
AAGAGTGGTT TGTGTCCCTA TTGCGAAGAA GTGAGATTCC TTCCGTTCAA GAACTCGAGT
TACTTGTCTC ATTTGACATT GGAACACGGA GTGTTCTCCA ACGGTTATCT CACGCCTGAT
GGGCTCTATT TTGGATCTTA CAAGTTGAAG AAGAATAGCA GTAGGAATAA CCAGGAGCAC
ACTCCATCTG GCAGAGAACG ACAAGTAGAA GCCTTGATGT GCCCCTTGTG TTTCGATATG
GTGGAATTTG GATGTTGGGA AGGAAAGAAA AACAAGCTCT TGTCTTACTT CAGACACTTT
AAAAATATCC ATGGTCAACA TACGATCAAG GCGAGAAGCT CACAGATTCC ACCCATTCAA
GACCGGGGCC GTACGCTCCA TATTCTACCA GATCCTTAG
 
Protein sequence
MNMNFNQTAS SDRDQISAFY DQTHVEAYSH DPEFTQIFST DVLPNSEDVE LANYLLYLDN 
DYNSFIQIAN AISINDSKFQ IEDSNSQLAD STPQTEALSM MSSTEIEQLV NYKVSNNNTN
NNFDTNGTYG TNFQDQNQSE NGFPHLVSHQ TQDTPRITVT DETGREISQW QQPTYESSTS
LQDFYSYDVA LQDPIPSSSS SLILGNLDET ARQNHAANYP FSESDQPEFH IPKIEESIPD
PDDFAQYSLL IYPQATTVAK NIPIFNVPTS SQGSSRAVAP TVHFQLETKD HTDANLQPRT
NNSSSSSFSL STSPAMVFDE SNPLRSSRSS RSSRSTLNSM NSLDSKGPSF TEVASQGEVA
SYFEVVRHSD QVRESEVDRE TMEVGNSRFR VSPLAKISIS ISPESVSCLN CNIDYGNYTG
KITNKIEGHL ALANVYVAPG AHESQITDAT LIKLYTETKE LSASSCIHQR TKYDREIDEL
HNSISNIVYK TSSKYSLDMP YEPQYLRFEV GPDSRLVMAS KSGLCPYCEE VRFLPFKNSS
YLSHLTLEHG VFSNGYLTPD GLYFGSYKLK KNSSRNNQEH TPSGRERQVE ALMCPLCFDM
VEFGCWEGKK NKLLSYFRHF KNIHGQHTIK ARSSQIPPIQ DRGRTLHILP DP