Gene PICST_34985 details

Gene Information

Locus tag: PICST_34985
Symbol:
ID: 4836922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1051390
End bp: 1052505
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 12
GC content: 45%
IMG OID: 640388237
Product: predicted protein
Protein accession: XP_001382978
Protein GI: 126132906
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
TIGRFAM ID: [TIGR01123] branched-chain amino acid aminotransferase, group II
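
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1051390
end_bp = 1052505
reported_gene_length = 1116     # bp
reported_protein_length = 371   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not counted as a residue.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length == reported_gene_length)        # coordinate span vs. Gene Length
print(remainder == 0)                             # length is a whole number of codons
print(protein_length == reported_protein_length)  # codons minus stop vs. Protein Length
```

Under these assumptions all three checks agree with the reported values: 1116 bp is 372 codons, one of which is the stop codon.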


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.127685
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.13999
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTACTG CTCCATTAGA CTCAACCAAG TTGGTAATTG AGAAAACCAC TAATCCAAAG 
GAAGTGTTGC CTAAGGAAAA ATTGGCATTT GGAAAGTCTT TCACTGACCA TATGTTGGAA
GTAGAATGGA CTGCTCAGAG CGGTTGGGGT ACTCCAAAAT TGTCGCCATA CCATAATTTT
TCACTCGATC CAGCCACCTG TGTGTTTCAT TACTCTTTCG AATTGTTTGA AGGTATGAAG
GCTTATCGTG ACAAGGATGG CAAGATCAGA ACATTCAGAT CAGACAAGAA CATGGAAAGA
ATGAACGGCT CTGCCGCCAG AGCATCGTTG CCAACTTTCG ATGGAGAAGA GTTCATGAAG
ATTATTGATA AGTTGTTGCT TGCTGATGAA AGGTTTGTGC CTGAAGGTTA TGGCTACTCA
TTGTACTTGA GACCAACTTT GATTGGTACA ACTCCTGCTT TGGGTGTGGC TGCTCCTGAT
AAGGCTCTAT TGTATGTAAT TGCATCTCCT GTGGGGCCAT ACTTTGCAGA AGGATTCAAG
CCTGTTTCCT TAGAAGCCAC TGACTATGCT GTAAGAGCCT GGCCAGGTGG TGTTGGAGCT
TTCAAATTGG GTGCCAACTA CGTCTCTTGT ATCCAGCCAC AAAGTGAGGC TGCCAAGAGA
GGTCATTCCC AGAACTTGTG GTTGTTCGGC GAAGAGGGTT ACATCACTGA AGTTGGTGCC
ATGAATGTAT TCTTTGTATT TCAGAATGCT GACGGCAAGA AGGAACTTGT CACTCCTCCT
TTGGATGGTA CTATTTTACC TGGTGTAACA AGAGACAGTA CCTTAACTTT GGCTAGAGAA
AAATTGAACT CAAACGAATG GATTGTATCC GAGCGCCCAT TGACAATCTA CGAAGTCAAG
GAAAGAGCAC TCAAGGGTGA GTTAGTGGAA GCCTTTGGTA CTGGTACTGC TGCAGTTGTG
TCTCCAATCA AGAACATCGA GCACCGTGGT GAAGCCATCG AGGTTCCAGT AGAAGATGGG
AAGGCTGGAG CTTTCACTAA GCAAATCAGC GAATGGATCA GAAGTATCCA GTACGGTGAA
GAGGATTTCA AGAACTGGTC GAGAGTTGCA AAATAA
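
The gene sequence is printed in groups of ten bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of that cleanup plus a GC-percentage calculation; the function names `clean_sequence` and `gc_percent` are hypothetical helpers, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First line of the gene sequence, copied as displayed (six groups of ten).
first_line = "ATGACTACTG CTCCATTAGA CTCAACCAAG TTGGTAATTG AGAAAACCAC TAATCCAAAG"
sequence = clean_sequence(first_line)

print(len(sequence))                  # number of bases after cleanup
print(round(gc_percent(sequence), 1)) # GC percentage of this fragment only
```

Passing the full 1116-base sequence through the same two functions gives a value that can be compared with the GC content reported in the Gene Information table.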
 
Protein sequence
MTTAPLDSTK LVIEKTTNPK EVLPKEKLAF GKSFTDHMLE VEWTAQSGWG TPKLSPYHNF 
SLDPATCVFH YSFELFEGMK AYRDKDGKIR TFRSDKNMER MNGSAARASL PTFDGEEFMK
IIDKLLLADE RFVPEGYGYS LYLRPTLIGT TPALGVAAPD KALLYVIASP VGPYFAEGFK
PVSLEATDYA VRAWPGGVGA FKLGANYVSC IQPQSEAAKR GHSQNLWLFG EEGYITEVGA
MNVFFVFQNA DGKKELVTPP LDGTILPGVT RDSTLTLARE KLNSNEWIVS ERPLTIYEVK
ERALKGELVE AFGTGTAAVV SPIKNIEHRG EAIEVPVEDG KAGAFTKQIS EWIRSIQYGE
EDFKNWSRVA K
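
The record lists translation table 12, which is NCBI's alternative yeast nuclear code: it differs from the standard code in reading CTG as serine rather than leucine. The sketch below builds that codon table from the usual TCAG-ordered amino acid string and translates a cleaned sequence up to the first stop codon. The `translate` helper is illustrative, not the database's own pipeline, and it does no handling of ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 12, in TCAG codon order.
# Only CTG differs from the standard code (S instead of L).
AAS_TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AAS_TABLE_12)
}


def translate(dna: str) -> str:
    """Translate a DNA string with table 12, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed.
print(translate("ATGACTACTG CTCCATTAGA CTCAACCAAG"))  # MTTAPLDSTK
print(translate("CTG"))                                # S
```

The first ten residues produced this way match the start of the protein sequence shown in the record.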