Gene PICST_44075 details

Gene Information

Locus tag: PICST_44075
Symbol:
ID: 4838193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 786476
End bp: 787615
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 12
GC content: 48%
IMG OID: 640389508
Product: predicted protein
Protein accession: XP_001383431
Protein GI: 126133813
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0075] Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase
TIGRFAM ID:
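
The coordinate and length fields in this record are related by simple arithmetic, which makes a quick consistency check possible. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(786476, 787615)
print(length, protein_length(length))  # 1140 379
```

Under these assumptions the values agree with the Gene Length and Protein Length fields of this record.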


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTACTC CCTACAAGCA ACCACCCCAC AAATTGACGA TGATTCCTGG CCCCATCGAA 
TTCTCTGACG AGGTTCTTGG GGCTATGGCC ACACCTTCGC AGGCCCACAC TTCTCCCGAG
TTTGTCAAAA CGTTCCAGTC GGTCTTGCAG AACTTGAGAA AGTTGTTCAA GTCTTCTGAC
CCCGACGCAC AGGCCTATGT AATTGCTGGT TCTGGAACTT TGGGCTGGGA CATTGTTTCC
ACTAACTTGC TTAGCCCAGG AGACAAAGTG TTGGTTTTGT CGACGGGATT CTTTTCCGAT
TCATTTGCTG ACTGTTTGAA GATTTACGGA ATCGATGTTG ATGTCGTTAC TGCTCCTGTC
GGAGGAGTGG TTCCGGTCGA AACCGTCGCT GAAAAGTTAA AGTCTACTAA GTACACAGCC
ATTACCATCA CCCATGTTGA TACGTCGACT TCTGTGGTAA GTGACGTGAA GGCTGTTTCT
GAAATCGTAA AGAAGGAATC GCCAGAAACG TTGATTGTAG TCGATGGAGT CTGTTCTATC
GGGGTAGAAG ACTTGGAGTT CGACAAATGG GGTATCGATT TCGCCTTGAC AGCTTCACAG
AAGGCCATTG GTGTTCCTGC TGGTTTGTCC ATCTCCTTTG CCTCGGCCAG AGCAGTGGCA
AAAGCTTTGG CAAGAAAGGA AACTGTCTTC TTTGCCTCGT TGAAGAGATG GACTCCGATC
ATGAAGGCTT ACGAATCCGG TAACGGTGCC TATTTTGCCA CGCCAGCCGT CCAGACTATC
ACCGCTTTGA AGGTATCGTT AGATGATATC TTGAGTGGTA GCATCGATGA CAGATTTGCT
AAGCACGCTG AAATCTCGTC TAAGTTCAAG TCGAGCGTTG AAAAGTTAGG CTTGAAGATA
GTTCCTCTCA GCCACGATGT CGCTGCTCAC GGATTGACCG CTGTTTACTT CCCAGAAAAC
ATCAATGGTG CCGACTTGCT TGCCAAGTTG AGCTCCAAGG GTTTCACCGT TGCTGGTGGT
ATCCACAAGG CTTTGGTAGG AAAATACTTC AGAGTAGGTC ACATGGGCTA CTCAGTCTAC
GCTGGACACG TAGACCAGCT CACCAAGGCT CTTGAAGAAT CATTGGACGA ACTCAAATAG
 
Protein sequence
MATPYKQPPH KLTMIPGPIE FSDEVLGAMA TPSQAHTSPE FVKTFQSVLQ NLRKLFKSSD 
PDAQAYVIAG SGTLGWDIVS TNLLSPGDKV LVLSTGFFSD SFADCLKIYG IDVDVVTAPV
GGVVPVETVA EKLKSTKYTA ITITHVDTST SVVSDVKAVS EIVKKESPET LIVVDGVCSI
GVEDLEFDKW GIDFALTASQ KAIGVPAGLS ISFASARAVA KALARKETVF FASLKRWTPI
MKAYESGNGA YFATPAVQTI TALKVSLDDI LSGSIDDRFA KHAEISSKFK SSVEKLGLKI
VPLSHDVAAH GLTAVYFPEN INGADLLAKL SSKGFTVAGG IHKALVGKYF RVGHMGYSVY
AGHVDQLTKA LEESLDELK
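
Translation table 12 is NCBI's alternative yeast nuclear code, which reads CTG as serine rather than leucine. As a minimal sketch of how the protein sequence can be derived from the gene sequence, the hypothetical `translate` helper below builds the codon table from the standard code with that single change, and is applied to the first 60 bases of the gene sequence listed in this record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order.
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]
TABLE_12 = dict(zip(CODONS, STANDARD))
TABLE_12["CTG"] = "S"  # the only difference from the standard code


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = TABLE_12[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGGCTACTC CCTACAAGCA ACCACCCCAC AAATTGACGA TGATTCCTGG CCCCATCGAA"
print(translate(first_line))  # MATPYKQPPHKLTMIPGPIE
```

The result matches the first 20 residues of the protein sequence above. The helper strips whitespace first, so the 10-base blocks used in this listing can be pasted in unchanged.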