Gene PICST_28429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_28429 
SymbolAMB1 
ID4851207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1196174 
End bp1197472 
Gene Length1299 bp 
Protein Length432 aa 
Translation table 
GC content45% 
IMG OID640392915 
Productbeta alanine synthase 
Protein accessionXP_001387458 
Protein GI126274193 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.406154 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0472974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTCA AATCGTTAAA ACTTACCCCC GGCAGATTGC TTGCTACCAT TCATGACACG 
GCTGAGAAGT TCGGTGCCAA AGGAGTGTGG GGCCCAGCAT CGACTGAAAC TGGAGTATGT
CGTCTCACGT TAAGCGATCT TGACAAAGGA GTGAGAGATT GGTTTATCGC GGAAACGAAA
GGCTTAGGCT GCGAAATCAA GGTCGACCAG ATCGGTAATA TTTTTGCCAT ATACCCAGGG
AAAAAAGAAA ATGCCAATGC TCTCCCAACA GCCATTGGGT CTCATTTGGA TACCCAGCCT
ACTGGGGGAA GATACGACGG GATCTACGGA GTGTTGTCTG GGTTGGAAGT GTTGAGAACA
CTAAAGGACA ACGACTTTGT GCCCAACTAC CCAATTGCGC TTATAGACTG GACCAATGAA
GAAGGTGCCA GATTCCCCAT GTCGATCATG GCCTCGAGTG TATGGGCTCA AAACATTCCT
TTGGAACGGG CCTATAAGCT TGAGTCAGTG ACAGATGCCG AGCCTGTAAC TGTAGAACAC
GAGTTGAAAA GAATCGGCTA TTTGGGTGAA ACTGCAGCCA GTTACCTTGC CAATCCCATT
AAAGCTCATT TCGAGATTCA TATTGAGCAG GGCCCTATTC TTGAAGACGA GGACAAACTC
ATTGGTATCG TCACAGGAGT CCAAGCATAT TCTTGGATCA AGGTAAAAGT ATTTGGTAAG
GCACAACATA CAGGGACTAC ACCTTTGGCA GCTCGTTCTG ATGCCTTGTT AGCAGCTTCC
AAGATGATTG TCAAAGGTAA CGAATTGGCC AAGAAACATA ACTGTTTATT CTCTGTAGGT
GTTCTCAATC TTCAACCAGC TGTAGTCAAT GTGATTCCCG AACATGTCGA GTTCATTATC
GATGTACGTC ATGTGAAGGA TGATGGTTTG AGCGTAATTT TGGAGGAGAT CAAGTCGGAC
TTTGTTCTGA TTGTTGGTGA TTCTGGAAGG GCTTTGACAG TCGAGTTTGA CCACATTTAC
ACTTCAGATG CTGTCAAATT CCATGAAGAC TGTATTTCTA GTGTAACCGA ATCAGCGGAA
GAGATAGTGG GAAAAGAGAA GGCTCGTACT ATCATCAGTG GTGCTGGTCA CGACTCATGT
GCTACAAGTA CTAGAGTACC TACGTCGATG ATCTTCATTC CTTCGAAAGA CGGAGTCAGT
CACAACCCTG CCGAATACAG TAAGCCGGAA GAGGTCCACA CTGGATTTGA AGTATTGCTT
AATGCGGTGC TCAAGTACGA TAGCAAGAGA ACTGATTAA
 
Protein sequence
MSFKSLKLTP GRLLATIHDT AEKFGAKGVW GPASTETGVC RLTLSDLDKG VRDWFIAETK 
GLGCEIKVDQ IGNIFAIYPG KKENANALPT AIGSHLDTQP TGGRYDGIYG VLSGLEVLRT
LKDNDFVPNY PIALIDWTNE EGARFPMSIM ASSVWAQNIP LERAYKLESV TDAEPVTVEH
ELKRIGYLGE TAASYLANPI KAHFEIHIEQ GPILEDEDKL IGIVTGVQAY SWIKVKVFGK
AQHTGTTPLA ARSDALLAAS KMIVKGNELA KKHNCLFSVG VLNLQPAVVN VIPEHVEFII
DVRHVKDDGL SVILEEIKSD FVLIVGDSGR ALTVEFDHIY TSDAVKFHED CISSVTESAE
EIVGKEKART IISGAGHDSC ATSTRVPTSM IFIPSKDGVS HNPAEYSKPE EVHTGFEVLL
NAVLKYDSKR TD