Gene PICST_32587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32587 
SymbolGAT2.1 
ID4839991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp171315 
End bp172454 
Gene Length1140 bp 
Protein Length379 aa 
Translation table12 
GC content39% 
IMG OID640391306 
ProductGATA-family of DNA binding protein-like protein 
Protein accessionXP_001385365 
Protein GI150865945 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.527498 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACCCC CTAGCGAAAT TTTGAACAAT GAAATAATGG CTGCAAGAAT TCTTGTGAAT 
GCGAAGTTTG CAAACAAAAA TTTGCAACCG TTGAATACAA CTTTCGTTTC TATCAATACC
CCAATGGTTC AGCAGGAGGT TTCTTTGCAA AGAAGTACGG TAAGATGTCT TCCTCCACCA
ACTCCGTCAC CTACACCACC ATATACTACC GACCCCAAAG CATATACCGC TCCTCAGCTA
AACAAAAATA GAACTTGCTA CTATCACAGC CAAGCTAACT TCCACATCAT GAATCCTTAT
TGCGCTAACT GTATTTATTG TGGTCATGAA TACAAGAACC AAATTGTTAG ACTTGCTAAT
GAAGTGTCTA ATAATGCTAA CAACTACGCC AGCACTTTAT GGAATTACAA GAGCACTTTG
AACAGAATAG TTGCAGCGAA CGCTCAAGAT CTGGCAGAAA GTGAGGAGCT TTATAACAAC
TTCAATAATT TGTCTTACCA AGTAAAGGGT CATTTGCTGA CAATTGCAGC TACTTCTTCA
TCAATTACAA ATATTGAATC TTCCGTCAAC AAGCTCAGAG AAATTTCTTC GGACAAAGAG
TCAAACAATC CCATTACTCC TCCATTAGAA GAACAGACAC AAATTGTAAT ACACGAACCT
AATGCACAGA AAGAAAGTGC ACAAATTCAG ACATTTCATT ATTCTGAACC ACAAGAGAAC
CTTGTACAAA AGAAGCAGAT CGTTCAACTT TCTATGCCAG TTCCAACACA ATCTCCACAA
ACTTTACAGA CAGAAACCAA AACGAAAGAG TCTTTAAAAC CCAAAAAAGG ACGCCCAATC
TTACTTAAGA AAAGGGCTAA GGAACCGAGG AAATCTAAAA TCAATGTCAA AGTTTCGAAA
TGCTCTCATT GTCAATCACA TAGCACACCG GAATGGAGAA GGGGTCCAGG AGGTGTTCGT
TCATTGTGTA ACGCTTGTGG GTTGTTCTAC TCTAAGCTTG TTAAGAAGTT TGGAACAACA
GATGCAAATA CCATCTTTCT ATCCAGAAAG GAGTCTGATA AATTGATAGA CAGAACTATT
CCCACCACTC TACAGAGTAA TGCTATCTTA CAGAAACAAC TACGGCTAAT TCGAAAGTAA
 
Protein sequence
MAPPSEILNN EIMAARILVN AKFANKNLQP LNTTFVSINT PMVQQEVSLQ RSTVRCLPPP 
TPSPTPPYTT DPKAYTAPQL NKNRTCYYHS QANFHIMNPY CANCIYCGHE YKNQIVRLAN
EVSNNANNYA STLWNYKSTL NRIVAANAQD SAESEELYNN FNNLSYQVKG HLSTIAATSS
SITNIESSVN KLREISSDKE SNNPITPPLE EQTQIVIHEP NAQKESAQIQ TFHYSEPQEN
LVQKKQIVQL SMPVPTQSPQ TLQTETKTKE SLKPKKGRPI LLKKRAKEPR KSKINVKVSK
CSHCQSHSTP EWRRGPGGVR SLCNACGLFY SKLVKKFGTT DANTIFLSRK ESDKLIDRTI
PTTLQSNAIL QKQLRLIRK