Gene PICST_34492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_34492 
SymbolARG1 
ID4851585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2213310 
End bp2214560 
Gene Length1251 bp 
Protein Length416 aa 
Translation table 
GC content47% 
IMG OID640393293 
Productargininosuccinate synthetase 
Protein accessionXP_001387008 
Protein GI126274941 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.491384 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAAAG GTAAAGTTTG TTTGGCTTAC TCCGGTGGTT TAGATACCTC GGTCATCTTA 
GCCTGGTTGT TGGAAGAAGG TTATGAAGTC ATTGCTTTCT TGGCCAACAT CGGTCAGGAA
GAAGACTTTG AAGCTGCTGA AAAGAAGGCT TTGGCCATTG GTGCCACCAA GTTCGTCGTT
GTGGATGTCA GAAAGGAATT CGTCGAATCC GTCTGCTTCC CAGCCATTCA AGCCAATGCA
ATCTACGAAA ATGTATACTT GTTGGGTACT TCCTTGGCTA GACCAGTTAT TGCACAGGCC
CAGATTAAAG TTGCTGAAGA AAACGGTTGT TTCGCTGTCT CCCACGGTTG TACCGGTAAG
GGTAACGATC AGGTCAGATT CGAGTTGTCT TTCTACGCCT TGAAGCCAGA TGTCGTTGTC
ATCGCTCCAT GGAGAGACCC AGCTTTCTTC AACAGATTCG CCGGTAGAAA CGATTTGTTG
GAATACGCTG CCTCCAAGAA CATCCCCGTT GCTCAAACCA AGGCTAAGCC ATGGTCCACC
GATGAAAACT TGGCTCACAT CTCCTTTGAA GCCGGTATCT TGGAAAACCC AGACACCACT
CCTCCAAAGG ACATGTGGAA GTTGACAGTT GACCCAACCG ATGCTCCAGA CACCCCAGAA
GACTTTACTG TTGTCTTTGA AAAGGGATTG CCTGTCAAGT TGATCTTGGA TGGAGGTAAG
AAGGTTATCA CCGAATCTGT TGAATTGTTC CTCGAAGCTA ACGCCTTGGC CAGAAGAAAC
GGTGTCGGTA GAATCGACAT TGTCGAAAAC AGATTCATCG GTATCAAGTC CAGAGGTTGT
TACGAAACCC CAGGATTGAC CATCTTGAGA TCTACCCACA TTGATTTGGA AGGTTTGACT
TTGGACCGTG AAGTCCGTGC TATCAGAGAC CAATTCGTCT CCGTCACCTA TTCCAAGTTG
TTGTACAACG GTATGTACTT CACTCCTGAG TGCGAATATG TCAGATCTAT GATCCCTCCA
TCCCAGATCA CCGTTAACGG TCAAGTCAGA GCCAGAGCCT ACAAGGGTTC TTTGACCATC
TTGGGTAGAT CTTCGGAAAC CGAAAAGTTG TACGACGAAA CTGAATCTTC CATGGATGAA
TTGACCGGTT TCTCTCCAGA AGACACCTCA GGTTTCATTG CCGTCCAATC CATCAGAATC
AAGAAGTATG GTGAAGCAGC CAGAGAGAAG GGCCAAACTT TAAACTTATA A
 
Protein sequence
MSKGKVCLAY SGGLDTSVIL AWLLEEGYEV IAFLANIGQE EDFEAAEKKA LAIGATKFVV 
VDVRKEFVES VCFPAIQANA IYENVYLLGT SLARPVIAQA QIKVAEENGC FAVSHGCTGK
GNDQVRFELS FYALKPDVVV IAPWRDPAFF NRFAGRNDLL EYAASKNIPV AQTKAKPWST
DENLAHISFE AGILENPDTT PPKDMWKLTV DPTDAPDTPE DFTVVFEKGL PVKLILDGGK
KVITESVELF LEANALARRN GVGRIDIVEN RFIGIKSRGC YETPGLTILR STHIDLEGLT
LDREVRAIRD QFVSVTYSKL LYNGMYFTPE CEYVRSMIPP SQITVNGQVR ARAYKGSLTI
LGRSSETEKL YDETESSMDE LTGFSPEDTS GFIAVQSIRI KKYGEAAREK GQTLNL