Gene PICST_51037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_51037 
SymbolARG2 
ID4851202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1184579 
End bp1186324 
Gene Length1746 bp 
Protein Length581 aa 
Translation table 
GC content40% 
IMG OID640392910 
Productacetylglutamate synthase 
Protein accessionXP_001387873 
Protein GI126274188 
COG category[E] Amino acid transport and metabolism 
COG ID[COG5630] Acetylglutamate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0751076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0746208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAAT TGAAGAATTT GAACCGAGAA TTCATCTCCA ACTTGAAAAG TCACAAGCTC 
ATAACCGATG CTAAACGGAA CTTAATACTC TCCATCCTCA AATCTACTAC TACCAAGAGG
GAAGCCCGGA ATTACCTCAA TAAGTACCAG AACCAGTTTG ATTTTGGCGA TTTGAAGATC
TCGTCATCAG CAAAATACGA GCAGGATGTT TCCAAATTGA CAAAAAGAGA CTCCCAGCGT
GAGTTGTTTG TCAACAGGTA CTTGAACAAA CAGAATCCAT TCATCAATAT CTACGATGAT
GAGACCAAAC TCAAAAAAAT CCCTTTGAGA GTAGCATTGT TCAAGTTGAA ATTTCTCAAC
ATCGATCCCA AAGAATGGCG TGGCATTGCC GAGACTTTCA AGCGATTGGT AAATTTGGGA
ATTTCGCCCA TAGTGTTTCT AGATTATGAC CATCTTCCCA CAGACTCGTT CAAGTATAAC
GAATTGTATA TGATTAATCA GGTCAATAAG GTCATGAACT ACCTTGGAAA ACCCGAAGAA
GAAGGAAACC TAAAAACGAC GGTTTTGCGG TCATTATTCA CTGTCGAGAA CAAAGAAAGG
GGTCCAGTAA TTAATAGTTT GGAATCTATA TTGATTCCCT TGTATCAGGG AATTATTCCT
TTTATCCAGC CCATCATTTA CAATGCTGAG AGTACATTTC AGCAGTTTAT CAACTCGAAT
CAGCTTTTGT ACAGCTTGTG TGAATCGTTG TTGGACAAGA AGGATCTTCT CTCAGTGGAG
AAAATAGTCA TGATTGATCC TATTGGAGGA ATTCCCTCGG TTGAAAGAAA CCAGACGAGC
CACGTATTTA TCAACTTGTC TCAAGAATAC TCGGATATAG TTTCCGAATT GTATATTGGA
CATATTGAGC CTGATCAACG TGATTTGCAT CTTGCCAACT TGAATACCAT GCATGAAATC
TTGACACTAG CTTCCTCCAA ATCGGGCAAT GATGACACGA CGGGGATCAT CACCACTCCA
TTCATCATGT CTGTCAATGA TGATCTCATC AATCCGATTA TTTATAATGT TTTGACGGAT
AGGCCCATCA TCTCTTCATC TCTACCTAGC TCCAACAACA GGACACCACA GCTTTCAACT
TCTATTTTGA AGAAAGGAGT GGATGTGCGA TCGTACGATG CCGACAACTA TGCGAGAAAG
TTCACTTTGC ATAATTTAAT AGAAGATGAA CTCGTAGACA AAAATAGGTT GGTAGCTCTT
CTAGATGATT CGTTCGGCAA GAACTTGGAC ACAGATTCTT ATTTTGATAG AATCAATAAT
TCGCTAGCTA CCCTTGTCAT TGTAGGGGAT TACGACGGTG CTGCTATCAT CACTTGGGAG
TATAGTGGTA CCAACAAGAT CGCGTACTTG GACAAGTTCG CCATAGCAAA GAAGAACCAA
GGATTACCTG GATTGGCAGA TGTGATCTTC AAGATAATTC TCCTGTCGCA TCCCCATGAG
TTGATATGGA GATCTCGGAA AGTAAACCCT GTCAATAAGT GGTACTTTGA GAGATGTGTA
GGCTCCATGA GTTCACCTGA GTCCCAATGG AGAATCTTTT ACACGGGTGA TATTTTCAAC
CGCAGAATCG ACAAGAGAAG AAAGAGAATA GTTGGGAGTG AAGCTGTAAA CATTTCAGAC
AAATTGGTGC AATACAGTGA AATTTGTGAA GGCATTCCTC CTTCTTTCTT TTCGTCTAAG
GAATGA
 
Protein sequence
MSKLKNLNRE FISNLKSHKL ITDAKRNLIL SILKSTTTKR EARNYLNKYQ NQFDFGDLKI 
SSSAKYEQDV SKLTKRDSQR ELFVNRYLNK QNPFINIYDD ETKLKKIPLR VALFKLKFLN
IDPKEWRGIA ETFKRLVNLG ISPIVFLDYD HLPTDSFKYN ELYMINQVNK VMNYLGKPEE
EGNLKTTVLR SLFTVENKER GPVINSLESI LIPLYQGIIP FIQPIIYNAE STFQQFINSN
QLLYSLCESL LDKKDLLSVE KIVMIDPIGG IPSVERNQTS HVFINLSQEY SDIVSELYIG
HIEPDQRDLH LANLNTMHEI LTLASSKSGN DDTTGIITTP FIMSVNDDLI NPIIYNVLTD
RPIISSSLPS SNNRTPQLST SILKKGVDVR SYDADNYARK FTLHNLIEDE LVDKNRLVAL
LDDSFGKNLD TDSYFDRINN SLATLVIVGD YDGAAIITWE YSGTNKIAYL DKFAIAKKNQ
GLPGLADVIF KIILLSHPHE LIWRSRKVNP VNKWYFERCV GSMSSPESQW RIFYTGDIFN
RRIDKRRKRI VGSEAVNISD KLVQYSEICE GIPPSFFSSK E