Gene YPK_4087 details

Gene Information

Locus tag: YPK_4087
Symbol:
ID: 6090620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 4514621
End bp: 4515994
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 51%
IMG OID: 641599186
Product: argininosuccinate lyase
Protein accession: YP_001722799
Protein GI: 170026294
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
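
The coordinates and lengths listed above are mutually consistent. A minimal sketch of the arithmetic follows, assuming 1-based inclusive coordinates and a single terminal stop codon that is counted in the gene length but not in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    last codon is a stop codon that contributes no residue.
    """
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4514621, 4515994)
print(length, "bp")                      # 1374 bp
print(protein_length_aa(length), "aa")   # 457 aa
```

With Start bp 4514621 and End bp 4515994 this gives 1374 bp, which is 458 codons, or 457 residues once the stop codon is excluded.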


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACTAT GGGGCGGACG GTTTAGTCAG GCAGCAGATC AGCGTTTTAA GCAATTCAAT 
GATTCACTGC GGTTTGATTA CCGGCTGGCA GAGCAGGATA TTATCGGTTC TGTTGCCTGG
TCGAAAGCGC TGGTTACTGT TGGCGTATTG AACGCTGATG AACAGCAACA ACTTGAACAG
GCGCTGTCAG TTTTACTGGA AGAGGTGCAG GCCAACCCAC ACGCTATTTT GGCCAGTGAC
GCTGAAGATA TCCACAGTTG GGTCGAAACC AAACTGATCG ATAAAGTGGG TGATTTAGGC
AAAAAATTAC ACACTGGGCG CAGCCGTAAT GATCAGGTAG CGACCGATCT GAAATTGTGG
TGTAAGTTCC AGATAACCGA ATTACAGACG GCGGTACAAC AACTGCAACA GGCGTTGGTC
ATGACCGCTG AAGCGAATCA GGATGCGGTC ATGCCTGGTT ATACTCACTT ACAGCGTGCT
CAACCCGTGA CGTTTGCACA TTGGTGTTTG GCCTATGTAG AAATGCTGTC ACGTGATGAA
AGCCGTTTAC AAGATACCCT GAAACGTCTG GATGTTAGCC CATTAGGTTG CGGTGCGTTG
GCGGGCACGG CGTATGCTAT CGATCGTGAA CAACTGGCGG GCTGGTTGGG CTTTGCTTCT
GCTACTCGTA ACAGCCTGGA CAGCGTTTCT GACCGTGACC ATGTGCTGGA GCTATTATCT
GACGCCAGCA TTGGTATGGT ACATTTGTCT CGCTTTGCTG AAGATTTAAT TTTCTTCAAC
AGCGGTGAAG CAGCTTTTGT CGATTTATCG GATCGCGTGA CATCGGGTTC CTCCCTAATG
CCACAGAAGA AAAATCCAGA TGCGCTGGAG CTCATCCGTG GTAAATGTGG CCGGGTGCAA
GGTGCATTAA CCGGGATGAT GATGACGCTC AAAGGCCTGC CTTTGGCTTA TAACAAAGAT
ATGCAGGAAG ACAAAGAAGG GCTATTTGAT GCACTGGATA CCTGGCTTGA CTGCCTGCAC
ATGGCGGCAT TGGTACTGGA TGGGATTCAG GTAAAACGCC CGCGTTGTAA AGAAGCGGCT
GAACAGGGCT ATGCTAACGC GACTGAACTG GCTGATTATT TGGTCGCTAA AGGCGTACCA
TTCCGTGAAG CGCACCATAT TGTGGGTGAA GCGGTGGTAG AGGCAATCCG CCAGGGCAAA
GCTCTGGAGG CCTTAGCGCT GAGTGATTTA CAGCAATTCA GTTCCGTAAT TGGTGATGAT
GTGTACCCGA TTCTGGCATT GCAGTCTTGC TTAGATAAGC GAGTGGCCAA AGGCGGCGTT
TCGCCGCAGC AGGTTGCATC GGCCATTGCC GAGGCTAAAG CGCGGCTGTT TTAA
 
Protein sequence
MALWGGRFSQ AADQRFKQFN DSLRFDYRLA EQDIIGSVAW SKALVTVGVL NADEQQQLEQ 
ALSVLLEEVQ ANPHAILASD AEDIHSWVET KLIDKVGDLG KKLHTGRSRN DQVATDLKLW
CKFQITELQT AVQQLQQALV MTAEANQDAV MPGYTHLQRA QPVTFAHWCL AYVEMLSRDE
SRLQDTLKRL DVSPLGCGAL AGTAYAIDRE QLAGWLGFAS ATRNSLDSVS DRDHVLELLS
DASIGMVHLS RFAEDLIFFN SGEAAFVDLS DRVTSGSSLM PQKKNPDALE LIRGKCGRVQ
GALTGMMMTL KGLPLAYNKD MQEDKEGLFD ALDTWLDCLH MAALVLDGIQ VKRPRCKEAA
EQGYANATEL ADYLVAKGVP FREAHHIVGE AVVEAIRQGK ALEALALSDL QQFSSVIGDD
VYPILALQSC LDKRVAKGGV SPQQVASAIA EAKARLF
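
The protein sequence can be rederived from the gene sequence by removing the spaces between the 10-base blocks and translating codon by codon. The sketch below is a hedged illustration, not the tool that produced this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard table (the two differ in their permitted start codons), and it does no special start-codon handling, which is sufficient here because the gene begins with ATG. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for the 64 codons, with bases ordered T, C, A, G
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates sequence blocks and lines."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))
    return protein.split("*")[0]


# First 30 bases of the gene sequence shown above.
print(translate("ATGGCACTAT GGGGCGGACG GTTTAGTCAG"))  # MALWGGRFSQ
```

Translating the first three blocks of the gene sequence yields MALWGGRFSQ, the first block of the protein sequence. Passing the full 1374-base sequence to the same function is expected to return all 457 residues.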