Gene SeSA_A4333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A4333 
SymbolargH 
ID6518061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp4208632 
End bp4210008 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content57% 
IMG OID642749290 
Productargininosuccinate lyase 
Protein accessionYP_002117037 
Protein GI194735037 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTTT GGGGTGGGCG TTTTACACAG GCGGCAGACC AGCGGTTTAA ACAATTCAAT 
GATTCGTTGC GCTTCGATTA CCGTCTGGCG GAGCAGGATA TTGTCGGTTC TGTGGCCTGG
TCCAAAGCAT TGGTCACGGT AGGCGTACTG ACTGCCGATG AGCAACGACA GTTGGAAGAA
GCGCTGAACG TGCTGCTGGA GGAGGTTCGC GCGAATCCGC AGCAAATCCT GCAAAGCGAT
GCGGAAGATA TCCATAGCTG GGTGGAAGGT AAGCTCATCG ACAAAGTGGG TCAGTTGGGT
AAAAAGCTGC ACACCGGACG CAGCCGTAAC GATCAGGTGG CGACAGACCT GAAACTGTGG
TGCAAAGAGA CGGTGAGGGA ACTGCTTACC GCTAACCGCC AGCTACAGAG CGCGCTGGTG
GAAACCGCGC AGGCGAACCA GGACGCGGTA ATGCCGGGAT ATACCCATCT GCAACGCGCG
CAGCCGGTGA CTTTCGCCCA CTGGTGTCTC GCGTATGTCG AAATGCTGGC GCGCGATGAA
AGCCGCCTGC AGGACACGCT TAAACGTCTG GACGTGAGTC CGCTAGGTTG CGGCGCGTTG
GCGGGAACGG CCTATGAAAT TGACCGTGAA CAATTGGCAG GCTGGCTGGG CTTTACGTCT
GCGACCCGCA ACAGCCTGGA CAGCGTGTCT GATCGTGACC ACGTACTGGA ACTGCTTTCT
GATGCGGCTA TCGGCATGGT GCATCTGTCA CGCTTCGCGG AGGATCTGAT TTTCTTTAAT
TCTGGTGAAG CGGGTTTTGT AGAGCTTTCT GACCGCGTGA CTTCCGGTTC ATCGTTAATG
CCGCAGAAGA AAAACCCGGA CGCGCTGGAG CTGATTCGCG GTAAGTGCGG TCGCGTACAA
GGGGCGCTAA CCGGCATGAT GATGACCTTA AAAGGTCTGC CGCTGGCGTA TAACAAAGAT
ATGCAGGAAG ACAAAGAAGG GCTGTTCGAT GCGCTCGATA CCTGGCTTGA CTGCCTGCAT
ATGGCGGCGT TGGTGCTGGA CGGTATTCAG GTGAAACGCC TACGTTGTCA GGACGCGGCG
CAACAGGGGT ATGCCAACGC CACGGAGCTG GCGGATTACC TGGTCGCGAA AGGCGTGCCG
TTCCGCGAAG CGCACCATAT TGTCGGCGAA GCGGTGGTAG AAGCTATTCG CCAGGGTAAG
CCGCTGGAAG CGTTGCCGCT GGCCGATTTA CAGAAATTCA GCCGCGTGAT TGGCGACGAT
GTGTATCCGA TATTGTCTTT GCAGTCGTGT CTGGATAAAC GGGCGGCAAA AGGCGGCGTT
TCTCCGCAGC AGGTGGCGCA GGCCATCGAC GATGCCAGGG CGCGCCTCGC GTTGTAG
 
Protein sequence
MALWGGRFTQ AADQRFKQFN DSLRFDYRLA EQDIVGSVAW SKALVTVGVL TADEQRQLEE 
ALNVLLEEVR ANPQQILQSD AEDIHSWVEG KLIDKVGQLG KKLHTGRSRN DQVATDLKLW
CKETVRELLT ANRQLQSALV ETAQANQDAV MPGYTHLQRA QPVTFAHWCL AYVEMLARDE
SRLQDTLKRL DVSPLGCGAL AGTAYEIDRE QLAGWLGFTS ATRNSLDSVS DRDHVLELLS
DAAIGMVHLS RFAEDLIFFN SGEAGFVELS DRVTSGSSLM PQKKNPDALE LIRGKCGRVQ
GALTGMMMTL KGLPLAYNKD MQEDKEGLFD ALDTWLDCLH MAALVLDGIQ VKRLRCQDAA
QQGYANATEL ADYLVAKGVP FREAHHIVGE AVVEAIRQGK PLEALPLADL QKFSRVIGDD
VYPILSLQSC LDKRAAKGGV SPQQVAQAID DARARLAL