Gene SNSL254_A4454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4454 
SymbolargH 
ID6484594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4333045 
End bp4334421 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content57% 
IMG OID642739693 
Productargininosuccinate lyase 
Protein accessionYP_002043387 
Protein GI194443574 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0000120209 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCACTTT GGGGTGGGCG TTTTACACAG GCGGCAGACC AGCGGTTTAA ACAATTCAAT 
GATTCGTTGC GCTTCGATTA CCGTCTGGCG GAGCAGGATA TTGTCGGTTC TGTGGCCTGG
TCTAAAGCAT TGGTCACGGT AGGCGTACTG ACTGCCGATG AGCAACGACA GTTGGAAGAA
GCGCTGAACG TATTGCTGGA AGAGGTTCGC GCGAATCCGC AGCAAATCCT GCAAAGCGAT
GCGGAAGATA TCCATAGCTG GGTGGAAGGT AAGCTCATCG ACAAAGTGGG TCAGTTGGGT
AAAAAGCTGC ACACCGGGCG CAGCCGTAAC GATCAAGTGG CGACGGACCT GAAACTGTGG
TGCAAAGAGA CGGTGAGGGA ACTGCTTACC GCTAACCGCC AGTTACAGAG CGCGCTGGTG
GAAACCGCGC AGGCGAACCA GGACGCGGTA ATGCCGGGAT ATACCCATCT GCAACGCGCG
CAGCCAGTGA CTTTCGCCCA CTGGTGTCTC GCGTATGTCG AAATGCTGGC GCGCGATGAA
AGCCGCCTGC AGGACACGCT TAAACGTCTG GACGTGAGTC CGCTAGGTTG CGGCGCGTTG
GCGGGAACGG CCTATGAAAT TGACCGTGAA CAATTGGCAG GCTGGCTGGG CTTTGCGTCT
GCGACCCGCA ACAGCCTGGA CAGCGTGTCC GATCGTGACC ACGTACTGGA ACTGCTTTCT
GATGCGGCTA TCGGCATGGT GCATCTGTCA CGCTTCGCGG AAGATCTGAT TTTCTTTAAT
TCTGGTGAAG CGGGTTTTGT AGAGCTTTCT GACCGCGTGA CTTCCGGTTC ATCGTTAATG
CCGCAGAAGA AAAACCCGGA CGCGCTGGAG CTGATTCGCG GTAAGTGCGG TCGCGTACAA
GGGGCGCTAA CCGGCATGAT GATGACTTTA AAAGGTCTGC CGCTGGCGTA TAACAAAGAT
ATGCAGGAAG ACAAAGAAGG GCTGTTCGAT GCGCTCGATA CCTGGCTTGA CTGCCTGCAT
ATGGCGGCGT TGGTGCTGGA CGGTATTCAG GTGAAACGCC CACGTTGTCA GGACGCGGCG
CAACAGGGGT ATGCCAACGC CACGGAGCTG GCGGATTACC TGGTCGCGAA AGGCGTGCCG
TTCCGCGAAG CGCACCATAT TGTTGGCGAA GCGGTGGTAG AAGCTATTCG CCAGGGTAAG
CCGCTGGAAG CGTTGCCGCT GGCCGATTTA CAGAAATTCA GCCGCGTGAT TGGCGACGAT
GTGTATCCGA TATTGTCTTT GCAGTCGTGT CTGGATAAAC GGGCGGCAAA AGGCGGCGTT
TCTCCGCTGC AGGTGGCGCA GGCCATCAAC GATGCGAAGG CGCGCCTCGC GTTGTAG
 
Protein sequence
MALWGGRFTQ AADQRFKQFN DSLRFDYRLA EQDIVGSVAW SKALVTVGVL TADEQRQLEE 
ALNVLLEEVR ANPQQILQSD AEDIHSWVEG KLIDKVGQLG KKLHTGRSRN DQVATDLKLW
CKETVRELLT ANRQLQSALV ETAQANQDAV MPGYTHLQRA QPVTFAHWCL AYVEMLARDE
SRLQDTLKRL DVSPLGCGAL AGTAYEIDRE QLAGWLGFAS ATRNSLDSVS DRDHVLELLS
DAAIGMVHLS RFAEDLIFFN SGEAGFVELS DRVTSGSSLM PQKKNPDALE LIRGKCGRVQ
GALTGMMMTL KGLPLAYNKD MQEDKEGLFD ALDTWLDCLH MAALVLDGIQ VKRPRCQDAA
QQGYANATEL ADYLVAKGVP FREAHHIVGE AVVEAIRQGK PLEALPLADL QKFSRVIGDD
VYPILSLQSC LDKRAAKGGV SPLQVAQAIN DAKARLAL