Gene SeD_A4528 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4528 
SymbolargH 
ID6871177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4372492 
End bp4373868 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content57% 
IMG OID642787441 
Productargininosuccinate lyase 
Protein accessionYP_002218052 
Protein GI198243996 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.260421 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0000184312 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCACTTT GGGGTGGGCG TTTTACACAG GCGGCAGACC AGCGGTTTAA ACAATTCAAT 
GATTCGTTGC GCTTCGATTA CCGTCTGGCG GAGCAGGATA TTGTCGGTTC TGTGGCCTGG
TCCAAAGCAT TGGTCACGGT AGGCGTACTG ACTGCCGATG AGCAACGACA GTTGGAAGAA
GCGCTGAACG TGCTGCTGGA AGAGGTTCGC GTGAATCCGC AGCAAATCCT GCAAAGCGAT
GCGGAAGATA TCCATAGCTG GGTGGAAGGT AAGCTCATCG ACAAAGTGGG TCAGTTGGGT
AAAAAGCTGC ACACCGGGCG CAGCCGTAAC GATCAGGTGG CGACGGACCT GAAACTGTGG
TGCAAAGAGA CGGTGAGGGA ACTGCTTACC GCTAACCGCC AGCTACAGAG CGCGCTGGTG
GAAACCGCGC AGGCGAACCA GGACGCGGTA ATGCCGGGAT ATACCCATCT GCAACGCGCG
CAGCCAGTGA CTTTCGCCCA CTGGTGTCTC GCGTATGTCG AAATGCTGGC GCGCGATGAA
AGCCGCCTGC AGGACACGCT TAAACGTCTG GACGTGAGTC CGCTAGGTTG CGGCGCGTTG
GCGGGAACGG CCTATGAAAT TGACCGTGAA CAATTGGCAG GCTGGCTGGG CTTTGCGTCT
GCGACCCGCA ACAGCCTGGA CAGCGTGTCC GATCGTGACC ACGTACTGGA ACTGCTTTCT
GATGCGGCTA TCGGCATGGT GCATCTGTCA CGCTTCGCGG AAGATCTGAT TTTCTTTAAT
TCTGGTGAAG CGGGTTTTGT AGAGCTTTCT GACCGCGTGA CTTCCGGTTC ATCGTTAATG
CCGCAGAAGA AAAACCCGGA CGCGCTGGAG CTGATTCGCG GTAAGTGCGG TCGCGTACAA
GGGGCGCTAA CCGGCATGAT GATGACTTTA AAAGGTCTGC CGCTGGCGTA TAACAAAGAT
ATGCAGGAAG ACAAAGAAGG GCTGTTCGAT GCGCTCGATA CCTGGCTTGA CTGCCTGCAT
ATGGCGGCGT TGGTGCTGGA CGGTATTCAG GTGAAACGCC CACGTTGTCA GGACGCGGCG
CAACAGGGGT ATGCCAACGC CACGGAGCTG GCGGATTACC TGGTCGCGAA AGGCGTGCCG
TTCCGCGAAG CGCACCATAT TGTCGGCGAA GCGGTGGTAG AAGCCATTCG CCAGGGTAAG
CCGCTGGAAG CGTTGCCGCT GGCCGATTTA CAGAAATTCA GCCACGTGAT TGGCGACGAT
GTGTATCCGA TGTTGTCTTT GCAGTCGTGT CTGGATAAAC GGGCGGCAAA AGGCGGCGTT
TCTCCGCAGC AGGTGGCGCA GGCCATCGAC GATGCCAGGG CGCGCCTCGC GTTGTAG
 
Protein sequence
MALWGGRFTQ AADQRFKQFN DSLRFDYRLA EQDIVGSVAW SKALVTVGVL TADEQRQLEE 
ALNVLLEEVR VNPQQILQSD AEDIHSWVEG KLIDKVGQLG KKLHTGRSRN DQVATDLKLW
CKETVRELLT ANRQLQSALV ETAQANQDAV MPGYTHLQRA QPVTFAHWCL AYVEMLARDE
SRLQDTLKRL DVSPLGCGAL AGTAYEIDRE QLAGWLGFAS ATRNSLDSVS DRDHVLELLS
DAAIGMVHLS RFAEDLIFFN SGEAGFVELS DRVTSGSSLM PQKKNPDALE LIRGKCGRVQ
GALTGMMMTL KGLPLAYNKD MQEDKEGLFD ALDTWLDCLH MAALVLDGIQ VKRPRCQDAA
QQGYANATEL ADYLVAKGVP FREAHHIVGE AVVEAIRQGK PLEALPLADL QKFSHVIGDD
VYPMLSLQSC LDKRAAKGGV SPQQVAQAID DARARLAL