Gene SeHA_C4451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4451 
SymbolargH 
ID6488171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4338224 
End bp4339600 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content57% 
IMG OID642744533 
Productargininosuccinate lyase 
Protein accessionYP_002048122 
Protein GI194450311 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.000102786 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCACTTT GGGGTGGGCG TTTTACACAG GCGGCAGACC AGCGGTTTAA ACAATTCAAT 
GATTCGTTGC GCTTCGATTA CCGTCTGGCG GAGCAGGATA TTGTCGGTTC TGTGGCCTGG
TCCAAAGCAT TGGTCACGGT AGGCGTACTG ACTGCCGATG AGCAACGACA GTTGGAAGAA
GCGCTGAACG TATTGCTGGA AGAGGTTCGC GCGAATCCGC AGCAAATCCT GCAAAGCGAT
GCGGAAGATA TCCATAGCTG GGTGGAAGGT AAGCTCATCG ACAAAGTGGG TCAGTTGGGT
AAAAAGCTGC ACACCGGGCG CAGCCGTAAC GATCAAGTGG CGACGGACCT GAAACTGTGG
TGCAAAGAGA CGGTGAGGGA ACTGCTTACC GCTAACCGCC AGTTACAGAG CGCGCTGGTG
GAAACCGCGC AGGCGAACCA GGACGCGGTA ATGCCGGGAT ATACCCATCT GCAACGCGCG
CAGCCAGTGA CTTTCGCCCA CTGGTGTCTC GCGTATGTCG AAATGCTGGC GCGCGATGAA
AGCCGCCTGC AGGACACGCT TAAACGTCTG GACGTGAGTC CGCTAGGTTG CGGCGCGTTG
GCGGGAACGG CCTATGAAAT TGACCGTGAA CAATTGGCAG GCTGGCTGGG CTTTGCGTCT
GCGACCCGCA ACAGCCTGGA CAGCGTGTCC GATCGTGACC ACGTACTGGA ACTGCTTTCT
GATGCGGCTA TCGGCATGGT GCATCTGTCA CGCTTCGCGG AAGATCTGAT TTTCTTTAAT
TCTGGTGAAG CGGGTTTTGT AGAGCTTTCT GACCGCGTGA CTTCCGGTTC ATCGTTAATG
CCGCAGAAGA AAAACCCGGA CGCGCTGGAG CTGATTCGCG GTAAGTGCGG TCGCGTACAA
GGGGCGCTAA CCGGCATGAT GATGACCTTA AAAGGTCTGC CGCTGGCGTA TAACAAAGAT
ATGCAGGAAG ACAAAGAAGG GCTGTTCGAT GCGCTCGATA CCTGGCTTGA CTGCCTGCAT
ATGGCGGCGT TGGTGCTGGA CGGTATTCAG GTGAAACGCC CACGTTGTCA GGACGCGGCG
CAACAGGGGT ATGCCAACGC CACGGAGCTG GCGGATTACC TGGTCGCGAA AGGCGTGCCG
TTCCGCGAAG CGCACCATAT TGTCGGCGAA GCGGTGGTAG AAGCCATTCG CCAGGGTAAG
CCGCTGGAAG CGTTGCCGCT GGCCGATTTA CAGAAATTCA GCCACGTGAT TGGCGACGAT
GTGTATCCGA TGTTGTCTTT GCAGTCGTGT CTGGATAAAC GGGCGGCAAA AGGCGGCGTT
TCTCCGCAGC AGGTGGCGCA GGCCATCAAC GATGCGAAGG CGCGCCTCGC GTTGTAG
 
Protein sequence
MALWGGRFTQ AADQRFKQFN DSLRFDYRLA EQDIVGSVAW SKALVTVGVL TADEQRQLEE 
ALNVLLEEVR ANPQQILQSD AEDIHSWVEG KLIDKVGQLG KKLHTGRSRN DQVATDLKLW
CKETVRELLT ANRQLQSALV ETAQANQDAV MPGYTHLQRA QPVTFAHWCL AYVEMLARDE
SRLQDTLKRL DVSPLGCGAL AGTAYEIDRE QLAGWLGFAS ATRNSLDSVS DRDHVLELLS
DAAIGMVHLS RFAEDLIFFN SGEAGFVELS DRVTSGSSLM PQKKNPDALE LIRGKCGRVQ
GALTGMMMTL KGLPLAYNKD MQEDKEGLFD ALDTWLDCLH MAALVLDGIQ VKRPRCQDAA
QQGYANATEL ADYLVAKGVP FREAHHIVGE AVVEAIRQGK PLEALPLADL QKFSHVIGDD
VYPMLSLQSC LDKRAAKGGV SPQQVAQAIN DAKARLAL