Gene EcSMS35_4407 details

Gene Information

Locus tag: EcSMS35_4407
Symbol: argH
ID: 6143737
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4502673
End bp: 4504046
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 56%
IMG OID: 641619228
Product: argininosuccinate lyase
Protein accession: YP_001746352
Protein GI: 170680227
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
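
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete (the gene length includes the stop codon); the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4502673
END_BP = 4504046
GENE_LENGTH_BP = 1374
PROTEIN_LENGTH_AA = 457


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues in a complete CDS: one per codon, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the listed gene length and protein length both agree with the start and end coordinates.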


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.0966471
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACTTT GGGGCGGGCG TTTTACCCAG GCAGCAGATC AACGGTTCAA ACAATTCAAC 
GACTCACTGC GCTTTGATTA CCGTCTGGCG GAGCAGGATA TTGTTGGCTC TGTGGCCTGG
TCCAAAGCCC TGGTAACGGT CGGCGTGTTA ACCGCAGAAG AGCAGGCGCA ACTGGAAGAG
GCGCTGAACG TGCTGCTGGA AGATGTTCGC GCCAGGCCAC AACAAATCCT TGAAAGCGAC
GCCGAAGATA TCCATAGCTG GGTGGAAGGT AAACTGATCG ACAAAGTGGG TCAGTTAGGC
AAAAAACTGC ATACCGGGCG TAGCCGTAAT GATCAGGTAG CAACTGACCT GAAACTGTGG
TGCAAAGATA CCGTTAGCGA GTTGCTGACG GCTAACCGGC AGTTACAATC CGCTCTGGTC
GAGACCGCGC AGAACAATCA GGACGCGGTG ATGCCGGGTT ACACTCACCT GCAACGTGCT
CAACCGGTGA CGTTCGCGCA CTGGTGCCTG GCCTACGTTG AGATGCTGGC GCGTGATGAA
AGCCGTTTGC AGGACGCGCT TAAGCGTCTG GATGTCAGCC CGCTAGGCTG TGGCGCACTG
GCAGGAACGG CCTATGAAAT CGACCGAGAA CAGTTAGCAG GCTGGCTGGG CTTTGCTTCG
GCGACCCGTA ACAGTCTCGA CAGCGTTTCT GACCGTGACC ATGTGTTGGA ACTGCTTTCT
GCTGCCGCTA TCGGCATGGT GCATCTGTCG CGTTTTGCTG AAGATCTGAT TTTCTTTAAC
ACCGGCGAAG CGGGGTTTGT CGAGCTTTCG GACCGCGTGA CTTCCGGTTC ATCATTAATG
CCGCAGAAGA AAAACCCGGA TGCGCTGGAG CTGATTCGCG GTAAATGCGG CCGGGTGCAG
GGTGCGTTAA CCGGCATGAT GATGACGCTG AAAGGTTTGC CGCTGGCTTA CAACAAAGAT
ATGCAGGAAG ACAAAGAAGG TCTGTTCGAC GCGCTCGATA CCTGGCTGGA CTGCCTGCAT
ATGGCGGCGC TGGTGCTGGA CGGCATTCAG GTGAAACGTC CGCGTTGCCA GGAAGCGGCG
CAGCAGGGGT ATGCGAACGC CACTGAACTG GCGGATTACC TGGTGGCGAA AGGCGTACCG
TTCCGCGAGG CGCATCATAT TGTGGGTGAA GCGGTGGTGG AAGCCATTCG TCAGGGCAAA
CCGCTGGAAG AACTACCGCT CAGTGAGTTG CAGAAATTCA GTCAGGTGAT TGGCGAAGAT
GTCTATCCGA TTCTGTCGCT ACAATCGTGC CTCGACAAGC GTGCGGCAAA AGGCGGCGTC
TCACCGCAGC AGGTGGCGCA GGCGATTGCT TTTGCGCAGG CTCGGTTGGG GTAA
 
Protein sequence
MALWGGRFTQ AADQRFKQFN DSLRFDYRLA EQDIVGSVAW SKALVTVGVL TAEEQAQLEE 
ALNVLLEDVR ARPQQILESD AEDIHSWVEG KLIDKVGQLG KKLHTGRSRN DQVATDLKLW
CKDTVSELLT ANRQLQSALV ETAQNNQDAV MPGYTHLQRA QPVTFAHWCL AYVEMLARDE
SRLQDALKRL DVSPLGCGAL AGTAYEIDRE QLAGWLGFAS ATRNSLDSVS DRDHVLELLS
AAAIGMVHLS RFAEDLIFFN TGEAGFVELS DRVTSGSSLM PQKKNPDALE LIRGKCGRVQ
GALTGMMMTL KGLPLAYNKD MQEDKEGLFD ALDTWLDCLH MAALVLDGIQ VKRPRCQEAA
QQGYANATEL ADYLVAKGVP FREAHHIVGE AVVEAIRQGK PLEELPLSEL QKFSQVIGED
VYPILSLQSC LDKRAAKGGV SPQQVAQAIA FAQARLG
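
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a sketch under stated assumptions, not the method used to produce this record: it strips the 10-base spacing, computes the G+C fraction, and translates codons using the standard amino acid assignments, which translation table 11 shares with the standard code for elongation codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
# Standard codon assignments, bases ordered T, C, A, G at each position.
# Translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First row of the gene sequence above (60 bases, 20 codons).
    first_row = ("ATGGCACTTT GGGGCGGGCG TTTTACCCAG "
                 "GCAGCAGATC AACGGTTCAA ACAATTCAAC")
    print(translate(first_row))  # MALWGGRFTQAADQRFKQFN
```

The output for the first row matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.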