Gene EcSMS35_1123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1123 
Symbol 
ID6146152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1142343 
End bp1143719 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content41% 
IMG OID641616003 
Productargininosuccinate lyase ArgH-like protein 
Protein accessionYP_001743195 
Protein GI170683952 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.168929 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACGA AATTATGGGG CGGACGTTTT GATATGCCAA CAAATAAACT CGTTGAACAA 
TATAATGCAA CTATCACGCT GGAGCAGCGC CTATGCCCCT TTGATATTCA AGGAAGCATA
GTTCATGCCA CGATGTTAGG ACGTCAGGGA ATAATCACAC AGGATGAAGC CAATACAATT
ATTAGAGGTT TGAGACAAGT AAGTAAAGAA ATTGAAGACG GCCAATTCAT TTTCGATACT
GTAGATGAAG ATATTCATAT GGCCATTGAA AGACGGATGA CTGAAATTAT TGGCCCTGTT
GGTGGTAAAC TCCACACGGG GCGAAGTCGT AATGATCAAA CCACTGTTGA TTCTAAAATG
CATATGCGAG CAATTATCCG TGAAATTCAA GAGGATATTA CCAACCTGCA AAAAATAATA
ATTAACAAAG CAGAAAACAA TATTAATGTC ATCATGCCAG GCTATACTCA TTTGCAAACA
GGTCAGCCCA TTCTTTTATC TCACTGGATT ATGGCATATT ACTGGATGTT GCGTCGTGAC
TGGAATAGGT TTGAAGATCT GTATCAACGG ATGGGAGAAT GCCCTTTGGG GGCAGCGGCT
CTCGCCGGTA CGACATTCCC TATTGATCGT AATTTTACGG CTCGTGAACT TGGTTTTGAT
AAGCCAACTG AGAATAGTAT TGATTCTGTC AGTGACCGCG ACCATATGGT CGAATTCACC
GCGGCAGCAG CGATGTGTTT TATGCATCTA ACTCGCCTTT CAGAGGAACT GATTTTATTC
TCTAGCCAAG ACTTTAAATT TATTGAACTT TCTGATGACT TCTGTACAGG ATCCAGCATC
ATGCCGCAGA AAAAGAACCC TGATGTGGCG GAAAAAATGC GTGGTAAAGG TGGGAGAATG
TATGGAAATC TGATGGCCAT GCTGACTATT ATGAAAGGCA TACCGCTAGC GTATAATACA
GACATGAGCG AGGATAAAGA GCAGGTCTAT GACTCAATGG ATACTCTACA GGCCAGCTTA
AGAATAATGG CACCTATGAT CGAAAAAATG GTTATCCTTG CCGAAAATAC GCGTGCAGCA
GCCGCTCGAG GATTCTCGAA TGCAACAGAT ATGGCCGATT ATCTGGTCCG TAAAGGTATT
CCTTTCAGAG AAGCTCACCA TATTGTTGGT AGTGCAGTAA ATTACTGTAT TAAACATAAA
AAAATGTTAG AAGAGCTTAC TATGGAAGAA TTCTCCACAT TTGATAATAA AATAGAAAAA
GATATTTATG AAAGTATTTC TCTGGAGGCT TGCATTAAGG CCAGGATGTC TTATGGTGGA
ACCGGACCTG ATGCTGTCAA AAAACAAATA GAGATTGCAA AATCACTTTT AAAATAG
 
Protein sequence
MSTKLWGGRF DMPTNKLVEQ YNATITLEQR LCPFDIQGSI VHATMLGRQG IITQDEANTI 
IRGLRQVSKE IEDGQFIFDT VDEDIHMAIE RRMTEIIGPV GGKLHTGRSR NDQTTVDSKM
HMRAIIREIQ EDITNLQKII INKAENNINV IMPGYTHLQT GQPILLSHWI MAYYWMLRRD
WNRFEDLYQR MGECPLGAAA LAGTTFPIDR NFTARELGFD KPTENSIDSV SDRDHMVEFT
AAAAMCFMHL TRLSEELILF SSQDFKFIEL SDDFCTGSSI MPQKKNPDVA EKMRGKGGRM
YGNLMAMLTI MKGIPLAYNT DMSEDKEQVY DSMDTLQASL RIMAPMIEKM VILAENTRAA
AARGFSNATD MADYLVRKGI PFREAHHIVG SAVNYCIKHK KMLEELTMEE FSTFDNKIEK
DIYESISLEA CIKARMSYGG TGPDAVKKQI EIAKSLLK