Gene Information |
Locus tag | EcSMS35_1123 |
Symbol | |
ID | 6146152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1142343 |
End bp | 1143719 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641616003 |
Product | argininosuccinate lyase ArgH-like protein |
Protein accession | YP_001743195 |
Protein GI | 170683952 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0165] Argininosuccinate lyase |
TIGRFAM ID | [TIGR00838] argininosuccinate lyase |
|
|
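The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon, which encodes no residue


# Values taken from the Gene Information table of this record.
length_bp = gene_length(1142343, 1143719)
print(length_bp)                  # 1377
print(protein_length(length_bp))  # 458
```

Under these assumptions, the computed values agree with the Gene Length (1377 bp) and Protein Length (458 aa) fields.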
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.168929 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACGA AATTATGGGG CGGACGTTTT GATATGCCAA CAAATAAACT CGTTGAACAA TATAATGCAA CTATCACGCT GGAGCAGCGC CTATGCCCCT TTGATATTCA AGGAAGCATA GTTCATGCCA CGATGTTAGG ACGTCAGGGA ATAATCACAC AGGATGAAGC CAATACAATT ATTAGAGGTT TGAGACAAGT AAGTAAAGAA ATTGAAGACG GCCAATTCAT TTTCGATACT GTAGATGAAG ATATTCATAT GGCCATTGAA AGACGGATGA CTGAAATTAT TGGCCCTGTT GGTGGTAAAC TCCACACGGG GCGAAGTCGT AATGATCAAA CCACTGTTGA TTCTAAAATG CATATGCGAG CAATTATCCG TGAAATTCAA GAGGATATTA CCAACCTGCA AAAAATAATA ATTAACAAAG CAGAAAACAA TATTAATGTC ATCATGCCAG GCTATACTCA TTTGCAAACA GGTCAGCCCA TTCTTTTATC TCACTGGATT ATGGCATATT ACTGGATGTT GCGTCGTGAC TGGAATAGGT TTGAAGATCT GTATCAACGG ATGGGAGAAT GCCCTTTGGG GGCAGCGGCT CTCGCCGGTA CGACATTCCC TATTGATCGT AATTTTACGG CTCGTGAACT TGGTTTTGAT AAGCCAACTG AGAATAGTAT TGATTCTGTC AGTGACCGCG ACCATATGGT CGAATTCACC GCGGCAGCAG CGATGTGTTT TATGCATCTA ACTCGCCTTT CAGAGGAACT GATTTTATTC TCTAGCCAAG ACTTTAAATT TATTGAACTT TCTGATGACT TCTGTACAGG ATCCAGCATC ATGCCGCAGA AAAAGAACCC TGATGTGGCG GAAAAAATGC GTGGTAAAGG TGGGAGAATG TATGGAAATC TGATGGCCAT GCTGACTATT ATGAAAGGCA TACCGCTAGC GTATAATACA GACATGAGCG AGGATAAAGA GCAGGTCTAT GACTCAATGG ATACTCTACA GGCCAGCTTA AGAATAATGG CACCTATGAT CGAAAAAATG GTTATCCTTG CCGAAAATAC GCGTGCAGCA GCCGCTCGAG GATTCTCGAA TGCAACAGAT ATGGCCGATT ATCTGGTCCG TAAAGGTATT CCTTTCAGAG AAGCTCACCA TATTGTTGGT AGTGCAGTAA ATTACTGTAT TAAACATAAA AAAATGTTAG AAGAGCTTAC TATGGAAGAA TTCTCCACAT TTGATAATAA AATAGAAAAA GATATTTATG AAAGTATTTC TCTGGAGGCT TGCATTAAGG CCAGGATGTC TTATGGTGGA ACCGGACCTG ATGCTGTCAA AAAACAAATA GAGATTGCAA AATCACTTTT AAAATAG
|
Protein sequence | MSTKLWGGRF DMPTNKLVEQ YNATITLEQR LCPFDIQGSI VHATMLGRQG IITQDEANTI IRGLRQVSKE IEDGQFIFDT VDEDIHMAIE RRMTEIIGPV GGKLHTGRSR NDQTTVDSKM HMRAIIREIQ EDITNLQKII INKAENNINV IMPGYTHLQT GQPILLSHWI MAYYWMLRRD WNRFEDLYQR MGECPLGAAA LAGTTFPIDR NFTARELGFD KPTENSIDSV SDRDHMVEFT AAAAMCFMHL TRLSEELILF SSQDFKFIEL SDDFCTGSSI MPQKKNPDVA EKMRGKGGRM YGNLMAMLTI MKGIPLAYNT DMSEDKEQVY DSMDTLQASL RIMAPMIEKM VILAENTRAA AARGFSNATD MADYLVRKGI PFREAHHIVG SAVNYCIKHK KMLEELTMEE FSTFDNKIEK DIYESISLEA CIKARMSYGG TGPDAVKKQI EIAKSLLK
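The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the spacing, compute a GC percentage, and run a basic coding-sequence sanity check. The helper names are hypothetical, and the check assumes the three standard stop codons (TAA, TAG, TGA) used by translation table 11. It does not test the start codon, because that table permits alternative starts.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons assumed for translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (e.g. the 10-base block spacing) and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Example using only the first two blocks of the gene sequence above.
fragment = clean_sequence("ATGTCTACGA AATTATGGGG")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 40.0
```

Pasting the full Gene sequence field into `clean_sequence` and passing the result to `gc_percent` and `looks_like_cds` would allow a comparison with the GC content and Gene Length fields of the record.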