Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4407 |
Symbol | argH |
ID | 6143737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4502673 |
End bp | 4504046 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641619228 |
Product | argininosuccinate lyase |
Protein accession | YP_001746352 |
Protein GI | 170680227 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0165] Argininosuccinate lyase |
TIGRFAM ID | [TIGR00838] argininosuccinate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.0966471 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTTT GGGGCGGGCG TTTTACCCAG GCAGCAGATC AACGGTTCAA ACAATTCAAC GACTCACTGC GCTTTGATTA CCGTCTGGCG GAGCAGGATA TTGTTGGCTC TGTGGCCTGG TCCAAAGCCC TGGTAACGGT CGGCGTGTTA ACCGCAGAAG AGCAGGCGCA ACTGGAAGAG GCGCTGAACG TGCTGCTGGA AGATGTTCGC GCCAGGCCAC AACAAATCCT TGAAAGCGAC GCCGAAGATA TCCATAGCTG GGTGGAAGGT AAACTGATCG ACAAAGTGGG TCAGTTAGGC AAAAAACTGC ATACCGGGCG TAGCCGTAAT GATCAGGTAG CAACTGACCT GAAACTGTGG TGCAAAGATA CCGTTAGCGA GTTGCTGACG GCTAACCGGC AGTTACAATC CGCTCTGGTC GAGACCGCGC AGAACAATCA GGACGCGGTG ATGCCGGGTT ACACTCACCT GCAACGTGCT CAACCGGTGA CGTTCGCGCA CTGGTGCCTG GCCTACGTTG AGATGCTGGC GCGTGATGAA AGCCGTTTGC AGGACGCGCT TAAGCGTCTG GATGTCAGCC CGCTAGGCTG TGGCGCACTG GCAGGAACGG CCTATGAAAT CGACCGAGAA CAGTTAGCAG GCTGGCTGGG CTTTGCTTCG GCGACCCGTA ACAGTCTCGA CAGCGTTTCT GACCGTGACC ATGTGTTGGA ACTGCTTTCT GCTGCCGCTA TCGGCATGGT GCATCTGTCG CGTTTTGCTG AAGATCTGAT TTTCTTTAAC ACCGGCGAAG CGGGGTTTGT CGAGCTTTCG GACCGCGTGA CTTCCGGTTC ATCATTAATG CCGCAGAAGA AAAACCCGGA TGCGCTGGAG CTGATTCGCG GTAAATGCGG CCGGGTGCAG GGTGCGTTAA CCGGCATGAT GATGACGCTG AAAGGTTTGC CGCTGGCTTA CAACAAAGAT ATGCAGGAAG ACAAAGAAGG TCTGTTCGAC GCGCTCGATA CCTGGCTGGA CTGCCTGCAT ATGGCGGCGC TGGTGCTGGA CGGCATTCAG GTGAAACGTC CGCGTTGCCA GGAAGCGGCG CAGCAGGGGT ATGCGAACGC CACTGAACTG GCGGATTACC TGGTGGCGAA AGGCGTACCG TTCCGCGAGG CGCATCATAT TGTGGGTGAA GCGGTGGTGG AAGCCATTCG TCAGGGCAAA CCGCTGGAAG AACTACCGCT CAGTGAGTTG CAGAAATTCA GTCAGGTGAT TGGCGAAGAT GTCTATCCGA TTCTGTCGCT ACAATCGTGC CTCGACAAGC GTGCGGCAAA AGGCGGCGTC TCACCGCAGC AGGTGGCGCA GGCGATTGCT TTTGCGCAGG CTCGGTTGGG GTAA
|
Protein sequence | MALWGGRFTQ AADQRFKQFN DSLRFDYRLA EQDIVGSVAW SKALVTVGVL TAEEQAQLEE ALNVLLEDVR ARPQQILESD AEDIHSWVEG KLIDKVGQLG KKLHTGRSRN DQVATDLKLW CKDTVSELLT ANRQLQSALV ETAQNNQDAV MPGYTHLQRA QPVTFAHWCL AYVEMLARDE SRLQDALKRL DVSPLGCGAL AGTAYEIDRE QLAGWLGFAS ATRNSLDSVS DRDHVLELLS AAAIGMVHLS RFAEDLIFFN TGEAGFVELS DRVTSGSSLM PQKKNPDALE LIRGKCGRVQ GALTGMMMTL KGLPLAYNKD MQEDKEGLFD ALDTWLDCLH MAALVLDGIQ VKRPRCQEAA QQGYANATEL ADYLVAKGVP FREAHHIVGE AVVEAIRQGK PLEELPLSEL QKFSQVIGED VYPILSLQSC LDKRAAKGGV SPQQVAQAIA FAQARLG
|
| |