Gene Information |
Locus tag | SbBS512_E4446 |
Symbol | argH |
ID | 6271248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4157197 |
End bp | 4158570 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641728241 |
Product | argininosuccinate lyase |
Protein accession | YP_001882654 |
Protein GI | 187731864 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0165] Argininosuccinate lyase |
TIGRFAM ID | [TIGR00838] argininosuccinate lyase |
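
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that matches the listed Gene Length) and that the final codon of the CDS is a stop codon that is not counted in the Protein Length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(4157197, 4158570)
print(length_bp)                  # 1374, matching "Gene Length"
print(protein_length(length_bp))  # 457, matching "Protein Length"
```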
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCACTTT GGGGCGGGCG TTTTACCCAG GCAGCAGATC AACGGTTCAA ACAATTCAAC GACTCACTGC GCTTTGATTA CCGTCTGGCG GAGCAGGATA TTGTTGGCTC TGTGGCCTGG TCCAAAGCCC TAGTAACGGT CGGCGTGTTA ACCGCAGAAG AGCAGGCGCA ACTGGAAGAG GCGCTGAACG TGTTGCTGGA AGATGTTCGC GCCAGGCCAC AACAAATCCT TGAAAGCGAC GCCGAAGATA TCCATAGCTG GGTGGAAGGC AAACTGATCG ACAAAGTGGG CCAGTTAGGC AAAAAGCTGC ATACCGGGCG TAGCCGTAAT GATCAGGTAG CGACTGACCT GAAACTGTGG TGCAAAGATA CCGTTAGCGA GTTACTGACG GCTAACCGGC AGCTGCAATC GGCGCTGGTG GAAACCGCAC AAAACAATCA GGACGCGGTA ATGCCAGGTT ACACTCACCT GCAACGCGCC CAGCCGGTGA CGTTCGCGCA CTGGTGCCTG GCCTATGTTG AGATGCTGGC GCGTGATGAA AGCCGTTTGC AGGATGCGCT TAAGCGTCTG GATGTCAGCC CATTAGGCTG TGGCGCGCTG GCGGGAACGG CCTATGAAAT CGACCGTGAA CAGTTAGCAG GCTGGCTGGG CTTTGCTTCG GCGACCCGTA ACAGTCTGGA CAGCGTTTCT GACCGTGACC ATGTGTTGGA ACTGCTGTCG GCTGCCGCTA TCGGCATGGT GCATCTGTCG CGTTTTGCTG AAGATCTGAT TTTCTTTAAC ACCGGCGAAG CGGGGTTTGT GGAGCTTTCT GACCGCGTGA CTTCCGGTTC ATCATTAATG CCGCAGAAGA AAAACCCGGA TGCGCTGGAG CTGATTCGCG GTAAATGCGG TCGGGTGCAG GGTGCGTTAA CCGGCATGAT GATGACGCTG AAAGGTTTGC CGCTGGCTTA CAACAAAGAT ATGCAGGAAG ACAAAGAAGG TCTGTTCGAC GCGCTCGATA CCTGGCTGGA CTGCCTGCAT ATGGCGGCGC TGGTGCTGGA CGGCATTCAG GTGAAACGTC CACGTTGCCA GGAAGCGGCT CAGCAGGGTT ACGCCAACGC CACCGAACTG GCGGATTACC TGGTGGCGAA AGGCGTACCT TTCCGCGAGG CGCACCATAT CGTGGGTGAA GCGGTGGTGG AAGCCATTCG TCAGGGCAAA CCGCTGGAAG ATCTGCCGCT CGACGAGTTG CAGAAATTCA GCCACGTGAT TGGCGAAGAT GTCTATCCGA TTCTGTCGCT ACAATCGTGC CTCGACAAGC GTGCGGCAAA AGGCGGTGTC TCACCGCAGC AGGTGGCCCA AGCGATTGCT TTTGCGCAGG CGCGGTTGGG GTAA
|
Protein sequence | MALWGGRFTQ AADQRFKQFN DSLRFDYRLA EQDIVGSVAW SKALVTVGVL TAEEQAQLEE ALNVLLEDVR ARPQQILESD AEDIHSWVEG KLIDKVGQLG KKLHTGRSRN DQVATDLKLW CKDTVSELLT ANRQLQSALV ETAQNNQDAV MPGYTHLQRA QPVTFAHWCL AYVEMLARDE SRLQDALKRL DVSPLGCGAL AGTAYEIDRE QLAGWLGFAS ATRNSLDSVS DRDHVLELLS AAAIGMVHLS RFAEDLIFFN TGEAGFVELS DRVTSGSSLM PQKKNPDALE LIRGKCGRVQ GALTGMMMTL KGLPLAYNKD MQEDKEGLFD ALDTWLDCLH MAALVLDGIQ VKRPRCQEAA QQGYANATEL ADYLVAKGVP FREAHHIVGE AVVEAIRQGK PLEDLPLDEL QKFSHVIGED VYPILSLQSC LDKRAAKGGV SPQQVAQAIA FAQARLG
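
The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how the nucleotide sequence might be prepared for a check against the GC content and Translation table fields, the helpers below strip the block spacing, compute the GC fraction, and test for a terminal stop codon. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA). The example input is only the first two blocks of the gene sequence, so its result is not the whole-gene GC content.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence, used here only as a short example.
excerpt = clean_sequence("ATGGCACTTT GGGGCGGGCG")
print(len(excerpt), gc_fraction(excerpt))
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` gives a value to compare with the listed GC content, and `ends_with_stop` can confirm that the sequence terminates in a stop codon.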