Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1023 |
Symbol | artI |
ID | 6969019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1034217 |
End bp | 1034948 |
Gene Length | 732 bp |
Protein Length | 243 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643385036 |
Product | arginine ABC transporter, periplasmic arginine-binding protein ArtI |
Protein accession | YP_002269536 |
Protein GI | 209397117 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 0.916151 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG TTCTGATTGC CGCGTTAATT GCAGGTTTTA GTCTTTCCGC CACAGCTGCC GAAACCATTC GTTTTGCTAC CGAAGCCTCC TATCCTCCGT TTGAATCGAT TGATGCAAAC AACCAGATCG TTGGTTTTGA CGTCGACCTG GCACAGGCGC TGTGTAAAGA GATTGATGCG ACCTGTACTT TCTCTAACCA GGCATTTGAC AGCCTGATCC CAAGCCTGAA ATTCCGTCGC GTGGAAGCGG TGATGGCCGG GATGGATATC ACCGCGGAAC GTGAAAAGCA GGTACTGTTT ACCACGCCGT ACTATGACAA CTCTGCCCTG TTTGTGGGTC AGCAGGGTAA ATACACCAGC GTTGATCAGC TGAAAGGCAA AAAAGTCGGC GTACAGAATG GTACGACTCA CCAGAAATTC ATTATGGATA AGCACCCGGA AATCACCACC GTGCCGTATG ACAGCTACCA GAACGCAAAA CTGGATCTGC AAAACGGTCG TATCGACAGC GTATTTGGTG ACACCGCAGT GGTAACTGAA TGGCTGAAAG ATAACCCGAA ACTGGCGGCA GTGGGCGATA AAGTGACCGA TAAAGATTAC TTCGGTACTG GCCTCGGCAT CGCGGTACGT CAGGGCAACA CTGAGCTGCA GCAGAAACTC AACACTGCGC TGGAAAAAGT GAAGAAAGAT GGCACTTACG AAACCATCTA CAACAAATGG TTCCAGAAGT AA
|
Protein sequence | MKKVLIAALI AGFSLSATAA ETIRFATEAS YPPFESIDAN NQIVGFDVDL AQALCKEIDA TCTFSNQAFD SLIPSLKFRR VEAVMAGMDI TAEREKQVLF TTPYYDNSAL FVGQQGKYTS VDQLKGKKVG VQNGTTHQKF IMDKHPEITT VPYDSYQNAK LDLQNGRIDS VFGDTAVVTE WLKDNPKLAA VGDKVTDKDY FGTGLGIAVR QGNTELQQKL NTALEKVKKD GTYETIYNKW FQK
|
| |