Gene Information |
Locus tag | ECH74115_4851 |
Symbol | arsB |
ID | 6971727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4486328 |
End bp | 4487617 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643388542 |
Product | arsenical pump membrane protein |
Protein accession | YP_002272970 |
Protein GI | 209400831 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1055] Na+/H+ antiporter NhaD and related arsenite permeases |
TIGRFAM ID | [TIGR00935] arsenical pump membrane protein |
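The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and are not fields of the database.

```python
# Values copied from the Gene Information table.
start_bp = 4486328
end_bp = 4487617
gene_length_bp = 1290
protein_length_aa = 429

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # coordinates agree with Gene Length
print(span_bp % 3 == 0)                          # CDS is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions all three checks print `True` for this record.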
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTACTGG CAGGCGCTAT CTTTGTCCTG ACCATTGTAT TGGTTATCTG GCAGCCGAAA GGTTTAGGCA TCGGCTGGAG TGCAACGCTC GGCGCAGTAC TGGCGTTAGT TACGGGCGTA GTACATCCGG GCGATATTCC GGTGGTGTGG AATATCGTCT GGAACGCGAC GGCTGCGTTT ATTGCCGTTA TTATCATCAG CCTGCTGCTG GATGAGTCCG GCTTTTTTGA ATGGGCGGCA CTGCACGTTT CACGCTGGGG CAATGGACGC GGTCGCTTGC TGTTTACCTG GATTGTCCTG CTCGGTGCTG CCGTTGCCGC CCTGTTTGCC AATGATGGCG CGGCGCTTAT TTTGACACCG ATTGTCATCG CCATGCTGCT GGCTTTAGGG TTCAGTAAAG GCACTACGCT GGCGTTTGTG ATGGCGGCCG GATTCATTGC CGATACCGCC AGCCTGCCGC TTATTGTCTC CAACCTGGTG AATATCGTTT CCGCAGATTT CTTTGGCCTC GGCTTTCGCG AATATGCCTC GGTGATGGTG CCGGTGGATA TCGCCGCGAT TGTTGCCACG CTGGTGATGT TGCATCTCTA TTTCCGCAAA GATATTCCGC AGAACTACGA CATGGCGCTG CTGAAATCTC CCGCAGAAGC GATTAAAGAT CCTGCTACGT TCAAAACTGG CTGGGTTGTT TTACTGCTTC TGCTGGTGGG ATTTTTCGTC CTGGAACCGC TCGGCATTCC GGTCAGCGCC ATTGCGGCAG TGGGCGCGTT GATATTATTT GTCGTCGCTA AACGCGGTCA TGCGATTAAT ACGGGTAAAG TGCTGCGCGG TGCGCCCTGG CAGATTGTCA TCTTCTCGCT CGGCATGTAT CTGGTGGTTT ATGGCCTGCG CAATGCCGGA TTAACGGAAT ATCTTTCTGG CGTACTCAAC GTGCTGGCGG ATAACGGCCT GTGGGCCGCG ACGCTCGGCA CCGGATTCCT CACCGCCTTC CTCTCTTCTA TTATGAACAA TATGCCGACG GTACTGGTTG GCGCGTTGTC CATTGATGGC AGCACGGCAT CTGGCGTTAT CAAAGAAGCG ATGGTTTATG CCAACGTGAT TGGCTGCGAT TTGGGGCCGA AAATCACCCC GATTGGTAGC CTGGCAACGC TGCTCTGGCT GCACGTACTT TCGCAGAAGA ATATGACTAT CAGCTGGGGA TATTACTTCC GTACAGGGAT TATCATGACC CTGCCTGTGC TGTTTGTGAC GCTGGCTGCG CTGGCGCTAC GTCTCTCTTT CACTTTGTAA
|
Protein sequence | MLLAGAIFVL TIVLVIWQPK GLGIGWSATL GAVLALVTGV VHPGDIPVVW NIVWNATAAF IAVIIISLLL DESGFFEWAA LHVSRWGNGR GRLLFTWIVL LGAAVAALFA NDGAALILTP IVIAMLLALG FSKGTTLAFV MAAGFIADTA SLPLIVSNLV NIVSADFFGL GFREYASVMV PVDIAAIVAT LVMLHLYFRK DIPQNYDMAL LKSPAEAIKD PATFKTGWVV LLLLLVGFFV LEPLGIPVSA IAAVGALILF VVAKRGHAIN TGKVLRGAPW QIVIFSLGMY LVVYGLRNAG LTEYLSGVLN VLADNGLWAA TLGTGFLTAF LSSIMNNMPT VLVGALSIDG STASGVIKEA MVYANVIGCD LGPKITPIGS LATLLWLHVL SQKNMTISWG YYFRTGIIMT LPVLFVTLAA LALRLSFTL
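The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the method used by the database: it assumes the sequence is given in the 10-base groups shown above, uses the amino acid assignments of translation table 11 (which match the standard genetic code; table 11 differs in its permitted start codons), and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 9 bases of the gene sequence in this record.
head = "ATGTTACTGG CAGGCGCTAT CTTTGTCCTG"
tail = "ACTTTGTAA"

print(translate(head))  # MLLAGAIFVL
print(translate(tail))  # TL
```

The two fragments reproduce the first ten and last two residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked in the same way.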