Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3794 |
Symbol | arsB |
ID | 6146139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3860362 |
End bp | 3861651 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618620 |
Product | arsenical pump membrane protein |
Protein accession | YP_001745760 |
Protein GI | 170683643 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1055] Na+/H+ antiporter NhaD and related arsenite permeases |
TIGRFAM ID | [TIGR00935] arsenical pump membrane protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.146507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTACTGG CGGGCGCTAT CTTTGTCCTG ACCATCGTAT TGGTTATCTG GCAGCCGAAA GGTTTAGGCA TCGGCTGGAG TGCGACACTG GGCGCCGTAC TGGCGTTAGT TACTGGCGTG GTCCATCCGG GCGATATTCC GGTGGTGTGG AATATCGTCT GGAACGCGAC GGCGGCGTTT ATAGCCGTCA TTATCATCAG CTTGCTGCTG GATGAGTCCG GCTTTTTTGA ATGGGCGGCG CTGCACGTCT CACGCTGGGG CAATGGGCGC GGTCGCTTGC TGTTTACCTG GATTGTCCTG CTCGGTGCTG CCGTTGCAGC TCTGTTTGCC AATGATGGCG CGGCGCTTAT TTTGACACCG ATTGTCATCG CCATGCTGCT GGCTTTAGGG TTCAACAAAG GCACTACGCT GGCGTTTGTT ATGGCGGCCG GATTCATTGC CGATACCGCC AGCCTGCCGC TTATTGTCTC TAACCTGGTG AATATCGTTT CGGCAGATTT CTTTAGCCTC GACTTTCGCG AATACGCCTC GGTGATGGTG CCGGTGGATA TCGCTGCAAT ACTGGCTACC CTGGTGATGT TGCATCTCTA TTTCCGCAAA GATATTCCGC AGAACTACGA CATGGCGCTG CTAAAATCTC CCGCAGAAGC GGTTAAAGAT CCTGCCACCT TCAAAACTGG CGGGGTTGTT TTACTGCTTC TGCTGGTGGG ATTTTTCGTC CTGGAACCGC TCGGCATTCC GGTAAGCGCC ATTGCGGCAG TGGGCGCGCT GATATTATTT GTCGTCGCTA AACGCGGTCA TGCGATTAAT ACGGGTAAGG TGCTGCGCGG TGCGCCCTGG CAGATTGTCA TCTTCTCGCT CGGCATGTAT CTGGTGGTTT ATGGCCTGCG CAATGCCGGA TTAACGGAAT ATCTTTCTGG CGTACTCAAC GTGCTGGCGG ATAACGGCCT GTGGGCGGCG ACGCTCGGCA CCGGATTCCT CACCGCCTTC CTCTCTTCTA TTATGAACAA TATGCCGACG GTACTGGTTG GCGCGTTGTC CATTGATGGC AGCACGGCAT CTGGTGTCAT CAAAGAAGCG ATGGTTTATG CCAACGTAAT TGGCTGCGAT TTGGGGCCGA AAATTACACC GATTGGTAGC CTGGCAACGC TGCTCTGGCT GCACGTACTT TCGCAGAAGA ATATGACTAT CAGCTGGGGA TATTACTTCC GTACAGGGAT TATCATGACC CTGCCGGTGC TGTTTGTGAC GCTCGCCGCG CTGGCGCTTC GTCTCTCTTT CACTTTGTAA
|
Protein sequence | MLLAGAIFVL TIVLVIWQPK GLGIGWSATL GAVLALVTGV VHPGDIPVVW NIVWNATAAF IAVIIISLLL DESGFFEWAA LHVSRWGNGR GRLLFTWIVL LGAAVAALFA NDGAALILTP IVIAMLLALG FNKGTTLAFV MAAGFIADTA SLPLIVSNLV NIVSADFFSL DFREYASVMV PVDIAAILAT LVMLHLYFRK DIPQNYDMAL LKSPAEAVKD PATFKTGGVV LLLLLVGFFV LEPLGIPVSA IAAVGALILF VVAKRGHAIN TGKVLRGAPW QIVIFSLGMY LVVYGLRNAG LTEYLSGVLN VLADNGLWAA TLGTGFLTAF LSSIMNNMPT VLVGALSIDG STASGVIKEA MVYANVIGCD LGPKITPIGS LATLLWLHVL SQKNMTISWG YYFRTGIIMT LPVLFVTLAA LALRLSFTL
|
| |