Gene Information |
Locus tag | EcolC_0214 |
Symbol | |
ID | 6066306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 243232 |
End bp | 244521 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641599615 |
Product | arsenical pump membrane protein |
Protein accession | YP_001723222 |
Protein GI | 170018268 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1055] Na+/H+ antiporter NhaD and related arsenite permeases |
TIGRFAM ID | [TIGR00935] arsenical pump membrane protein |
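The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are inclusive coordinates and that the gene length counts the stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(243232, 244521)
print(length, protein_length(length))  # 1290 429
```

Under those assumptions the values agree with the record: 1290 bp corresponds to 430 codons, one of which is the stop codon, leaving 429 residues.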
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.656406 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTACTGG CAGGCGCTAT CTTTGTCCTG ACCATCGTAT TGGTTATCTG GCAGCCGAAA GGTTTAGGCA TCGGCTGGAG TGCAACGCTC GGCGCAGTAC TGGCGTTAGT TACGGGCGTG GTCCATCCGG GTGATATTCC GGTGGTGTGG AATATCGTCT GGAACGCGAC GGCTGCGTTT ATCGCCGTCA TTATCATCAG CCTGCTGCTG GATGAGTCCG GCTTTTTTGA ATGGGCGGCG CTGCACGTCT CACGCTGGGG TAATGGTCGT GGTCGCTTGC TGTTTACCTG GATTGTCCTG CTCGGTGCTG CCGTTGCCGC CCTGTTTGCC AATGATGGCG CGGCGCTTAT TTTGACACCG ATTGTCATCG CCATGCTGCT GGCTTTAGGG TTCAGTAAAG GCACTACGCT GGCGTTCGTG ATGGCGGCCG GATTCATTGC CGATACCGCC AGCCTGCCAC TTATTGTCTC CAACCTGGTG AATATCGTTT CCGCTGATTT CTTTGGCCTC GGCTTTCGCG AATACGCCTC GGTGATGGTG CCGGTGGATA TCGCCGCGAT TGTTGCCACG CTGGTGATGC TACATCTCTA TTTTCGCAAA GATATTCCGC AGAACTACGA TATGGCGCTG CTGAAATCTC CCGCAGAAGC GATCAAAGAT CCTGCTACGT TCAAAACTGG CTGGGTTGTT TTACTGCTTC TGCTGGTGGG ATTTTTCGTC CTGGAACCGC TCGGCATTCC GGTGAGCGCC ATTGCAGCTG TGGGCGCGCT GATATTATTT GTCGTCGCTA AACGCGGTCA TGCGATTAAT ACGGGTAAAG TCCTGCGCGG TGCCCCCTGG CAGATTGTCA TCTTCTCGCT CGGCATGTAT CTGGTGGTTT ATGGCCTGCG CAATGCCGGA TTAACGGAAT ATCTTTCTGG CGTACTCAAC GTGCTGGCGG ATAACGGCCT GTGGGCCGCG ACGCTCGGCA CCGGATTCCT CACCGCCTTC CTCTCTTCTA TTATGAACAA TATGCCGACG GTACTGGTTG GCGCGTTGTC CATTGATGGC AGCACGGCAT CTGGCGTTAT CAAAGAAGCG ATGGTTTATG CCAATGTGAT TGGCTGCGAT TTGGGACCGA AAATTACCCC AATTGGTAGC CTGGCTACGC TACTCTGGCT GCACGTACTT TCGCAGAAGA ATATGACTAT CAGCTGGGGA TATTACTTCC GTACAGGGAT TATCATGACC CTGCCTGTGC TGTTTGTGAC GCTGGCTGCG CTGGCGCTAC GTCTCTCTTT CACTTTGTAA
|
Protein sequence | MLLAGAIFVL TIVLVIWQPK GLGIGWSATL GAVLALVTGV VHPGDIPVVW NIVWNATAAF IAVIIISLLL DESGFFEWAA LHVSRWGNGR GRLLFTWIVL LGAAVAALFA NDGAALILTP IVIAMLLALG FSKGTTLAFV MAAGFIADTA SLPLIVSNLV NIVSADFFGL GFREYASVMV PVDIAAIVAT LVMLHLYFRK DIPQNYDMAL LKSPAEAIKD PATFKTGWVV LLLLLVGFFV LEPLGIPVSA IAAVGALILF VVAKRGHAIN TGKVLRGAPW QIVIFSLGMY LVVYGLRNAG LTEYLSGVLN VLADNGLWAA TLGTGFLTAF LSSIMNNMPT VLVGALSIDG STASGVIKEA MVYANVIGCD LGPKITPIGS LATLLWLHVL SQKNMTISWG YYFRTGIIMT LPVLFVTLAA LALRLSFTL
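The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: it assumes the sequences are pasted with the spaces between 10-character blocks, it uses the amino acid assignments that NCBI translation table 11 shares with the standard code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Amino acid assignments shared by NCBI translation tables 1 and 11,
# listed for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGTTACTGG CAGGCGCTAT CTTTGTCCTG"))  # MLLAGAIFVL
# Last 18 bases of the gene sequence, ending in the TAA stop codon.
print(translate("CTCTCTTT CACTTTGTAA"))  # LSFTL
```

The two printed fragments match the first ten and last five residues of the protein sequence in the record. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.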