Gene Information |
Locus tag | ECH74115_4789 |
Symbol | zntA |
ID | 6969186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4428778 |
End bp | 4430976 |
Gene Length | 2199 bp |
Protein Length | 732 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643388485 |
Product | zinc/cadmium/mercury/lead-transporting ATPase |
Protein accession | YP_002272913 |
Protein GI | 209400548 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2217] Cation transport ATPase |
TIGRFAM ID | [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC; [TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase; [TIGR01525] heavy metal translocating P-type ATPase |
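The coordinate and length fields above are consistent with each other, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp
length_bp = gene_length(4428778, 4430976)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the record's coordinates this reproduces the listed gene length (2199 bp) and protein length (732 aa).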
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGACTC CTGACAATCA CGGCAAGAAA GCCCCTCAAT TTGCTGCGTT CAAACCGCTA ACCACGGTAC AGAACGCCAA CGACTGTTGC TGCGACGGCG CATGTTCCAG CACGCCAACT CTCTCTGAAA ACGTCTCCGG CACCCGCTAT AGCTGGAAAG TCAGCGGCAT GGACTGCGCC GCCTGTGCGC GCAAGGTAGA AAATGCCGTG CGCCAGCTTG CAGGCGTGAA TCAGGTACAG GTGTTGTTCG CCACCGAAAA ACTGGTGGTT GATGCCGACA ATGACATCCG TGCACAAGTT GAATCTGCGG TGCAAAAAGC GGGCTATTCC CTGCGCGATG AGCAGGCCGC CGACGAGCCA CAAGCATCGC GCCTGAAAGA GAATCTGCCG CTGATTACGC TTATCGTGAT GATGGCAATC AGCTGGGGTT TGGAGCAATT TAATCATCCG TTCGGGCAAC TGGCGTTTAT CGCGACCACG CTGGTTGGGC TGTACCCGAT TGCTCGTCAG GCATTACGGC TGATCAAATC CGGCAGCTAC TTCGCCATTG AAACCTTAAT GAGCGTAGCC GCTATTGGTG CACTGTTTAT TGGCGCAACG GCTGAAGCTG CGATGGTGTT GCTGCTGTTT TTGATTGGTG AACGACTGGA AGGCTGGGCC GCCAGCCGCG CGCGTCAAGG GGTCAGCGCG TTAATGGCGC TGAAACCAGA AACCGCCACG CGCCTGCGTA ACGGTGAGCG GGAAGAGGTG GCGATTAACA GCCTGCGCCC TGGCGATGTG ATTGAAGTCG CCGCAGGTGG ACGTTTGCCT GCCGACGGTA AACTGCTCTC ACCGTTTGCC AGTTTTGATG AAAGCGCCCT GACCGGCGAA TCTATTCCGG TGGAGCGCGC AACGGGCGAT AAAGTTCCTG CAGGAGCCAC CAGCGTAGAC CGTCTGGTAA CGCTGGAAGT GCTGTCAGAA CCGGGTGCCA GCGCCATTGA CCGGATTCTG AAACTGATCG AAGAAGCCGA AGAGCGTCGC GCACCCATTG AGCGGTTTAT CGACCGTTTC AGCCGTATTT ACACGCCCGC GATTATGGCC GTCGCTCTGC TGGTGACGCT GGTGCCACCG CTGCTGTTTG CCACCAGCTG GCAGGAGTGG ATTTATAAAG GGCTGACGCT GCTGCTGATT GGCTGCCCGT GTGCGTTAGT TATCTCAACG CCTGCGGCGA TTACCTCCGG GCTGGCGGCG GCAGCGCGTC GTGGGGCGTT GATTAAAGGC GGAGCGGCGC TGGAACAGCT GGGTCGTGTT ACCCAGGTGG CGTTTGATAA AACCGGTACG CTGACCGTCG GTAAACCGCG CGTTACCGCG ATTCATCCGG CAACGGGTAT TAGTGAATCT GAACTGCTGA CACTGGCGGC GGCGGTCGAG CAAGGCGCGA CGCATCCACT GGCGCAGGCC ATCGTGCGCG AAGCACAGGT TGCTGCACTC GCCATTCCCG CCGCCGAATC ACAGCGGGCG CTGGTCGGGT CTGGCATTGA AGCGCAGGTT AACGGTGAGC GCGTGTTGAT ATGCGCTGCC GGGAAACATC CCGCTGATGC ATTTGCTGGT TTGATTAATG AACTGGAAAG CGCCGGGCAA ACGGTTGTGC TGGTAGTACG TAATGATGAC GTGCTGGGTA TCATTGCATT GCAGGATACC CTGCGCGCCG ATGCTGCAAC TGCCATCAGT GAACTGAACG CGCTGGGCGT CAAAGGGGTG ATCCTCACCG GCGATAATCC ACGCGCAGCG GCGGCAATTG CCGGGGAGCT GGGGCTGGAG 
TTTAAAGCGG GCCTGTTGCC GGAAGATAAA GTCAAAGCGG TGACCGAGCT GAATCAACAT GCGCCGCTGG CGATGGTCGG TGACGGTATT AACGACGCGC CAGCGATGAA AGCTGCCGCC ATCGGGATTG CAATGGGCAG CGGCACAGAC GTGGCGCTGG AAACCGCCGA CGCAGCATTA ACCCATAACC ACCTGCGCGG TCTGGTGCAA ATGATTGAAC TGGCACGCGC CACTCACGCC AATATCCGCC AGAACATCAC TATTGCGCTG GGGCTGAAAG GGATCTTCCT CGTCACCACG CTGTTAGGGA TGACCGGACT ATGGCTGGCA GTGCTGGCAG ATACGGGTGC GACGGTGCTG GTGACAGCGA ATGCGTTAAG ATTGTTGCGC AGGAGATAA
|
Protein sequence | MSTPDNHGKK APQFAAFKPL TTVQNANDCC CDGACSSTPT LSENVSGTRY SWKVSGMDCA ACARKVENAV RQLAGVNQVQ VLFATEKLVV DADNDIRAQV ESAVQKAGYS LRDEQAADEP QASRLKENLP LITLIVMMAI SWGLEQFNHP FGQLAFIATT LVGLYPIARQ ALRLIKSGSY FAIETLMSVA AIGALFIGAT AEAAMVLLLF LIGERLEGWA ASRARQGVSA LMALKPETAT RLRNGEREEV AINSLRPGDV IEVAAGGRLP ADGKLLSPFA SFDESALTGE SIPVERATGD KVPAGATSVD RLVTLEVLSE PGASAIDRIL KLIEEAEERR APIERFIDRF SRIYTPAIMA VALLVTLVPP LLFATSWQEW IYKGLTLLLI GCPCALVIST PAAITSGLAA AARRGALIKG GAALEQLGRV TQVAFDKTGT LTVGKPRVTA IHPATGISES ELLTLAAAVE QGATHPLAQA IVREAQVAAL AIPAAESQRA LVGSGIEAQV NGERVLICAA GKHPADAFAG LINELESAGQ TVVLVVRNDD VLGIIALQDT LRADAATAIS ELNALGVKGV ILTGDNPRAA AAIAGELGLE FKAGLLPEDK VKAVTELNQH APLAMVGDGI NDAPAMKAAA IGIAMGSGTD VALETADAAL THNHLRGLVQ MIELARATHA NIRQNITIAL GLKGIFLVTT LLGMTGLWLA VLADTGATVL VTANALRLLR RR
|
| |
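The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch in plain Python. It assumes the sequence is given as text in 10-base blocks separated by whitespace, as in this record, and it uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments of translation table 11
# match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 9 bases of the gene sequence in this record
print(translate("ATGTCGACTC CTGACAATCA CGGCAAGAAA"))
print(translate("AGGAGATAA"))
```

Applied to the first 30 bases of the gene sequence, `translate` gives `MSTPDNHGKK`, the first ten residues of the listed protein. The final nine bases give `RR` followed by the stop codon, matching the end of the protein.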