Gene ECH74115_4789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4789 
SymbolzntA 
ID6969186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4428778 
End bp4430976 
Gene Length2199 bp 
Protein Length732 aa 
Translation table11 
GC content58% 
IMG OID643388485 
Productzinc/cadmium/mercury/lead-transporting ATPase 
Protein accessionYP_002272913 
Protein GI209400548 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2217] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
[TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase
[TIGR01525] heavy metal translocating P-type ATPase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACTC CTGACAATCA CGGCAAGAAA GCCCCTCAAT TTGCTGCGTT CAAACCGCTA 
ACCACGGTAC AGAACGCCAA CGACTGTTGC TGCGACGGCG CATGTTCCAG CACGCCAACT
CTCTCTGAAA ACGTCTCCGG CACCCGCTAT AGCTGGAAAG TCAGCGGCAT GGACTGCGCC
GCCTGTGCGC GCAAGGTAGA AAATGCCGTG CGCCAGCTTG CAGGCGTGAA TCAGGTACAG
GTGTTGTTCG CCACCGAAAA ACTGGTGGTT GATGCCGACA ATGACATCCG TGCACAAGTT
GAATCTGCGG TGCAAAAAGC GGGCTATTCC CTGCGCGATG AGCAGGCCGC CGACGAGCCA
CAAGCATCGC GCCTGAAAGA GAATCTGCCG CTGATTACGC TTATCGTGAT GATGGCAATC
AGCTGGGGTT TGGAGCAATT TAATCATCCG TTCGGGCAAC TGGCGTTTAT CGCGACCACG
CTGGTTGGGC TGTACCCGAT TGCTCGTCAG GCATTACGGC TGATCAAATC CGGCAGCTAC
TTCGCCATTG AAACCTTAAT GAGCGTAGCC GCTATTGGTG CACTGTTTAT TGGCGCAACG
GCTGAAGCTG CGATGGTGTT GCTGCTGTTT TTGATTGGTG AACGACTGGA AGGCTGGGCC
GCCAGCCGCG CGCGTCAAGG GGTCAGCGCG TTAATGGCGC TGAAACCAGA AACCGCCACG
CGCCTGCGTA ACGGTGAGCG GGAAGAGGTG GCGATTAACA GCCTGCGCCC TGGCGATGTG
ATTGAAGTCG CCGCAGGTGG ACGTTTGCCT GCCGACGGTA AACTGCTCTC ACCGTTTGCC
AGTTTTGATG AAAGCGCCCT GACCGGCGAA TCTATTCCGG TGGAGCGCGC AACGGGCGAT
AAAGTTCCTG CAGGAGCCAC CAGCGTAGAC CGTCTGGTAA CGCTGGAAGT GCTGTCAGAA
CCGGGTGCCA GCGCCATTGA CCGGATTCTG AAACTGATCG AAGAAGCCGA AGAGCGTCGC
GCACCCATTG AGCGGTTTAT CGACCGTTTC AGCCGTATTT ACACGCCCGC GATTATGGCC
GTCGCTCTGC TGGTGACGCT GGTGCCACCG CTGCTGTTTG CCACCAGCTG GCAGGAGTGG
ATTTATAAAG GGCTGACGCT GCTGCTGATT GGCTGCCCGT GTGCGTTAGT TATCTCAACG
CCTGCGGCGA TTACCTCCGG GCTGGCGGCG GCAGCGCGTC GTGGGGCGTT GATTAAAGGC
GGAGCGGCGC TGGAACAGCT GGGTCGTGTT ACCCAGGTGG CGTTTGATAA AACCGGTACG
CTGACCGTCG GTAAACCGCG CGTTACCGCG ATTCATCCGG CAACGGGTAT TAGTGAATCT
GAACTGCTGA CACTGGCGGC GGCGGTCGAG CAAGGCGCGA CGCATCCACT GGCGCAGGCC
ATCGTGCGCG AAGCACAGGT TGCTGCACTC GCCATTCCCG CCGCCGAATC ACAGCGGGCG
CTGGTCGGGT CTGGCATTGA AGCGCAGGTT AACGGTGAGC GCGTGTTGAT ATGCGCTGCC
GGGAAACATC CCGCTGATGC ATTTGCTGGT TTGATTAATG AACTGGAAAG CGCCGGGCAA
ACGGTTGTGC TGGTAGTACG TAATGATGAC GTGCTGGGTA TCATTGCATT GCAGGATACC
CTGCGCGCCG ATGCTGCAAC TGCCATCAGT GAACTGAACG CGCTGGGCGT CAAAGGGGTG
ATCCTCACCG GCGATAATCC ACGCGCAGCG GCGGCAATTG CCGGGGAGCT GGGGCTGGAG
TTTAAAGCGG GCCTGTTGCC GGAAGATAAA GTCAAAGCGG TGACCGAGCT GAATCAACAT
GCGCCGCTGG CGATGGTCGG TGACGGTATT AACGACGCGC CAGCGATGAA AGCTGCCGCC
ATCGGGATTG CAATGGGCAG CGGCACAGAC GTGGCGCTGG AAACCGCCGA CGCAGCATTA
ACCCATAACC ACCTGCGCGG TCTGGTGCAA ATGATTGAAC TGGCACGCGC CACTCACGCC
AATATCCGCC AGAACATCAC TATTGCGCTG GGGCTGAAAG GGATCTTCCT CGTCACCACG
CTGTTAGGGA TGACCGGACT ATGGCTGGCA GTGCTGGCAG ATACGGGTGC GACGGTGCTG
GTGACAGCGA ATGCGTTAAG ATTGTTGCGC AGGAGATAA
 
Protein sequence
MSTPDNHGKK APQFAAFKPL TTVQNANDCC CDGACSSTPT LSENVSGTRY SWKVSGMDCA 
ACARKVENAV RQLAGVNQVQ VLFATEKLVV DADNDIRAQV ESAVQKAGYS LRDEQAADEP
QASRLKENLP LITLIVMMAI SWGLEQFNHP FGQLAFIATT LVGLYPIARQ ALRLIKSGSY
FAIETLMSVA AIGALFIGAT AEAAMVLLLF LIGERLEGWA ASRARQGVSA LMALKPETAT
RLRNGEREEV AINSLRPGDV IEVAAGGRLP ADGKLLSPFA SFDESALTGE SIPVERATGD
KVPAGATSVD RLVTLEVLSE PGASAIDRIL KLIEEAEERR APIERFIDRF SRIYTPAIMA
VALLVTLVPP LLFATSWQEW IYKGLTLLLI GCPCALVIST PAAITSGLAA AARRGALIKG
GAALEQLGRV TQVAFDKTGT LTVGKPRVTA IHPATGISES ELLTLAAAVE QGATHPLAQA
IVREAQVAAL AIPAAESQRA LVGSGIEAQV NGERVLICAA GKHPADAFAG LINELESAGQ
TVVLVVRNDD VLGIIALQDT LRADAATAIS ELNALGVKGV ILTGDNPRAA AAIAGELGLE
FKAGLLPEDK VKAVTELNQH APLAMVGDGI NDAPAMKAAA IGIAMGSGTD VALETADAAL
THNHLRGLVQ MIELARATHA NIRQNITIAL GLKGIFLVTT LLGMTGLWLA VLADTGATVL
VTANALRLLR RR