Gene EcSMS35_3752 details


Gene Information

Locus tag: EcSMS35_3752
Symbol: zntA
ID: 6142828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3817745
End bp: 3819943
Gene Length: 2199 bp
Protein Length: 732 aa
Translation table: 11
GC content: 58%
IMG OID: 641618578
Product: zinc/cadmium/mercury/lead-transporting ATPase
Protein accession: YP_001745718
Protein GI: 170683459
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2217] Cation transport ATPase
TIGRFAM ID: [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
            [TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase
            [TIGR01525] heavy metal translocating P-type ATPase
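
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the coding sequence ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3817745
end_bp = 3819943

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 2199 732
```

Under these assumptions the results agree with the listed Gene Length (2199 bp) and Protein Length (732 aa).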


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGACTC CTGACAATCA CGGCAAGAAA GCCCCTCAAT TTGCTGCGTT CAAACCGCTA 
ACTACGGTAC AGAACACCAA CGACTGTTGC TGCGATGGCG CATGTTCCAG CACGCCAACG
CTCTCTGAAA GCGTCTCCGG CACCCGCTAT AGCTGGAAAG TCAGCGGCAT GGACTGCGCC
GCCTGTGCGC GCAAAGTGGA AAATGCCGTG CGCCAGCTTG CAGGCGTGAA TCAGGTGCAG
GTGTTGTTCG CCACCGAAAA ATTGGTAGTC GATGCCGACA ATGACATCCG TGCCCAGGTT
GAATCTGCGG TGCAAAAAGC GGGCTATTCC CTGCGCGATG AACAGACCAC CGACGAACCA
CAAGCATCAC GCCTGAAAGA GAATCTGCCG CTGATTACGC TAATCGTGAT GATGGCAATC
AGCTGGGGTC TGGAGCAGTT CAATCATCCG TTTGGTCAAC TGGCGTTTAT CGCGACCACG
CTGGTTGGGC TGTACCCGAT TGCTCGTCAG GCATTACGGC TGATCAAATC TGGCAGCTAC
TTCGCCATTG AAACCTTAAT GAGCGTAGCC GCTATTGGTG CGCTGTTTAT TGGCGCAACG
GCTGAAGCTG CGATGGTGTT GCTGCTGTTT TTGATTGGTG AACGACTGGA AGGCTGGGCC
GCCAGCCGCG CGCGTCAGGG CGTTAGCGCG TTAATGGCGC TGAAACCGGA AACCGCCACG
CGTCTGCGTA ACGGTGAGCG TGAAGAGGTG GCGATTAACA GCCTGCGCCC TGGCGATGTG
ATTGAAGTTG CGGCAGGTGG GCGTTTGCCT GCCGACGGTA AACTGCTCTC ACCGTTTGCC
AGTTTTGATG AAAGCGCCCT GACCGGAGAA TCCATTCCGG TGGAGCGCGC AACGGGCGAT
AAAGTCCCTG CTGGTGCCAC CAGCGTAGAC CGTCTGGTAA CGCTGGAAGT GCTGTCAGAA
CCGGGTGCCA GCGCCATTGA CCGGATTCTG AAACTGATCG AAGAAGCCGA AGAGCGTCGC
GCCCCCATTG AGCGGTTTAT CGACCGTTTC AGCCGTATCT ACACGCCAGC GATTATGGCC
GTCGCTCTGC TGGTGACGCT GGTGCCGCCG CTGCTGTTTG CCGCCAGCTG GCAAGAGTGG
ATTTATAAAG GGCTGACGCT GCTGCTGATT GGTTGCCCGT GTGCGTTAGT TATCTCCACG
CCCGCGGCGA TTACCTCCGG GCTGGCGGCA GCAGCGCGTC GTGGGGCGTT GATTAAAGGC
GGCGCGGCGC TGGAGCAGCT GGGTCGTGTT ACCCAGGTGG CGTTTGATAA AACCGGTACG
CTGACCGTCG GTAAACCGCG CGTTACCGCG ATTCATCCGG CAACGGGTAT TAGTGAATCT
GAACTGCTGA CACTGGCGGC GGCGGTCGAG CAAGGCGCGA CGCATCCACT GGCACAAGCC
ATTGTGCGCG AAGCACAGGT TGCTGAACTC GCCATTCCCA CCGCCCAATC ACAGCGGGCG
CTGGTCGGGT CTGGCATTGA AGCGCAGGTT AACGGTGAGC GCGTGTTGAT ATGCGCTGCC
GGAAAACATC CTGCTGATGC ATTTGCTGGT TTGATTAATG AACTGGAAAG CGCCGGGCAA
ACGGTAGTGC TGGTAGTACG TAATGATGAC GTGCTGGGTG TCATTGCGTT GCAGGATACC
CTGCGCGCCG ATGCTGCAAC TGCTATCAGT GAACTGAACG CGCTGGGGGT TAAAGGAGTG
ATCCTCACCG GCGATAATCC ACGCGCCGCG GCGGCAATTG CCGGGGAGTT GGGGCTGGAA
TTTAAAGCGG GCCTGTTGCC GGAAGATAAA GTCAAAGCGG TGACCGAGCT GAATCAACAT
GCGCCGCTGG CGATGGTCGG TGATGGTATT AACGACGCGC CAGCGATGAA AGCTGCCGCC
ATCGGGATTG CAATGGGCAG TGGCACAGAC GTGGCGCTGG AAACCGCCGA TGCAGCATTA
ACCCATAACC ACCTGCGCGG TCTGGTGCAA ATGATTGAAC TGGCACGCGC CACTCACGCC
AATATCCGCC AGAACATCAC CATTGCGCTG GGGCTGAAAG GCGTTTTCCT CGTCACCACA
CTGTTAGGAA TGACCGGGCT ATGGCTGGCG GTGCTGGCAG ATACGGGTGC GACGGTGCTG
GTGACGGCGA ATGCGTTAAG ATTGTTGCGC AGGAAGTAA
 
Protein sequence
MSTPDNHGKK APQFAAFKPL TTVQNTNDCC CDGACSSTPT LSESVSGTRY SWKVSGMDCA 
ACARKVENAV RQLAGVNQVQ VLFATEKLVV DADNDIRAQV ESAVQKAGYS LRDEQTTDEP
QASRLKENLP LITLIVMMAI SWGLEQFNHP FGQLAFIATT LVGLYPIARQ ALRLIKSGSY
FAIETLMSVA AIGALFIGAT AEAAMVLLLF LIGERLEGWA ASRARQGVSA LMALKPETAT
RLRNGEREEV AINSLRPGDV IEVAAGGRLP ADGKLLSPFA SFDESALTGE SIPVERATGD
KVPAGATSVD RLVTLEVLSE PGASAIDRIL KLIEEAEERR APIERFIDRF SRIYTPAIMA
VALLVTLVPP LLFAASWQEW IYKGLTLLLI GCPCALVIST PAAITSGLAA AARRGALIKG
GAALEQLGRV TQVAFDKTGT LTVGKPRVTA IHPATGISES ELLTLAAAVE QGATHPLAQA
IVREAQVAEL AIPTAQSQRA LVGSGIEAQV NGERVLICAA GKHPADAFAG LINELESAGQ
TVVLVVRNDD VLGVIALQDT LRADAATAIS ELNALGVKGV ILTGDNPRAA AAIAGELGLE
FKAGLLPEDK VKAVTELNQH APLAMVGDGI NDAPAMKAAA IGIAMGSGTD VALETADAAL
THNHLRGLVQ MIELARATHA NIRQNITIAL GLKGVFLVTT LLGMTGLWLA VLADTGATVL
VTANALRLLR RK
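
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a hedged sketch rather than the tool used to produce this record. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Because this gene starts with ATG, no special start-codon handling is included. Only the first and last lines of the gene sequence are used as input here.

```python
# Codon table in NCBI base order (T, C, A, G); table 11 uses these same
# codon-to-residue assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_block.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTCGACTC CTGACAATCA CGGCAAGAAA GCCCCTCAAT TTGCTGCGTT CAAACCGCTA"
last_line = "GTGACGGCGA ATGCGTTAAG ATTGTTGCGC AGGAAGTAA"

print(translate(first_line))  # MSTPDNHGKKAPQFAAFKPL
print(translate(last_line))   # VTANALRLLRRK
```

The two outputs match the start and end of the protein sequence listed above. The last line can be translated on its own because the 36 preceding lines hold 60 bases each, so it begins on a codon boundary. Passing the full gene sequence to `gc_percent` would give a value to compare against the listed GC content.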