Gene EcHS_A3668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3668 
SymbolzntA 
ID5593636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3652721 
End bp3654919 
Gene Length2199 bp 
Protein Length732 aa 
Translation table11 
GC content58% 
IMG OID640922784 
Productzinc/cadmium/mercury/lead-transporting ATPase 
Protein accessionYP_001460264 
Protein GI157162946 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2217] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
[TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase
[TIGR01525] heavy metal translocating P-type ATPase 


Plasmid Coverage information

Num covering plasmid clones64 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGACTC CTGACAATCA CGGCAAGAAA GCCCCTCAAT TTGCCGCGTT CAAACCGCTA 
ACCACGGTAC AGAACGCCAA CGACTGTTGC TGCGACGGCG CATGTTCCAG CACGCCAACG
CTCTCTGAAA ACGTCTCCGG CACCCGCTAT AGCTGGAAAG TCAGCGGCAT GGACTGCGCC
GCCTGTGCGC GCAAGGTAGA AAATGCCGTG CGCCAGCTTG CAGGCGTGAA TCAGGTGCAG
GTACTGTTCG CCACCGAAAA ACTGGTGGTT GATGCCGACA ATGACATCCG TGCACAAGTT
GAATCTGCGG TGCAAAAAGC GGGCTATTCC CTGCGTGATG AGCAGGCCGC CGACGAGCCA
CAAGCATCGC GCCTGAAAGA GAATCTGCCG CTGATTACGC TAATCGTGAT GATGGCAATC
AGCTGGGGTC TGGAGCAGTT CAATCATCCG TTCGGGCAAC TGGCGTTTAT CGCGACCACG
CTGGTTGGGC TGTACCCGAT TGCTCGTCAG GCATTACGGC TGATCAAATC AGGCAGTTAC
TTCGCCATTG AAACGTTAAT GAGCGTAGCT GCTATTGGTG CGCTGTTTAT TGGCGCAACG
GCTGAAGCTG CGATGGTGTT GCTGCTGTTT TTGATTGGTG AACGACTGGA AGGCTGGGCC
GCCAGCCGCG CGCGTCAGGG CGTTAGCGCG TTAATGGCGC TGAAACCAGA AACCGCCACG
CGCCTGCGTA ACGGTGAGCG GGAAGAGGTG GCGATTAACA GCCTGCGCCC TGGCGATGTG
ATTGAAGTTG CGGCGGGCGG GCGTTTGCCT GCCGACAGTA AACTGCTCTC GCCCTTTGCC
AGTTTTGATG AAAGCGCCCT GACCGGCGAA TCCATTCCGG TGGAGCGCGC AACGGGCGAT
AAAGTTCCTG CAGGAGCCAC CAGCGTAGAC CGTCTGGTAA CGCTGGAAGT GCTGTCAGAA
CCGGGTGCCA GCGCCATTGA CCGGATTCTG AAACTGATCG AAGAAGCCGA AGAGCGCCGC
GCCCCCATTG AGCGGTTTAT CGACCGTTTC AGCCGTATTT ATACGCCAGC AATTATGGCC
GTCGCCCTGC TGGTAACGCT GGTGCCGCCG CTGCTGTTTG CCGCCAGCTG GCAAGAGTGG
ATTTATAAAG GGCTGACGCT GCTGCTGATT GGCTGCCCGT GTGCGTTAGT TATCTCCACG
CCCGCGGCGA TTACCTCCGG GCTGGCAGCA GCGGCACGTC GTGGGGCGTT AATTAAAGGC
GGTGCGGCGC TGGAGCAGCT AGGTCGTGTT ACCCAGGTGG CGTTTGATAA AACCGGTACG
CTGACCGTCG GTAAACCGCG CGTTACCGCG ATTCATCCGG CAACGGGTAT TAGTGAATCT
GAACTGCTGA CACTGGCGGC GGCGGTCGAG CAAGGCGCGA CGCATCCACT GGCGCAGGCG
ATCGTGCGCG AAGCACAGGT TGCTGCACTC GCCATTCCCG CCGCCGAATC ACAGCGGGCG
CTGGTCGGGT CTGGCATTGA AGCGCAGGTT AACGGTGAGC GCGTATTGAT TTGCGCTGCC
GGGAAACATC CCGCTGATGC ATTTGCTGGT TTAATTAACG AACTGGAAAG CGCCGGGCAA
ACGGTAGTGC TGGTAGTACG TAACGATGAC GTGCTTGGTG TCATTGCGTT ACAGGATACC
CTGCGCGCCG ATGCTGCAAC TGCCATCAGT GAACTGAACG CGCTGGGCGT CAAAGGGGTG
ATCCTCACCG GCGATAATCC ACGCGCAGCG GCGGCAATTG CCGGGGAGCT GGGGCTGGAG
TTTAAAGCGG GCCTGTTGCC GGAAGATAAA GTCAAAGCGG TGACCGAGCT GAATCAACAT
GCGCCGCTGG CGATGGTCGG TGACGGTATT AACGACGCGC CAGCGATGAA AGCTGCCGCC
ATTGGGATTG CAATGGGCAG TGGCACAGAC GTGGCGCTGG AAACCGCTGA TGCAGCATTA
ACCCATAACC ACCTGCGCGG CCTGGTGCAA ATGATTGAAC TGGCACGCGC CACTCACGCC
AATATCCGCC AGAACATCAC CATTGCGCTG GGACTGAAAG GGATCTTCCT CGTCACCACG
CTTTTAGGGA TGACCGGACT ATGGCTGGCG GTGCTGGCAG ATACGGGTGC GACGGTGCTG
GTGACAGCGA ATGCGTTAAG ATTGTTGCGC AGGAGATAA
 
Protein sequence
MSTPDNHGKK APQFAAFKPL TTVQNANDCC CDGACSSTPT LSENVSGTRY SWKVSGMDCA 
ACARKVENAV RQLAGVNQVQ VLFATEKLVV DADNDIRAQV ESAVQKAGYS LRDEQAADEP
QASRLKENLP LITLIVMMAI SWGLEQFNHP FGQLAFIATT LVGLYPIARQ ALRLIKSGSY
FAIETLMSVA AIGALFIGAT AEAAMVLLLF LIGERLEGWA ASRARQGVSA LMALKPETAT
RLRNGEREEV AINSLRPGDV IEVAAGGRLP ADSKLLSPFA SFDESALTGE SIPVERATGD
KVPAGATSVD RLVTLEVLSE PGASAIDRIL KLIEEAEERR APIERFIDRF SRIYTPAIMA
VALLVTLVPP LLFAASWQEW IYKGLTLLLI GCPCALVIST PAAITSGLAA AARRGALIKG
GAALEQLGRV TQVAFDKTGT LTVGKPRVTA IHPATGISES ELLTLAAAVE QGATHPLAQA
IVREAQVAAL AIPAAESQRA LVGSGIEAQV NGERVLICAA GKHPADAFAG LINELESAGQ
TVVLVVRNDD VLGVIALQDT LRADAATAIS ELNALGVKGV ILTGDNPRAA AAIAGELGLE
FKAGLLPEDK VKAVTELNQH APLAMVGDGI NDAPAMKAAA IGIAMGSGTD VALETADAAL
THNHLRGLVQ MIELARATHA NIRQNITIAL GLKGIFLVTT LLGMTGLWLA VLADTGATVL
VTANALRLLR RR