Gene Elen_1544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1544 
Symbol 
ID8415842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1835793 
End bp1837715 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content67% 
IMG OID645024512 
Productheavy metal translocating P-type ATPase 
Protein accessionYP_003181901 
Protein GI257791295 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2217] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
[TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase
[TIGR01525] heavy metal translocating P-type ATPase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0567508 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.0780509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGA AACAGAAACG CACGAGGAAC CGCATCCTCC TCGCCATCGC GCTGTTCGTC 
GTCGTGTACG TCGTGGCGGA GCTGCTTCCG CTGTCGACGT GGCTCGGCAG CGAGACGGCG
GCGCTATGGG CGGAGTTCGC GCTGTTCCTC GTCCCCTATC TCATCGCCGG CTACGACGTG
CTGCTGCGCG CGGCGAAGAA CATCGGCCAC GGCCAGGTGT TCGACGAGAA CTTCCTCATG
AGCGTGGCCA CCATCGGCGC GTTCGCGCTG GTGCTGTTCC CCGACAGCGA CCCGCACATG
GCCGAAGGCG CGGCCGTCAT GCTGTTCTAC CAGGTAGGCG AGCTGTTCCA AAGCTACGCC
GTGGGCAAGA GCCGCAAATC CATCGCCGAC ATGATGGACA TCGCGCCCGA CTTCGCCAAC
GTGGAGCGCG ACGGGCAGCT CGTGCAGGTG GACCCCTACG AGGTGGCCGT GGGCGACGAG
ATCGTCGTGA AAGCCGGCGA ACGCGTGCCG CTCGACGGCG TGGTGCTGTC CGGCACGTCC
CAGCTGGACA CCGCTGCCCT CACCGGCGAG TCGGTGCCGC GCGAAGTGCG CGAGGGCGAC
GAGATCATCT CGGGCTGCGT CAACATGACC GGCCTCATCA CCGTGCGCGT GACCAAGCCC
TTCGGCGAGT CCACCGTCAG CCGCATCCTC GAGCTCGTGG AGAACGCGGC CGAGAAGAAG
GCCAAGACCG AGAACTTCAT CACCCGCTTC GCGCGCTACT ACACCCCGGC CGTGGTGGGC
ATCGCGGTGC TCTTGGCCGT CGTACCGCCG TTGCTGCTGG GCGGCGGCTG GTCGGACTGG
GTGCAGCGCG GGCTCATCTT CCTCGTGGTG TCGTGCCCGT GCGCGCTCGT CATCAGCGTG
CCGCTGTCGT TCTTCGGCGG CATCGGCGGC GCCTCGCGCC TGGGCATCCT CGTGAAGGGC
AGCAACTACC TGGAAACGCT CGCCGGCACC GAGACCGTCG TGTTCGACAA GACCGGCACG
CTCACCGACG GCTCGTTCAA CGTGGTGGCC ATCCACCCGC AGGCGGGAAT CGACCCCGAC
CGCCTGCTGT CCATTGCCGC GCACGCCGAA GCGTATTCCA ACCACCCCAT CGCGCTGTCG
GTGAAGCAGG CGTACTCGGG GCCCATCGAC CAGCAGCGCA TCGACGACGT GCAGGAGCAG
AGCGGCCACG GCGTGCGCGC GAAGATCGAC GAGCACGTAG TGCTGGTGGG CAACGACAAG
CTGATGAGCG AATGCGGCGT GGGCTGCCAC GAGTGCGAGC TGACCGGCAC CATCCTGCAC
GTGTCGCTTG ACGGCGAGTA CATCGGGCAC ATCGTCATCG CCGACGTCGT CAAGCCCGAC
GCAGCCGAGG CCGTCGCCGC GCTGCGGGCC GCGGGCGTGA AGAAGACCGT CATGCTGACG
GGCGACCGCG CCGACGTGGC GGCCGCCGTG GCGAAGGAGC TGGGCATCGA CGAGTTCCGC
GCGCAGCTGC TGCCGCAGGA CAAGGTCGCC GAAGTGGAGA AGCTGCTGGA GGAGACGCAT
GCGCACGGCT CCGGCAAGGG CAAGCTCGCG TTCGTGGGCG ACGGCATCAA CGACGCGCCC
GTGCTGACGC GCGCCGACAT CGGCATCGCC ATGGGCGCGA TGGGCTCGGA CGCGGCCATC
GAGGCGGCCG ACGTGGTGCT CATGGACGAC AAGCCGTCCA ACATCGCCCG CGCTATCGGC
ATCGCCCGCA AGACCATGGG CATCGTGTGG CAGAACATCG TGTTCGCCCT CGGCATCAAG
TTCCTCGTGC TGGTCTTGGC GGCCGTGGGC ATCGCCAACA TGTGGCTGGC CGTGTTCGCC
GACGTAGGCG TTGCCGTGAT CGCCATCCTG AACGCCATGA GGGCGATGAA CGTGAAGAAG
TAA
 
Protein sequence
MNKKQKRTRN RILLAIALFV VVYVVAELLP LSTWLGSETA ALWAEFALFL VPYLIAGYDV 
LLRAAKNIGH GQVFDENFLM SVATIGAFAL VLFPDSDPHM AEGAAVMLFY QVGELFQSYA
VGKSRKSIAD MMDIAPDFAN VERDGQLVQV DPYEVAVGDE IVVKAGERVP LDGVVLSGTS
QLDTAALTGE SVPREVREGD EIISGCVNMT GLITVRVTKP FGESTVSRIL ELVENAAEKK
AKTENFITRF ARYYTPAVVG IAVLLAVVPP LLLGGGWSDW VQRGLIFLVV SCPCALVISV
PLSFFGGIGG ASRLGILVKG SNYLETLAGT ETVVFDKTGT LTDGSFNVVA IHPQAGIDPD
RLLSIAAHAE AYSNHPIALS VKQAYSGPID QQRIDDVQEQ SGHGVRAKID EHVVLVGNDK
LMSECGVGCH ECELTGTILH VSLDGEYIGH IVIADVVKPD AAEAVAALRA AGVKKTVMLT
GDRADVAAAV AKELGIDEFR AQLLPQDKVA EVEKLLEETH AHGSGKGKLA FVGDGINDAP
VLTRADIGIA MGAMGSDAAI EAADVVLMDD KPSNIARAIG IARKTMGIVW QNIVFALGIK
FLVLVLAAVG IANMWLAVFA DVGVAVIAIL NAMRAMNVKK