Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2304 |
Symbol | |
ID | 8416628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2707618 |
End bp | 2709678 |
Gene Length | 2061 bp |
Protein Length | 686 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645025289 |
Product | heavy metal translocating P-type ATPase |
Protein accession | YP_003182652 |
Protein GI | 257792046 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2217] Cation transport ATPase |
TIGRFAM ID | [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC [TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase [TIGR01525] heavy metal translocating P-type ATPase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGAGA CGATGGCCAT GCGCGCACCC TTGCTGGAAC GACTCGAAGA ACTGCTCGAT CGAGGCGGCA CGAAAAAGGA CGTCGCGCTG CTGGCCGTGT CGGCCGTCGC GCTGGGATTC AGCTTCTTCG CGCCCGAGAC GCTGCCCTTC AACCCGGCCT GGATCGCCAT CGTGCTGTGC GGCGCGCCCA TCATCCTGGG CGCCGTCATC GGGCTGGTGA CGGAGTTCGA CATCAAGGCC GACGTGCTCG TATCGCTCGC GCTCATCGCG GCGGTGGCCA TCGGCGAGGA CTTCGCCGCC GGCGAGGTAG CGCTCATCAT GCAGCTGGGC GCGCTGCTGG AGGACCTGAC GGTGGCGAAG GCCCGCGCCG GCATCGAGCG GCTCGTGCAT CTGTCGCCCC GCACGGCGCG CATCGTGCGC GACGGTGTGG AAACCGTCGT CGCCGCCGAG GACGTGCAGG TGGGAGACGT GCTGCGCGTG CTGCCCGGCG AAACCGTGGC GGTGGACGGC ACCGTCCTGG AAGGACGCAC CTCCATCGAC GAGTCGGTGA TGACCGGCGA ATCGCTGCCG GTCGACAAGG CGCCGGGCGA CGAGGTGAAG AGCGGGACGG TGAACCAGTT CGGGGCGTTC GACATGCGCG CCCAGCGCGT GGGCGAGGAC AGCTCGCTCG CCCGCATGAT CGAGCTCGTG CAGTCGGCCG ATGCCGGCAA GGCCAAGATC GTGCGCCTCG CCGACCGATG GGCCACCTGG ATCGTCGTCA TCGCCCTGAC CGCGGCGGCC GGCACGTGGC TCGTCACCGG CGAGATCATC CGCGCCGTCA CCATCCTCGT GGTGTTCTGC CCCTGCGCGC TCGTGCTGGC CACGCCCACG GCCATCATGG CGGCCATCGG CAACGTGACG AAGCTCGGCG TGCTCGTGCG CGAAGGCGAT GCGCTCGAGC GGCTGGCGAA GGTGCGCAAG GTGGCGTTCG ACAAGACCGG CACGCTCACC TACGGCCGCC CCGAAGTGGT GGCCGTCGGG ACGATCGAGG GCTCGAAGGT GGACGAAGAG CAGCTGTATT CCCTGGTGGC CAGCGCGGAA AGCATGAGCG AGCACCCCCT CGGCAAAGCC GTCGTGCGCG GGTGGGAGCA GCGCGCGGCC GCCGCGCCGG GCGAGGCCGC AACCGCGGAG GCCTCCGCAG CCGCGGACGC GGCCTCCGCC GCACCCGCGC TCGCCCGCCC GTCGTCGTTC GACATGGTTC CCGGCCGCGG CGTGCGCGCC TCGGTTGCCG GCCGCGACGT CGCCGCCGGC AACCGCGAGA TGCTGGCCGA GCTGGGCGTC GACGGAGCCG ACGCGCTGCA GGCGGCGGCC CTCCCGTTCG CCGAGCAGGG CTGCACGGTG GTGCTCGCCG CCCTCGACGG CCAGGCCGCC GGATTCATCG CCCTGGCCGA CACCGTGCGC CCCACGGCCG AAGCCGCCGT GCGCGGCATC CGCACGCTGG GCGTCGAGCC GGTGCTGCTC ACCGGCGACC ATGCCCAGGC CGCGCGCCAC ATCGCCGCCC AGGCCGGCAT CGTCGAGGTG GAGGCCGACT GCCTCCCCCA GAACAAGCTG GCCGCCGTCG AGCGCCTCGA GCGCGAAGGC GCGCCCGTGT GCATGGTGGG CGACGGCATC AACGACGCGC CCGCCCTCAA ACGCGCCTCG GTGGGCATCG CCATGGGCGG CGCGGGCAGC GACATCGCCG TCGATGCCGC CGACATCGCG CTCGTGCGCG ACGACATCGC CGCACTGCCC CACCTGCTGG CCATCTCGCA GCGCATGATG ACCACCATCA AGCTGAACAT GGGATTCTCG CTCGGGCTCA ACTTCGTCGC CATCGCGCTG GCCATGACCG GCATCCTGAA CCCCGTCGTG GGCGCGCTCG TGCACAACGC CGGCTCCGTC ATCGTCATCG TGAATTCCGC GCTGCTGCTG AAGTGGAAGC ACCGCGGCGG CTCCGAGGCG AGCGGCGACG CATCGCCCGC ATCGGCGCCC GGCGCGGCAA CCGCCCTGCG CGATCTCGAG CCGGACGCGG AAGCGGCCTA G
|
Protein sequence | MKETMAMRAP LLERLEELLD RGGTKKDVAL LAVSAVALGF SFFAPETLPF NPAWIAIVLC GAPIILGAVI GLVTEFDIKA DVLVSLALIA AVAIGEDFAA GEVALIMQLG ALLEDLTVAK ARAGIERLVH LSPRTARIVR DGVETVVAAE DVQVGDVLRV LPGETVAVDG TVLEGRTSID ESVMTGESLP VDKAPGDEVK SGTVNQFGAF DMRAQRVGED SSLARMIELV QSADAGKAKI VRLADRWATW IVVIALTAAA GTWLVTGEII RAVTILVVFC PCALVLATPT AIMAAIGNVT KLGVLVREGD ALERLAKVRK VAFDKTGTLT YGRPEVVAVG TIEGSKVDEE QLYSLVASAE SMSEHPLGKA VVRGWEQRAA AAPGEAATAE ASAAADAASA APALARPSSF DMVPGRGVRA SVAGRDVAAG NREMLAELGV DGADALQAAA LPFAEQGCTV VLAALDGQAA GFIALADTVR PTAEAAVRGI RTLGVEPVLL TGDHAQAARH IAAQAGIVEV EADCLPQNKL AAVERLEREG APVCMVGDGI NDAPALKRAS VGIAMGGAGS DIAVDAADIA LVRDDIAALP HLLAISQRMM TTIKLNMGFS LGLNFVAIAL AMTGILNPVV GALVHNAGSV IVIVNSALLL KWKHRGGSEA SGDASPASAP GAATALRDLE PDAEAA
|
| |