Gene Information |
Locus tag | RPB_3871 |
Symbol | |
ID | 3911675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4424838 |
End bp | 4427564 |
Gene Length | 2727 bp |
Protein Length | 908 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637885772 |
Product | heavy metal translocating P-type ATPase |
Protein accession | YP_487475 |
Protein GI | 86750979 |
COG category | [P] Inorganic ion transport and metabolism; [S] Function unknown |
COG ID | [COG2217] Cation transport ATPase; [COG3350] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC; [TIGR01511] copper-(or silver)-translocating P-type ATPase; [TIGR01525] heavy metal translocating P-type ATPase |
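The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4424838
end_bp = 4427564
gene_length_bp = 2727
protein_length_aa = 908


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, End bp minus Start bp plus one gives the 2727 bp gene length, and 2727 / 3 = 909 codons, one of which is the stop codon, gives the 908 aa protein length.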
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGACA CAGAGCACAC CGGGGGCAGC ACGGCTGTCA AGGATGCGGG TTGCGGATGC TCGGCGGAGG CAGCTGTTCC TGCGCCGCCT GCGGCGTCGT CCTGTTGCGG CCGCCACGCG AGCGATCAGC CGATCTCACC TGCGTCCGCC AAGGCGATCG ATCCCGTCTG CGGCATGACC GTCGACCCGG CGTCCAGCAA GCATCGCTTC GACCACGCCG GCACCACCTA TCACTTCTGC TGCGCCGGAT GCCGCACCAA ATTCGCGGCC GACCCGGAAG GCATTCTGGC CAAAGCCGCC AGGCCGACTG CGGCGCCCAA GCCCGCAGCG CAATTGCATC AACTGACCGA CTTCGCCGCA CCGTCGTCCT GCTGCGGCGG GCATGATCAT GCCGCACATC ACCACGATCA CGGCGCGACT GCAGCAGCCG ACGGCAAGGT GATCGATCCG GTCTGCGGCA TGAAAGTGGA CCCGGCGACC ACGCCGCATC GGTTCGATTA CCAGGGCCAG ACCTATTTCT TCTGCGCGGC GAGCTGCCGC GGCAAATTCG CCGCCGATCC CGTCTCCTAT CTCGACAAGT CGAAAGCGAA GCCGGCGCCG GTGGTGCCGG AGGGCACGAT CTACACCTGC CCGATGGATC CGCAGATCCG TCAGGTCGGC CCGGGAAGCT GCCCGATCTG CGGCATGGCG CTCGAACCCG AGCTGGTGTC GCTCGACGCG CCGCCGAATG CCGAACTGAT CGACATGACC CGGCGTTTCT GGATCGGCCT CGCGCTGGCG CTGCCGGCGG TCGTGCTGGA AATGGGCGGC CACCTCGTTG GTGGTCACGG CTTGATCGAT CCTGCGCTGT CGAATTGGAT CCAGCTCGCC TGCGCGACGC CGGTCGTGCT GTGGGCCGGC TGGCCGTTCT TCGTCCGCGG CTGGCAGTCG CTGGTCACGC GCAACCTCAA CATGTTCACG CTGGTCGCGA TGGGCACCGG CGTCGCCTAT GTCTACAGCC TGGTCGCGAC GCTGGCGCCG CAGCTGTTCC CACCGGCCTT CCAGAGCCAT GGCGGCAGCG TGCCGGTGTA TTTCGAGGCC GCAGCGGTGA TCACCGTGCT GGTGCTGCTC GGCCAGGTGC TCGAACTGCG CGCCCGCGAG GCGACCTCCG GCGCGATCAA GGCACTGCTC ACCCTCGCGC CGAAATCCGC ACGGCGAATC GCGGCGGACG GGACCGACCA CGAGGTCGAG ATCGACAGCC TCGCGGTCGG CGACAAGCTC CGCGTCCGCC CCGGCGAAAA GGTGCCGGTC GACGGCATCA TCCTCGAGGG ACGCTCGACG CTCGACGAAT CGCTGGTGAC CGGCGAATCG ATGCCGGTGA CGCGCGAGGC CGGCGGCAAG GTCGTCGCCG GAACGCTCAA CCAGGCCGGC GGCTTCGTGA TGCGCGCCGA ACAGGTCGGC CGCGACACCG TGCTGTCGCA GATCGTGCAG ATGGTGGCGC AGGCGCAGCG ATCGCGCGCG CCGATCCAGC GCGTCGCCGA CCTCGTCGCG GGCTGGTTCG TGCCGGCCGT GGTGCTGGCC GCGCTGGTCG CGTTCGCCGC CTGGGCGACC TTCGGCCCCG AGCCGCGGCT GACCTTCGCG CTGGTCGCCG CGGTCAGCGT GCTGATCATC GCCTGCCCGT GCGCGCTCGG TCTCGCCACG CCGATGTCGA TCATGGTCGG CGTCGGCCGC GGCGCGCAGG CCGGGGTGCT GATCCGCAAC GCCGAAGCGC TGGAGCGGAT GGAGAAGGTC GACACGCTGG TGATCGACAA GACCGGCACG 
CTGACCGAAG GCAAGCCGAA GGTGGTGGCG ATCGCCACCG CGAGCGGCTT CGACGAGGCC GAATTGCTGC GGCTCGCGGC CGGCGTCGAA CGCGCCAGCG AACACCCGCT CGCGCACGCC ATCGTCACCG CCGCCAGTGA TCGCAGTCTC GACCTCGCGC CGGTCGACGG GTTCGAGGCG CCGACCGGCA AGGGCGCGAC CGGCCGGGTC GCCGGCCGTT CGGTCGTGAT CGGCAACGTC GACTATCTCG CTTCGCTCGG GATCGACACG GCCCCGCTCG CCGACATAGC GGAGCATCAC CGCGCCGACG GAGCCACCGT GGTCAGCGTC GGCATCGACG GGCGGTTCGC CGGATTGATC GCGATTGCCG ATCCGGTGAA AGCATCGACG CCGGACGCGT TGCGCGCACT CGCCGCCGAA GGCCTCCGGG TGATCATGCT GACCGGCGAC AACCGAACCA CGGCGCAGGC CGTCGCCCGA AAACTCGGCA TCGCCGATGT CGAAGCCGAG GTCCTGCCCG ATCAGAAGAG CGCGGTGGTC GAAAAGCTGC GCAAGCAGGG CCGCATCGTC GCGATGGCCG GCGACGGCGT CAACGACGCT CCGGCATTGG CCGCCGCCGA TGTCGGCATC GCGATGGGAA CCGGAACCGA CGTGGCGATG GAGAGCGCGG GCATCACGCT GCTGAAGGGC GACCTTGGCG GCATCGTTCG CGCGCGAAAA CTGTCCCAGG CGACGATGCG CAATATCCGG CAGAATCTGT TCTTCGCCTT CATCTACAAT TCGGCTGGAA TCCCGATCGC CGCCGGCATT CTGTATCCGA GCTTCGGCCT GCTGCTGTCG CCGATCATCG CCGCAGCGGC AATGTCGCTG TCGTCGGTCA GCGTGATCGG CAATGCCCTG CGGCTGCGCG CCACCTCGCT GGATTGA
|
Protein sequence | MHDTEHTGGS TAVKDAGCGC SAEAAVPAPP AASSCCGRHA SDQPISPASA KAIDPVCGMT VDPASSKHRF DHAGTTYHFC CAGCRTKFAA DPEGILAKAA RPTAAPKPAA QLHQLTDFAA PSSCCGGHDH AAHHHDHGAT AAADGKVIDP VCGMKVDPAT TPHRFDYQGQ TYFFCAASCR GKFAADPVSY LDKSKAKPAP VVPEGTIYTC PMDPQIRQVG PGSCPICGMA LEPELVSLDA PPNAELIDMT RRFWIGLALA LPAVVLEMGG HLVGGHGLID PALSNWIQLA CATPVVLWAG WPFFVRGWQS LVTRNLNMFT LVAMGTGVAY VYSLVATLAP QLFPPAFQSH GGSVPVYFEA AAVITVLVLL GQVLELRARE ATSGAIKALL TLAPKSARRI AADGTDHEVE IDSLAVGDKL RVRPGEKVPV DGIILEGRST LDESLVTGES MPVTREAGGK VVAGTLNQAG GFVMRAEQVG RDTVLSQIVQ MVAQAQRSRA PIQRVADLVA GWFVPAVVLA ALVAFAAWAT FGPEPRLTFA LVAAVSVLII ACPCALGLAT PMSIMVGVGR GAQAGVLIRN AEALERMEKV DTLVIDKTGT LTEGKPKVVA IATASGFDEA ELLRLAAGVE RASEHPLAHA IVTAASDRSL DLAPVDGFEA PTGKGATGRV AGRSVVIGNV DYLASLGIDT APLADIAEHH RADGATVVSV GIDGRFAGLI AIADPVKAST PDALRALAAE GLRVIMLTGD NRTTAQAVAR KLGIADVEAE VLPDQKSAVV EKLRKQGRIV AMAGDGVNDA PALAAADVGI AMGTGTDVAM ESAGITLLKG DLGGIVRARK LSQATMRNIR QNLFFAFIYN SAGIPIAAGI LYPSFGLLLS PIIAAAAMSL SSVSVIGNAL RLRATSLD
|
| |