Gene Information |
Locus tag | Rpal_4071 |
Symbol | |
ID | 6411755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4373195 |
End bp | 4374247 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642713953 |
Product | arsenical-resistance protein |
Protein accession | YP_001993042 |
Protein GI | 192292437 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
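The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(4373195, 4374247)
print(length_bp)                  # 1053
print(protein_length(length_bp))  # 350
```

Under these assumptions the computed values match the Gene Length (1053 bp) and Protein Length (350 aa) rows of the record.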
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00163154 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCACCT TCGAACGCTA TCTGACCCTG TGGGTCGCGC TGTGCATCGT CGTCGGGATC GCGCTCGGGC ATCTGATGCC GGGTGCGTTT CAGGCGATCG GTGCAGCCGA GGTCGCCAAG GTCAATCTGC CGGTGGCGGC GCTGATCTGG CTGATGATCA TCCCGATGCT GGTGCGGATC GACTTCGCCG CACTCGGGCG CGTGCGCGAG CATTGGCGCG GCATCGGCGT GACGCTGTTC ATCAACTGGG CGGTGAAGCC GTTCTCGATG GCGGCGCTGG CGTGGCTGTT CGTGGGCTAT CTGTTCAAGT CCTATCTGCC GGCGGACCAG ATCAACTCCT ATATCGCCGG CCTGATCATT CTGGCGGCGG CGCCGTGCAC CGCGATGGTG TTCGTATGGT CGAACCTGAC CAAGGGCGAG CCGCATTTCA CGCTGAGCCA AGTGGCGCTC AATGACACCA TCATGGTGTT CGCATTCGCG CCGATCGTCG GTCTGCTGCT CGGCCTGTCG GCCATCACCG TGCCGTGGGA CACGCTGGTG CTATCGGTGG TGCTGTACAT CGTGGTGCCG GTGATCGTGG CTCAGGGCCT GCGGCGCTGG GCGCTGGCGT CGGGCGGTGA AACACAGTTG CAGCGGCTGC TCGGCATCTT CCAGCCGCTG TCGCTGGTTG CGCTGCTGGC CACGCTGGTG CTGCTGTTCG GCTTTCAGGG CGAGCAGATC ATCCGGCAGC CGCTGGTGAT CGCGCTGCTC GCAGTGCCGA TCCTGATCCA GGTGTATTTC AACGCCGGGC TCGCGTATCT GCTCAACCGG CTCAGCGGCG AACAGCATTG CGTTGCCGGC CCGTCGGCGC TGATCGGCGC CAGCAACTTC TTCGAACTCG CGGTCGCGGC GGCGATCAGC CTGTTCGGCT TCGAATCCGG CGCCGCACTC GCCACCGTGG TCGGCGTGCT GATCGAGGTG CCGGTGATGC TGACGGTGGT GGCGATCGTC AACCGCTCCA AGCGCTGGTA CGAAGCGGAT CAGCGCGCGC CGGTGGCGCG GACGCCGGGC TGA
|
Protein sequence | MSTFERYLTL WVALCIVVGI ALGHLMPGAF QAIGAAEVAK VNLPVAALIW LMIIPMLVRI DFAALGRVRE HWRGIGVTLF INWAVKPFSM AALAWLFVGY LFKSYLPADQ INSYIAGLII LAAAPCTAMV FVWSNLTKGE PHFTLSQVAL NDTIMVFAFA PIVGLLLGLS AITVPWDTLV LSVVLYIVVP VIVAQGLRRW ALASGGETQL QRLLGIFQPL SLVALLATLV LLFGFQGEQI IRQPLVIALL AVPILIQVYF NAGLAYLLNR LSGEQHCVAG PSALIGASNF FELAVAAAIS LFGFESGAAL ATVVGVLIEV PVMLTVVAIV NRSKRWYEAD QRAPVARTPG
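The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record: it assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TGA), and it builds the codon table from the standard NCBI ordering, whose amino-acid assignments translation table 11 shares with table 1. Table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG. The helpers `clean`, `translate`, and `gc_percent` are hypothetical names.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGTCCACCT TCGAACGCTA TCTGACCCTG"))  # MSTFERYLTL
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would be the way to compare against the complete protein sequence and the GC content row.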