Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0170 |
Symbol | |
ID | 3907775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 182676 |
End bp | 185453 |
Gene Length | 2778 bp |
Protein Length | 925 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637882052 |
Product | ABC-2 type transport system |
Protein accession | YP_483793 |
Protein GI | 86747297 |
COG category | [R] General function prediction only [V] Defense mechanisms |
COG ID | [COG1131] ABC-type multidrug transport system, ATPase component [COG4152] ABC-type uncharacterized transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.476544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGATC CCGCGGTCGG GACGGACGCG CGCGCGCCGG TGGTCCGGCT CGCCGCGGTG TCGCTGCGCT ACGGCAAGAC GCTGGCGCTC GACGACGTCA CGCTCGATTT GCCGTCGGGC TGCATGATCG GCCTGATCGG TCCCGATGGC GTCGGCAAAT CGAGCCTGCT GTCGCTGGTC TCGGGGGCGC GCGCGGTGCA GCAGGGTCGC GTCGAGGTGT TCGGCGGCGA CATCGCCGAT GCCGCACATC GCCGCGATAC GTGTCCGCGG ATCGCCTATA TGCCGCAGGG GCTCGGCAAG AATCTGTATC CGACGCTGTC GGTGTTCGAG AATGTCGACT TCTTCGGCCG GCTGTTCGGC CAAGGCCGTC GCGAGCGCGC CGCGCGGATC GCCGAACTAC TTGAAAGTAC CGGGCTTTCG CCGTTCGCCG AGCGGCCTGC CGGAAAGCTG TCCGGCGGTA TGAAGCAGAA GCTCGGGCTG TGCTGCGCGC TGATCCACGA TCCGGATCTG CTGATCCTAG ACGAGCCGAC CACCGGCGTC GATCCGCTGT CGCGCGGGCA GTTCTGGGAA TTGATCAACG ACATCAGGGC GCAACGCCCC GGTATGAGCG TGATCGTGGC GACCGCCTAC ATGGAGGAGG CGGAGCGCTT CGACCATCTG GTGGCGATGA ATGCCGGGCA GGTGCTGGCG ACCGGAACGC CTGCGGATCT GTTGCGGCAA ACTGGTGGCA AATCGTTGGA TGCCGCCTTC ATCGCGCTGT TGCCCGAAGA CGAGCGCCGC GGCCATGCCG AGGTGGTGAT TCCGCCGCGC ACGACCGGCA CGACCGGCAT CGCGATCGAG GCTGAACATC TCACCATGCG GTTCGGCGAC TTCACCGCGG TCGACGACGT CTCGTTCCGG ATCGAGCAGG GCGAGATCTT CGGCTTCCTC GGCTCCAACG GCTGCGGCAA GACCACGACG ATGAAGATGC TGACCGGTCT GCTCGCGGCC AGCGAGGGCA CCGCGAAGCT GTTCGGCAAC GAGGTCGATC CGAACGACAT GGCGGTGCGC CGGCGCGTCG GTTACATGTC GCAGGCGTTC TCGCTCTACA CCGAACTCAC CGTGCGGCAG AATCTCGAAC TGCACGCGCG GCTGTTCCAG ATGGAGCCCG CCAAGATCGC GCCGCGGATC GCCGAGATGG AGCGCCGCTT CGATCTCGCC GAGGTGATCG ACAAGCTGCC GGATGAGTTG CCGCTCGGCA TCCGTCAGCG GCTGTCGCTG GCGGTGGCGA TGATCCACTC GCCCGACATT CTGATCCTCG ACGAGCCGAC CTCCGGCGTC GATCCGATCG CGCGCGACGG CTTCTGGCAG ATGTTGTCCG ACCTGTCGCG CAACGACAAC GTCACCATCT TCGTTTCCAC CCACTTCATG AACGAGGCGG AGCGCTGCGA CCGCATTTCG CTGATGCATG CCGGCCGCGT GCTGATCAGC GACACGCCGG GCGCGATCGT CGCGAGCCGA TCGGCGGCGA GCCTGGAGGA CGCCTTCATC GCCTATCTGG AGGAGGCGAT CGGAACCGCG GCGACGCCAT CGGCGCCGCA GACCACTGCC GCGCAAGTCA CATCCGCAGC AGATGAGGAC GCCCCGCATC CGCCCCGGGC ATCGTCGTCC TGGTTCGATC TGCGGCGGAT GCTGGCTTAT ACGCGGCGCG AGGCGCTCGA ACTGCAGCGC GATCCGATCC GCGCCACGCT GGCGCTGATC GGCAGCGTGG TGCTGATGTT CGTGCTCGGT TACGGCATCA ATCTCGACGT CGAAAAACTG ACCTTCGCCG CGCTCGATCG CGACGACACT GCGATCAGCC GCGACTACAT CCTCGACATC GCGGGCTCGC GCTATTTCGC CGAGCAGCGC CCGATCACCG ATTACGCCGA TCTCGACCGC AGGATGCGCA GCGGCGAGCT GACGATGGCG ATCGAGATCC CGCCGGGATT CGGCCGCGAC GTCTCGCGCG GCCGTTCGGT CGAGGTCGGC GCCTGGATCG ACGGCGCGAT GCCGTCGCGG GCGGAAACCG CGCGCGGCTA CGCGCAGGCG ATGCATCTCG GCTGGCTCAA GCGGAAGGCA AGCGAGCTCT ACGGCGATGC CGCGACCGCC GGCAGCTTCC AGATCGCGAT GCGCTATCGC TACAATCCGG ACATCCGGAG CGTGGTGGCG ATGGCGCCCG CGGTGATTCC GCTGCTGTTG CTGATGATTC CGGCGATGCT CGCAGCGCTC AGCGTGGTGC GCGAGAAGGA GCTCGGCTCG ATCATCAATT TCTACGCGAC GCCGACCACG CGGCTGGAAT TCCTGATCGG CAAGCAACTG CCCTATGTGG TGCTGGCGAT GCTGAATTTC GTGATGCTGA CGGCGTTCGC GATTGTGGTG TTCCGGGTGC CGTTCACCGG CAGCTTCTTG GCCTTCGGCA CGGGCGCGCT GCTCTACGTC GTGTTCGCCA CCGCGCTCGG CCTGCTGCTG TCCACCTTCA TGAACAGCCA GATCGCGGCG ATCTTCGGCA CCACGCTGCT GACGCTGATC CCGGCGATCC AGTTCTCCGG CCTGATCGAT CCGGTTTCGT CGCTGCAGGG CGCCGGTGCG TTCATCGGCA AGATCTATCC GACCACCTAT TTCGTCGACA TCACGCGCGG CGCGTTCTCG AAGGGGCTGG GCTTCGAACA GATGTGGGGG TCCTTCGTCC CGCTGCTGAT CGCGGTGCCG CTGCTGTTCG GCCTCGGCGC CGCGCTGCTC CAGAAACAGG CCAAGTGA
|
Protein sequence | MSDPAVGTDA RAPVVRLAAV SLRYGKTLAL DDVTLDLPSG CMIGLIGPDG VGKSSLLSLV SGARAVQQGR VEVFGGDIAD AAHRRDTCPR IAYMPQGLGK NLYPTLSVFE NVDFFGRLFG QGRRERAARI AELLESTGLS PFAERPAGKL SGGMKQKLGL CCALIHDPDL LILDEPTTGV DPLSRGQFWE LINDIRAQRP GMSVIVATAY MEEAERFDHL VAMNAGQVLA TGTPADLLRQ TGGKSLDAAF IALLPEDERR GHAEVVIPPR TTGTTGIAIE AEHLTMRFGD FTAVDDVSFR IEQGEIFGFL GSNGCGKTTT MKMLTGLLAA SEGTAKLFGN EVDPNDMAVR RRVGYMSQAF SLYTELTVRQ NLELHARLFQ MEPAKIAPRI AEMERRFDLA EVIDKLPDEL PLGIRQRLSL AVAMIHSPDI LILDEPTSGV DPIARDGFWQ MLSDLSRNDN VTIFVSTHFM NEAERCDRIS LMHAGRVLIS DTPGAIVASR SAASLEDAFI AYLEEAIGTA ATPSAPQTTA AQVTSAADED APHPPRASSS WFDLRRMLAY TRREALELQR DPIRATLALI GSVVLMFVLG YGINLDVEKL TFAALDRDDT AISRDYILDI AGSRYFAEQR PITDYADLDR RMRSGELTMA IEIPPGFGRD VSRGRSVEVG AWIDGAMPSR AETARGYAQA MHLGWLKRKA SELYGDAATA GSFQIAMRYR YNPDIRSVVA MAPAVIPLLL LMIPAMLAAL SVVREKELGS IINFYATPTT RLEFLIGKQL PYVVLAMLNF VMLTAFAIVV FRVPFTGSFL AFGTGALLYV VFATALGLLL STFMNSQIAA IFGTTLLTLI PAIQFSGLID PVSSLQGAGA FIGKIYPTTY FVDITRGAFS KGLGFEQMWG SFVPLLIAVP LLFGLGAALL QKQAK
|
| |