Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4052 |
Symbol | |
ID | 3911859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4622503 |
End bp | 4624341 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885956 |
Product | ABC transporter related |
Protein accession | YP_487656 |
Protein GI | 86751160 |
COG category | [R] General function prediction only |
COG ID | [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase |
TIGRFAM ID | [TIGR02323] phosphonate C-P lyase system protein PhnK |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0831351 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCC CACCGCTCCT CGACATCCGC GACCTCACCG TCGAATTCGC CACCCGCCGC GGCATCGTCA AGGCGGTGCA GCATTTCGAT ATTTCGGTCG GCAAGGGCGA GACGCTGGCG ATCGTCGGCG AATCCGGCTC GGGCAAATCG GTGACGTCGT TCGCGGTGAT GCGCATCCTT GATCGCGCCG GCCGGATCGC CGAGGGCTCG GTGATGTTCG GCGGCATCGA TATCAAGGCC GCGACCGAGC AGCAGATGCG CGACCTGCGC GGCCGCGAGA TCTCGATGAT CTTCCAGAAC CCGCGTGCCG CGCTCAATCC GATCCGCAAG GTCGGCGACC AGATCGAGGA CGTGCTGCGC CAGCACGTTC AATCCACCTC GTCCGACCGC GGCGAGAAGG CGATCGCGGC TCTCGAGGCG GTGAAGATCG CGCGGCCGCG CGAGCGCTAT CACGCCTATC CGTTCCAACT CTCCGGCGGC ATGTGCCAGC GCGTGGTGAT CGCGCTGGCG CTGGCCTGCA ATCCGCAATT GCTGATCGCC GACGAGCCGA CCACCGGCCT CGACGTCACC ACCCAGAAGG CGGTGATGGA CCTGATCGTC GAACTCACGC GTAGTCGTGG CCTGTCGACC ATCCTGATCA CCCACGACCT CGGCCTCGCC GCCGCCTATT GCGACCGCGT CGTGGTGATG GAGAAGGGCC GCGTGGTCGA GACCGCGCTG GCCGCCGACA TCTTCGCCAA CCCGCAGCAC CCCTACACGA AGAAGTTGAT GCGCGCGACG CCGCGGCTGG GCGTGAGTTT GCGCGAGTTG CTCTCCGACG AAGAACGCGG GACGATGGCG GTCGCGATGC CGGCGCAATC AACCAAGCCC GTCATGGCCG GGCTTGACCC GGCCATCCAT CCCGCTTCGC AAGACGCTTC TTCGAAGGCG ATGGACCCCC GGGTCAAGCC CGGGGGTGAC GACCAGGCAG GCGGGGAGCG CGAAGCGACT CTGCAGGCAC CGCGGCCCCT CCTCGTCGTC GACAAGCTCG TCAAGGAATA TCCCCGCCAG GGCGCGACCG CCGTGCTCGG CAAACTGTTT TCGCGCGGTC CCACGGTCGA GCCCGATGTC TTCCGCGCCG TCGACGGCAT CAGCTTTACG GTCGGCCATG GCGAGAGCGT CGGGCTTGTC GGCGAATCCG GCTGCGGCAA GTCGACGACC TCGATGATGG TGATGCGGCT GCTCGATCAG ACCTCGGGGC GGATCAGTTT CGACGGCGAG GAGATCGGCG CTATTCTTCC GGGACGTTTC GCGCGGCTGC CGCAGCGCAA GGCGATCCAG ATGGTGTTCC AGGACCCGAC CGACAGCCTC AACCCGCGCT TCACCGCCGC ACGCGCCATC GCCGATCCAA TCATGCAGCT CGGCGACATC AAGGGCCGCG ACGCGCTGCG CGCACGCTGC GAGGAATTGG CCGAACAGGT CGGCCTGCCG CTCGATCTGC TCGACCGCTT TCCGCATCAG CTCTCCGGCG GCCAGAAAGC CCGGGTCGGC ATCGCCCGCG CCATCGCGCT GCAGCCGAAG CTGATCATTC TCGACGAACC CACCGCCGCG CTCGACGTCT CCGTGCAGGC CGTGGTGCTG AATTTGCTGC AGGACCTGAA ACAGTCGATG GGGATGAGCT ACCTGTTCGT GTCGCACGAT CTCAACGTGG TGCGGCTCTT GTGCGACCGC GTGATCGTGA TGCGCGCCGG CCGGATCGTC GAACAGGGAA CATCCGAGCA GGTGCTCGGC GCGCCGCAGG ACGCCTACAC CCGCGAACTG TTGACGGCGA TCCCGCATCC GCCGCTGCCG GTGACGTGA
|
Protein sequence | MTTPPLLDIR DLTVEFATRR GIVKAVQHFD ISVGKGETLA IVGESGSGKS VTSFAVMRIL DRAGRIAEGS VMFGGIDIKA ATEQQMRDLR GREISMIFQN PRAALNPIRK VGDQIEDVLR QHVQSTSSDR GEKAIAALEA VKIARPRERY HAYPFQLSGG MCQRVVIALA LACNPQLLIA DEPTTGLDVT TQKAVMDLIV ELTRSRGLST ILITHDLGLA AAYCDRVVVM EKGRVVETAL AADIFANPQH PYTKKLMRAT PRLGVSLREL LSDEERGTMA VAMPAQSTKP VMAGLDPAIH PASQDASSKA MDPRVKPGGD DQAGGEREAT LQAPRPLLVV DKLVKEYPRQ GATAVLGKLF SRGPTVEPDV FRAVDGISFT VGHGESVGLV GESGCGKSTT SMMVMRLLDQ TSGRISFDGE EIGAILPGRF ARLPQRKAIQ MVFQDPTDSL NPRFTAARAI ADPIMQLGDI KGRDALRARC EELAEQVGLP LDLLDRFPHQ LSGGQKARVG IARAIALQPK LIILDEPTAA LDVSVQAVVL NLLQDLKQSM GMSYLFVSHD LNVVRLLCDR VIVMRAGRIV EQGTSEQVLG APQDAYTREL LTAIPHPPLP VT
|
| |