Gene RPB_4052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4052 
Symbol 
ID3911859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4622503 
End bp4624341 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content67% 
IMG OID637885956 
ProductABC transporter related 
Protein accessionYP_487656 
Protein GI86751160 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0831351 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCC CACCGCTCCT CGACATCCGC GACCTCACCG TCGAATTCGC CACCCGCCGC 
GGCATCGTCA AGGCGGTGCA GCATTTCGAT ATTTCGGTCG GCAAGGGCGA GACGCTGGCG
ATCGTCGGCG AATCCGGCTC GGGCAAATCG GTGACGTCGT TCGCGGTGAT GCGCATCCTT
GATCGCGCCG GCCGGATCGC CGAGGGCTCG GTGATGTTCG GCGGCATCGA TATCAAGGCC
GCGACCGAGC AGCAGATGCG CGACCTGCGC GGCCGCGAGA TCTCGATGAT CTTCCAGAAC
CCGCGTGCCG CGCTCAATCC GATCCGCAAG GTCGGCGACC AGATCGAGGA CGTGCTGCGC
CAGCACGTTC AATCCACCTC GTCCGACCGC GGCGAGAAGG CGATCGCGGC TCTCGAGGCG
GTGAAGATCG CGCGGCCGCG CGAGCGCTAT CACGCCTATC CGTTCCAACT CTCCGGCGGC
ATGTGCCAGC GCGTGGTGAT CGCGCTGGCG CTGGCCTGCA ATCCGCAATT GCTGATCGCC
GACGAGCCGA CCACCGGCCT CGACGTCACC ACCCAGAAGG CGGTGATGGA CCTGATCGTC
GAACTCACGC GTAGTCGTGG CCTGTCGACC ATCCTGATCA CCCACGACCT CGGCCTCGCC
GCCGCCTATT GCGACCGCGT CGTGGTGATG GAGAAGGGCC GCGTGGTCGA GACCGCGCTG
GCCGCCGACA TCTTCGCCAA CCCGCAGCAC CCCTACACGA AGAAGTTGAT GCGCGCGACG
CCGCGGCTGG GCGTGAGTTT GCGCGAGTTG CTCTCCGACG AAGAACGCGG GACGATGGCG
GTCGCGATGC CGGCGCAATC AACCAAGCCC GTCATGGCCG GGCTTGACCC GGCCATCCAT
CCCGCTTCGC AAGACGCTTC TTCGAAGGCG ATGGACCCCC GGGTCAAGCC CGGGGGTGAC
GACCAGGCAG GCGGGGAGCG CGAAGCGACT CTGCAGGCAC CGCGGCCCCT CCTCGTCGTC
GACAAGCTCG TCAAGGAATA TCCCCGCCAG GGCGCGACCG CCGTGCTCGG CAAACTGTTT
TCGCGCGGTC CCACGGTCGA GCCCGATGTC TTCCGCGCCG TCGACGGCAT CAGCTTTACG
GTCGGCCATG GCGAGAGCGT CGGGCTTGTC GGCGAATCCG GCTGCGGCAA GTCGACGACC
TCGATGATGG TGATGCGGCT GCTCGATCAG ACCTCGGGGC GGATCAGTTT CGACGGCGAG
GAGATCGGCG CTATTCTTCC GGGACGTTTC GCGCGGCTGC CGCAGCGCAA GGCGATCCAG
ATGGTGTTCC AGGACCCGAC CGACAGCCTC AACCCGCGCT TCACCGCCGC ACGCGCCATC
GCCGATCCAA TCATGCAGCT CGGCGACATC AAGGGCCGCG ACGCGCTGCG CGCACGCTGC
GAGGAATTGG CCGAACAGGT CGGCCTGCCG CTCGATCTGC TCGACCGCTT TCCGCATCAG
CTCTCCGGCG GCCAGAAAGC CCGGGTCGGC ATCGCCCGCG CCATCGCGCT GCAGCCGAAG
CTGATCATTC TCGACGAACC CACCGCCGCG CTCGACGTCT CCGTGCAGGC CGTGGTGCTG
AATTTGCTGC AGGACCTGAA ACAGTCGATG GGGATGAGCT ACCTGTTCGT GTCGCACGAT
CTCAACGTGG TGCGGCTCTT GTGCGACCGC GTGATCGTGA TGCGCGCCGG CCGGATCGTC
GAACAGGGAA CATCCGAGCA GGTGCTCGGC GCGCCGCAGG ACGCCTACAC CCGCGAACTG
TTGACGGCGA TCCCGCATCC GCCGCTGCCG GTGACGTGA
 
Protein sequence
MTTPPLLDIR DLTVEFATRR GIVKAVQHFD ISVGKGETLA IVGESGSGKS VTSFAVMRIL 
DRAGRIAEGS VMFGGIDIKA ATEQQMRDLR GREISMIFQN PRAALNPIRK VGDQIEDVLR
QHVQSTSSDR GEKAIAALEA VKIARPRERY HAYPFQLSGG MCQRVVIALA LACNPQLLIA
DEPTTGLDVT TQKAVMDLIV ELTRSRGLST ILITHDLGLA AAYCDRVVVM EKGRVVETAL
AADIFANPQH PYTKKLMRAT PRLGVSLREL LSDEERGTMA VAMPAQSTKP VMAGLDPAIH
PASQDASSKA MDPRVKPGGD DQAGGEREAT LQAPRPLLVV DKLVKEYPRQ GATAVLGKLF
SRGPTVEPDV FRAVDGISFT VGHGESVGLV GESGCGKSTT SMMVMRLLDQ TSGRISFDGE
EIGAILPGRF ARLPQRKAIQ MVFQDPTDSL NPRFTAARAI ADPIMQLGDI KGRDALRARC
EELAEQVGLP LDLLDRFPHQ LSGGQKARVG IARAIALQPK LIILDEPTAA LDVSVQAVVL
NLLQDLKQSM GMSYLFVSHD LNVVRLLCDR VIVMRAGRIV EQGTSEQVLG APQDAYTREL
LTAIPHPPLP VT