Gene RPB_4019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4019 
Symbol 
ID3911826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4586924 
End bp4588237 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content67% 
IMG OID637885923 
Producthypothetical protein 
Protein accessionYP_487623 
Protein GI86751127 
COG category[S] Function unknown 
COG ID[COG4949] Uncharacterized membrane-anchored protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.132904 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.807924 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCTG AAGTTCTGAT CGGTGATGGC GATGGTAGAC TTTCGCCGCA TCCACTGCGG 
GCGGCCGTCT TGGGCGAGGT TCACGCGCGC CCGTTCACTG CGCTCGCAGT GCCGGCGCGG
GTGCTGCATT TCGCGTTCGA CACCTCGGGC GAGAAGGCCA AGGCCGATCG CATCGCGCTG
ACGAAATTCT GCGAATCACG CGGGCTGCAG CCGCCGCCGT CCAACGAGAA GCATCACCGC
GCCTCGTTCG GCACGACCAT GCTGCGCTGG GAACAGCATT CCGAATTCAC CACCTACACC
TGGGAATTCA CCGCCGACCC GGTCGCGATG CCGTTTCATC CGGAGGCCTC GTCGCTGGCT
TCGCCGATGC GGCTGGTGCC CCAGCCCGGG CCGTTGCTGG TCGCGGTCGA TCTGCATGCG
CTGCCGGACG ATCCGCCGCG CACCGCGCCG GAGCGATTGT TCGATCGCGC CAGCCTCGCT
GTCGCGGAGA ATTCCGACGG CGCGGCGGTC TATGCCACCG ATTTCCAGCC CGGTCCCTCG
GGCTTCGTGC GGGTGCTGGT GATCGATCGC GGCATGGCGC CGGAGCGCGC CGGAGCGCTG
GTGCAGCGTG TGCTCGAAAT CGAGACCTAT CGCACGCTGG CGCTGCTCGG CCTGCCGGAA
GCGCAGCGGC TCGGTCCCTC GATCAGCAAC GGCGAGCGCC GCCTCGCCGA AGTCACCGCC
GAAATGCGCA AGGCGGCCGA TCTCGCCATC AACAACAGAC TGCTGCAGGA ACTGACCGAA
CTCGCCGCCG AGGTCGAAGC CGGCGCCGCC GCCAGTCTGG GCCGCTTCAG CGCCAGCCGC
GCCTATGAAG AGATCATGAC CGGCCGGCTG GCGACGCTCG GCGAACGCAA GGTCGGCGGC
CTGCCGACCT GGTCGTCGTT CCTCGCCCGC CGGATGAAGC CGGCGATGCG CACCTGCACC
ACCACCGAGG CGCGACAATC CGACCTGTCG CTGAAACTCG CCCGCGCCGC CAACCTGCTG
CGAACCCGCG TCGACGTCGA GCTCGAACAT CAGAATCAAG AGCTGCTGAA ATCGATGAAC
GCGCGGACGC GGCTGCAATT GCGGCTGCAG GCCACCGTCG AAGGCCTCTC CACCGCGGCG
ATCACCTACT ACGTGGTCGG GCTGTTCGGT TATTTGGTGA AGGGTCTGCA CGATTCCGGC
CAGATCACGG TCGAGCCGAG CCTCGTCACC GCGGGTTTCG TGCCGATCGC CGCGTTCTCG
ATCTGGTGGA CGGTGCGCAG CATCCGCAGG AAACACATCG CGAGCGAGGA TTGA
 
Protein sequence
MTAEVLIGDG DGRLSPHPLR AAVLGEVHAR PFTALAVPAR VLHFAFDTSG EKAKADRIAL 
TKFCESRGLQ PPPSNEKHHR ASFGTTMLRW EQHSEFTTYT WEFTADPVAM PFHPEASSLA
SPMRLVPQPG PLLVAVDLHA LPDDPPRTAP ERLFDRASLA VAENSDGAAV YATDFQPGPS
GFVRVLVIDR GMAPERAGAL VQRVLEIETY RTLALLGLPE AQRLGPSISN GERRLAEVTA
EMRKAADLAI NNRLLQELTE LAAEVEAGAA ASLGRFSASR AYEEIMTGRL ATLGERKVGG
LPTWSSFLAR RMKPAMRTCT TTEARQSDLS LKLARAANLL RTRVDVELEH QNQELLKSMN
ARTRLQLRLQ ATVEGLSTAA ITYYVVGLFG YLVKGLHDSG QITVEPSLVT AGFVPIAAFS
IWWTVRSIRR KHIASED