Gene RPB_3721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3721 
Symbol 
ID3911523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4258053 
End bp4259111 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content63% 
IMG OID637885623 
Productputative DNA-binding protein 
Protein accessionYP_487327 
Protein GI86750831 
COG category[R] General function prediction only 
COG ID[COG3943] Virulence protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.299367 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCTG ACGACGGCAA GGGCGAAGGC GAACTGATCC TCTACCGCAC GGAGGATGGC 
CGCGATGCGT TGCAACTGCG GCTGGTGGAC GGGACGGTCT GGCTGACCCA GGCCGAAATT
GCGCTGCTGT TCGACACCAC CAAGCAGAAC GTCAGCCTGC ACTTCAAGAG CATCTTTGCC
GACGCCGAAC TGCCCGAGGG GGCAGTTGTC AAGGATTCCT TGACAACTGC GGCCGACGGC
AAACGCTATA CGACAAAGCT CTACAACCTC AATGCCATCC TGGCGGTCGG CTACCGCGTG
CGCAGCGCCC GCGGGGTGCA GTTCCGGCGC TGGGCCACCG AGGTCTTGAA CGACTATCTG
GTCAAGGGCT TCGTCATCAA CGACGAGCGC CTGAAGGATC CTGACGGCTT CGATTATTTC
GACGAGCTGC TCGAACGCAT CCGCGACATC CGGGCGTCCG AAAAACGGTT CTACCAGAAA
GTCCGCGACC TGTTCGCCGC CACCAGCGCC GACTACGACC CCAAGGCGGA GGCCGCCAAG
GCCTTCTTCG CCACGATCCA GAACAAGCTG GTGTTCGCCA TCACCGGGAT GAACGCCGCC
GAACTGATCG TCACGAGGGC CGATCCTGCG CGGCCGAACA TGGCGCTGAC CAGCTGGAAG
GGCGATCGCG TCCGCAAAAG CGACGTCACC ATCTCGAAAA ATTATCTCAC CGCCGACGAG
ATCAGCGACC TCAACCTGCT GACCACGGCC TTCCTGGATT TTGCCGAGCT GCGCGCCCGC
AACCGCCAGC CGACCACCAT GGCCGAATGG ATGGCGCAGA CCGATCGCTT CGTCGCCTTC
AACGAACGCG GCGTGCTGCA GGGCGCCGGC CGCGTCTCCC ATACGAGCAT GGAGCAGGTC
GTGGCCGAGC GCTTCGAGAC TTTCGACAAA CGCCGTCGCG CCGCCGAGAC CGATGCAGCC
GAGGCGGAGG CGATCAGCGA GCTGACTGAA CTCGAGCAGC AGGCGCGATC GTCGAAGCCA
CGCCCTGAGG TCAAAAAACC GCCGAAGAAG CTATCCTGA
 
Protein sequence
MNPDDGKGEG ELILYRTEDG RDALQLRLVD GTVWLTQAEI ALLFDTTKQN VSLHFKSIFA 
DAELPEGAVV KDSLTTAADG KRYTTKLYNL NAILAVGYRV RSARGVQFRR WATEVLNDYL
VKGFVINDER LKDPDGFDYF DELLERIRDI RASEKRFYQK VRDLFAATSA DYDPKAEAAK
AFFATIQNKL VFAITGMNAA ELIVTRADPA RPNMALTSWK GDRVRKSDVT ISKNYLTADE
ISDLNLLTTA FLDFAELRAR NRQPTTMAEW MAQTDRFVAF NERGVLQGAG RVSHTSMEQV
VAERFETFDK RRRAAETDAA EAEAISELTE LEQQARSSKP RPEVKKPPKK LS