Gene RPB_3524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3524 
Symbol 
ID3911326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4032284 
End bp4033912 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content70% 
IMG OID637885426 
ProductPepSY-associated TM helix 
Protein accessionYP_487130 
Protein GI86750634 
COG category[S] Function unknown 
COG ID[COG3182] Uncharacterized iron-regulated membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.617806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.525721 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCCGC AGAATGATCA GCGCGTGGCG GGACGGGGCA TCCGGCAGAG CATGTCCGGC 
CTGCACACCT GGGCCGGGCT GCTGCTCGGC TGGGTGCTGT ATGCGATGTT CCTCACCGGA
ACGGTGTCGT TCTTCAAGGA GGAGCTGTCG CAATGGATGC GGCCGGAGCA GCCGCGTCTG
ACCCAGCCGC TCGATCCAGC GGTGGTGGCG CAGCGCGTCG CCGACGAGAT CGGCGCGATC
GCGCCGGGCG CGACGCAGTG GAGCATCAAG CTTCCCGACG GACGCAGCAA CACCGTCTAC
GCGTTCTGGC GGCTTCCCGG CGGCGGACCG GGCGCGCGGG GGTTCGGACA GGATCTGTTC
GATCCGGTGA CCGGCCATCG CGTCGCGGCG CGCCTCACGC TCGGCGGCGA CTTCTTCTAT
CGCTTTCATT TCCAGTTCTA CTACATGTCG CCGTTCTGGG GACGGCTGCT CGCCGGTCTC
GCGGCGATGG CCATGCTGGT GGCGATCGTC GCCGGCGTGA TCACCCACAA GAAGATCTTC
ACCGACTTCT TCACCTTCCG CTGGGGCAAG GGCCAGCGCT CCTGGCTCGA CGCGCACAAC
GCGCTGTCGG TGTTCGGGCT GCCGTTCCAT GTGATGATCA CCTACACCGG TCTGGTGACG
CTGATGGCGC TGTACGTGCC GTGGGGCGAG CGGGCCGCCA TCAAGACGCC GGCCGAGCGC
CAGCAGCTCA GCGCCGAACT CAATGCCTTC GCCCAGGCCG GCAAGCCGAG CGGCGAGACC
GTGCCGCTGG CGTCGATCGA GGCGATGGTC CGGCAGGCGC AGCAGCGCTG GGGGACGACC
GATGCGGGGC GCGTCAACGC CACCAATCCG GGCGACGCGA CGGCCCGCAT CGCGGTCACC
CGCGGTGATG CCGGGCGCGT GTCGATGAGC CCGGACTACA TCGAATTCGA CGGCGTCACC
GGGAAACTGC TCGCCGTGCA CGATCGCGTC GGCGCCGCTG CCGAGACCCG CGGCGTGCTG
TATGCGCTGC ATCTCGGCCG CTTCAGCGAC ATCGCGACGC GCTGGCTGTA TTTCCTGGTC
AGCCTGATGG GCACCGCGAT GGTCGGCACC GGGCTGGTGC TGTGGATCGT CAAGCGGCGG
GCGAAGCTGC CCGATCCGGA GCGGCCGTAT TTCGGCTTCC GTCTGGTCGA GCGGCTGAAC
ATCGCCAGCA TCGCCGGGCT GTCGATCGCG ATGACGGCGT TCCTGTGGGG CAACCGGCTG
CTGCCGGTCG CGATGGCGGA GCGGCCGTTC TGGGAAATCC ATGTCTTCTT CATCGTCTGG
GGGCTGACGC TGCTGCACGC GCTGCTGCGG CCGGCCAAGG CGGCGTGGCG CGAGCAGCTC
TGGACGGCGG CGGCGCTGCT GGCGCTGGTC CCCGTGCTCA ACGCGGTGAC GACGCAGCGG
CCGCTGTGGC GCAGCCTGGC CGAGGGCGAC TGGGAGTTCG CCGGAATCGA GCTGATGTGC
TGGGCACTGG CGGCGCTGCA CGCCGTGCTG GCGATCCGCA CGGCGCGGCA ACCGGCCGGC
GCGGCGGCGG GGCGCAAGGC CGAGCGGCGA TCGCCGATCG GCACCGCCGC CGGCGAGACC
GCGCCATGA
 
Protein sequence
MKPQNDQRVA GRGIRQSMSG LHTWAGLLLG WVLYAMFLTG TVSFFKEELS QWMRPEQPRL 
TQPLDPAVVA QRVADEIGAI APGATQWSIK LPDGRSNTVY AFWRLPGGGP GARGFGQDLF
DPVTGHRVAA RLTLGGDFFY RFHFQFYYMS PFWGRLLAGL AAMAMLVAIV AGVITHKKIF
TDFFTFRWGK GQRSWLDAHN ALSVFGLPFH VMITYTGLVT LMALYVPWGE RAAIKTPAER
QQLSAELNAF AQAGKPSGET VPLASIEAMV RQAQQRWGTT DAGRVNATNP GDATARIAVT
RGDAGRVSMS PDYIEFDGVT GKLLAVHDRV GAAAETRGVL YALHLGRFSD IATRWLYFLV
SLMGTAMVGT GLVLWIVKRR AKLPDPERPY FGFRLVERLN IASIAGLSIA MTAFLWGNRL
LPVAMAERPF WEIHVFFIVW GLTLLHALLR PAKAAWREQL WTAAALLALV PVLNAVTTQR
PLWRSLAEGD WEFAGIELMC WALAALHAVL AIRTARQPAG AAAGRKAERR SPIGTAAGET
AP