Gene RPB_3552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3552 
Symbol 
ID3911354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4065589 
End bp4067802 
Gene Length2214 bp 
Protein Length737 aa 
Translation table11 
GC content65% 
IMG OID637885454 
ProductTonB-dependent receptor 
Protein accessionYP_487158 
Protein GI86750662 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0127436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCCG CTCGTCCGCG GTCGCGCAGC CTCGTGCTGC TGCAGGCCAC TGCGCTCGTT 
TCCACTTGTC TTGTTTCGCT GCCGGCGATG GCACAGAGCC CTCGCGACGG CACTTTGCCG
CCGGTGACGG TGCAGGCGCC GGATCAGGCC CGCCCGGCCG TGCCGCGCGC GCAGCGCAGC
CCGTCGCGGG CGACGCGAGC GGTGCGCGCC ACCAGCCCGG CGCCGTTGCC GGCCGCCGCA
CCGGTCGACA GCGCGCAGGC GGCGAGAGGC CCGGCGTTGA CCGTGCTCAC GGTGCAGCAG
GCCTTGAACG ACATTCAGCA GACGCCCGGC GGCGTCGCGC TGGTGCCCGC CAACGCCTAT
CGCAACTCGA CGGTGTCCAA CACCATCAAG GACATTCTCG ACTACGTGCC GGGCGTATTC
GCGCAGCCGA AATGGGGCGA CGACACCAGG CTGTCGATCC GCGGCTCGGG GCTGTCGCGC
AACTTCCATC TGCGCGGCGT GCAGCTCTAC ATGGACGGCA TTCCGATCAA CACCGCGGAC
GGCTATGGTG ATTTTCAGGA GATCGACCCC ACCGCGTACA AATACGTCGC GGTCTACAAG
GGCGCCAACG CGCTGCAGTT CGGCGCCAAT TCGCTCGGCG GCGCGATCAA TTTCGTCACC
GCGACCGGCC GCGATCCGTT CCCGAACGGC GTGAGCGTCG ACGCCGGTGC GTTCGGCTAT
CGCCGGCTGC AGGCCAATGC CGGTGGCGTC AACGGCCCGT GGGACGGCTA TGTCACCGCC
TCGACGCAGG CGGCTGAAGG TTTCCGCAAC CACAGCGACG GCGAGGCCCA CCGGCTCAGC
GCCAATATCG GCTACCAGAT CACGCCGGAC ATCGAGACCC GGTTCTATCT CAACGCCAAT
CACGTCCGGC AGCGGATCCC CGGCAGCGTC ACCAAGACCT CGGCGCTGAC CTCGCCGGAA
ACAGCCGCAG CCAACAACGT CGCGCTCGAC CAGCAGCGCA ACATCGACAC GGTGCGGCTC
GCCAACAAGA CCACGATCCG GTTCGACAAC ACGGTGGTCG ATTTCGGCGC CTTCGGCGTC
GATCGCCACC TGATGCATCC GATCTTCCAG TGGCTGGACT ATCGCTATCA GGACTATGGC
GGCTTCGCCA AAGTCACCGA CGATCGCTTC ATCGGCGGCT ATCGCAACCG TCTGGTCGCC
GGCGTCAACC TGCTCAACGG TCGGATCGAC AACAAGCAAT ACGTCAACCT CGCCGGGCAG
AAGGGTGCGC TGGCGTTCTC GTCGATCGAC AAATCCACCA ACACGTCGTT CTACGTCGAG
GACTCGTTCT ATTTCCTGCG CAACGTTGCG CTGGTCGCCG GCACGCAGTT CCTGCACGCC
ACCCGTGACC GGGTCGTCCG ATTCACGTCG AACAACGATC CGTCGGGGAC GACGGAGTTC
AACTTATGGA GTCCCAAGGC TGGGTTGTTG TGGGACATCG ATCCGACCTG GCAGGCGTTC
GGGAACATCT CGCGGAGCGC CGAAGTGCCG AGCTTCGGCG AGAGCTCTGC AGCGCAGTCG
ATTCCCTTCA CCAGCATCCG GCCGCAGCGT GCGACGACTT ACGAGATCGG CACCCGTGGC
CGTCGGCCCG ATCTGGTCTG GGAGCTGACC GGCTATCGCG CCGAGATCAA GGACGAGCTG
CTCTGCATCT ACAGTTCGTT AGGCAATTGC AACGTGACCA ACGCCGATCG CAGCGTGCAT
CAGGGCATCG AGGCCGGGCT CGGCGTCGCG CTGTTCAAGA ACCTGTTCGA GCGAGGACAC
GCGCCCGACC GGCTGTGGCT GAACATGGCC TACACGCTGA ACGACTTCCG CTTCGACAAC
GATGCGACCT TCGGCAACAA TCTGCTGCCC GGCGCGCCGC GGCACTATCT GCGCGCCGAA
TTGCTCTACA AGAATCCGAA CGGGTTCTAT GCCGGCCCGA ACGTGGAGTG GGTGCCGCAG
GCCTATTTCG TCGACAGCGC CAACACGCTG AAGACCGAGC CCTATGCGCT GCTCGGCCTC
AAGGCGGGGA TCGACAATGG CGGGCCGTAC TCGATCTATA TCGAGGGCCG CAACCTCACC
AACAAGGCCT ACATCGCCTC CGCCAGCATC ATCGACAAGG CCACCGCGAC CTCGCCTCTG
TTCGAACCGG GCACCGGACG CGCGGTCTAT GCCGGCTTCA AGGTGCGCTG GTGA
 
Protein sequence
MSSARPRSRS LVLLQATALV STCLVSLPAM AQSPRDGTLP PVTVQAPDQA RPAVPRAQRS 
PSRATRAVRA TSPAPLPAAA PVDSAQAARG PALTVLTVQQ ALNDIQQTPG GVALVPANAY
RNSTVSNTIK DILDYVPGVF AQPKWGDDTR LSIRGSGLSR NFHLRGVQLY MDGIPINTAD
GYGDFQEIDP TAYKYVAVYK GANALQFGAN SLGGAINFVT ATGRDPFPNG VSVDAGAFGY
RRLQANAGGV NGPWDGYVTA STQAAEGFRN HSDGEAHRLS ANIGYQITPD IETRFYLNAN
HVRQRIPGSV TKTSALTSPE TAAANNVALD QQRNIDTVRL ANKTTIRFDN TVVDFGAFGV
DRHLMHPIFQ WLDYRYQDYG GFAKVTDDRF IGGYRNRLVA GVNLLNGRID NKQYVNLAGQ
KGALAFSSID KSTNTSFYVE DSFYFLRNVA LVAGTQFLHA TRDRVVRFTS NNDPSGTTEF
NLWSPKAGLL WDIDPTWQAF GNISRSAEVP SFGESSAAQS IPFTSIRPQR ATTYEIGTRG
RRPDLVWELT GYRAEIKDEL LCIYSSLGNC NVTNADRSVH QGIEAGLGVA LFKNLFERGH
APDRLWLNMA YTLNDFRFDN DATFGNNLLP GAPRHYLRAE LLYKNPNGFY AGPNVEWVPQ
AYFVDSANTL KTEPYALLGL KAGIDNGGPY SIYIEGRNLT NKAYIASASI IDKATATSPL
FEPGTGRAVY AGFKVRW