Gene RPD_4046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4046 
Symbol 
ID4024563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4495439 
End bp4497808 
Gene Length2370 bp 
Protein Length789 aa 
Translation table11 
GC content68% 
IMG OID637964249 
ProductTonB-dependent receptor 
Protein accessionYP_571166 
Protein GI91978507 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.236751 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTCC GCTTCAAGCG CGCCTTCCTG GTCGGAAGCG CAGCCTTCGC GGCCTCCGAA 
GGCTTGCCGG CATCGCAGGC CCTCGCACAG CAGGCCGCCA CCGCATTGCC GGAAGTCGTC
GTCACCGCGC CGAGCCCGAT CGTGCGACGC CATTCGGCCC CGCCGTCACG ACCCGCGACG
CGGGTCGCCG CCCCCGCGCG CCAGCGCGGC ACCGCTCCGG CCGAGGCGCC GCCCGTCGTC
GCCCAGCCGG CGCCGCCGCC CCCCGGCGCG CTGCCGATCT TCACCGATCA GTTCGCGACC
GTTACCGTGG TCCCGAACGA GGAGATCCGG CGCAATGGCG GCGGCACGCT CGGCGACCTT
CTGAACAACA AGCCGGGCAT TACCGGCTCC GGCTACGCGC CGGGGGCCTC CAGCCGGCCG
ATCATCCGCG GCCTCGACGT CAACCGAGTC GGCATCGTCG AGAACGGCAT CGGCAGCAAC
GGCGCGTCCG ATCTCGGCGA AGACCATTTC GTCCCGATCG ATCCGCTCGC GACCAACCAG
GTCGAAGTGA TCCGCGGCCC GGCGACGCTG CGCTACGGCT CGACGGCGAT CGGCGGCGTG
GTCAGCGCCA GCAACAACCG CATCCCCGAC GCGCTGCCGC CCTGCGCGAC GCCGTTCCAG
AGCTACGGCC TGCCGGTGAA GGCGCCGGCT GCGATCGGCG GCGCGGCGGG CTGCATGAAC
GCCGAGGTCC GCAGTGCGAT GAGCTCGGTC GATCGCGGCG TCGAAAGCGC GGTGCTGCTG
GATGCCGGCG GCAACAACGT CGCGGTCCAT GCCGACGTGT TCGGCCGCAA TGCCGGCGAC
TATAACGTGC CGAGCTATCC GTATCAGGCG CCGGGACTCC CCTTCAACGG ACGCCAGCCC
AACTCCGCGA CGCAGGCGAC CGGCGCCTCG ATCGGCGGCT CCTATCTGTT CGACGGCGGC
TTCATCGGCG CGGCGATCAC GCAGAACAAT TCGGTCTATC GGATTCCCGG GACCGAGGGC
GCCGAATTCG GGACGCGGAT CGATGCGAAG CAGACCAAGG TCACCGCCAA GGGCGAGTAT
CGCCCGGATG CGGCGGCGAT CGAGGCGATC CGGTTCTGGG TCGGCGCCAC CGACTACAAG
CACAATGAGA TCGGCCTCGC CGTCGCAGGC GACCCGGCGT CCGATGGCGT GCGGCAGAGC
TTCACCAACA AGGAGCAAGA GGGCCGCCTC GAGGCGCAAC TGACGCCGTT CAATGCCCGC
TTCGCCACGG TCACGACGGC GGTGGGCGTG CAGGCCAGCC ACCAGGAACT GACAGCGCCC
AGCCCCGACG ATCCGACCAG CCCGCTCAAC GGGCTGTTCG ATCCCAATAA GAACACCAAG
CTCGCCGGCT ACGTCTTCAA CGAGCTGCGC TTCACCGAGA GCACCAAGGC GCAGGTGGCC
GGACGAATCG AGCACGTCGA CCTGTCCGGA ACGACGCCGG CTTTCGTACC CGGCCTGTTC
GACCTCTCCA CCGACCCCGG CGCGATCGGC CCTGCGACGT CGCGCAACCT GTCGTTCACG
CCGAAGAGCT TCAGCCTCGG CCTGATCCAG GCGTTGCCGT GGGGGCTTTC GGCCAGCATC
ACCGGGCAAT ATGTCGAGCG TGCGCCGAAG CCGGCGGAGC TGTTCTCGCG CGGCGGTCAC
GACGCCACCA CCACCTTCGA CATCGGCAAC CCCAATCTGG GGATCGAGAC GGCAAAATCG
GTCGAGGTCG GTTTGCGTCG GGCGGACGGC CCGTTCCGCT TCGAGATCAC CGCTTACTAC
ACGCAGTTCA GCGGCTTCAT CTATCGGCGG CTGACCGGCA ATAGCTGCGA CGACGTCTCT
TGCGTCGATC CGGCGACCGG CACGCTGGAA TTGAACCAGG CGATCTACGC GCAGCGCGAC
GCCACCTTCA GAGGCGGCGA ATTCCAGAGC CAGCTCGACG TGGCGCAAAT CTATGGCGGA
ACCTGGGGCA TCGAGAACCA GTTCGATGTC GTGCGGGCCA CCTTCGCAGA CGGCACCAAT
GTGCCGCGGA TCCCGCCGCT GCGGGTCGGC GGTGGATTGT TCTGGCGTGA CGCCAACTGG
CTGACCCGGA TCAACCTGTT GCACGCCTTC GCCCAGAACG ACGTCGCGCC GATCGCCGAG
ACAACCACCT CGGGCTACAA TCTGCTCAAG GCGGAGATCA GCTACCGGAC CAAGCTCGAT
CCGAACGCAT GGGGCGCCCG CGAGATGCTG GTCGGCCTTG TCGGCAACAA TCTGCTCAAC
GAGAACATCC GCAACGCGGT GTCCTACAGC AAGGACAACG TGCTGATGCC CGGCATCGGC
GTGCGGGCGT TCGCGAATCT GAAGTTCTGA
 
Protein sequence
MSLRFKRAFL VGSAAFAASE GLPASQALAQ QAATALPEVV VTAPSPIVRR HSAPPSRPAT 
RVAAPARQRG TAPAEAPPVV AQPAPPPPGA LPIFTDQFAT VTVVPNEEIR RNGGGTLGDL
LNNKPGITGS GYAPGASSRP IIRGLDVNRV GIVENGIGSN GASDLGEDHF VPIDPLATNQ
VEVIRGPATL RYGSTAIGGV VSASNNRIPD ALPPCATPFQ SYGLPVKAPA AIGGAAGCMN
AEVRSAMSSV DRGVESAVLL DAGGNNVAVH ADVFGRNAGD YNVPSYPYQA PGLPFNGRQP
NSATQATGAS IGGSYLFDGG FIGAAITQNN SVYRIPGTEG AEFGTRIDAK QTKVTAKGEY
RPDAAAIEAI RFWVGATDYK HNEIGLAVAG DPASDGVRQS FTNKEQEGRL EAQLTPFNAR
FATVTTAVGV QASHQELTAP SPDDPTSPLN GLFDPNKNTK LAGYVFNELR FTESTKAQVA
GRIEHVDLSG TTPAFVPGLF DLSTDPGAIG PATSRNLSFT PKSFSLGLIQ ALPWGLSASI
TGQYVERAPK PAELFSRGGH DATTTFDIGN PNLGIETAKS VEVGLRRADG PFRFEITAYY
TQFSGFIYRR LTGNSCDDVS CVDPATGTLE LNQAIYAQRD ATFRGGEFQS QLDVAQIYGG
TWGIENQFDV VRATFADGTN VPRIPPLRVG GGLFWRDANW LTRINLLHAF AQNDVAPIAE
TTTSGYNLLK AEISYRTKLD PNAWGAREML VGLVGNNLLN ENIRNAVSYS KDNVLMPGIG
VRAFANLKF