Gene RPB_3502 details


Gene Information

Locus tag: RPB_3502
Symbol:
ID: 3911304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4007012
End bp: 4009327
Gene Length: 2316 bp
Protein Length: 771 aa
Translation table: 11
GC content: 66%
IMG OID: 637885404
Product: TonB-dependent siderophore receptor
Protein accession: YP_487108
Protein GI: 86750612
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4773] Outer membrane receptor for ferric coprogen and ferric-rhodotorulic acid
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
            [TIGR03304] outer membrane insertion C-terminal signal
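
The start, end, and length fields above agree with each other if the coordinates are read as 1-based and inclusive, and if the coding sequence includes its stop codon (the gene sequence in this record ends in TGA). The following Python sketch checks that arithmetic. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS that includes its stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(4007012, 4009327)
print(length)                     # 2316, matching "Gene Length"
print(protein_length_aa(length))  # 771, matching "Protein Length"
```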


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0943385
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCCG TTATTGCCGG CGCGAACGCC GGTCGTAGAC CTGTGCTCAA AACCCTATGG 
CTGACCAGCG CGTTGATCGT GCCGATCGTT TGGCTTCCGT CTTCCGCGGC CGCACAGGGG
GCCAAGCCGG CGGCGACCGA GCTGCCCGCC GTCGAAATCA CAGCGCCGCA GGCGAGCCGG
CGGCAGGCGC AAGCGCCGCG TGACAGAAGC CGCTCCGCCA CCGGACGCTC GCGCAGATCC
GCACAGCGTG CCGCGCAAGC GCCGCAGCCG GCACAGCCGA CGCAGCGCGC CGTGTTCGAA
CGCGGCACCG ATCCGGTGCG CGGCTTCGTG CCGAGCGTCA GCGCCAGCGG CACCAAGACC
GACACCAGGC TGATCGAGAC CCCGCAATCG ATTTCCGTGA TCAGCCGTGA CAACCTCGAC
GCGCGCGGCA TCGACACCGT CGCGCAGGCG CTGCAATACA CCGCCGGCGT CGCGGTGCAG
ACGTTCGGCG GCGACCCACG CTACGATCAG GCGCGCATCC GCGGCTTCGA AACCAACGGC
TTCTCCAACT TCCGCGACGG CCTGCGCGAC ACCGCCAACG GCTCGGCCTA TTTCTCGGTG
TTCCGCAACG AGCCCTACGG CGTGGAGCGC ATCGACGTCG TCAAGGGCCC GAGCTCGGTG
ATGTACGGCC AGAGCCCGCC CGGCGGCCTG ATCGATCTGA TCAGTAAGCG GCCGACCGAC
CAGGCATTCG GCGAGGTGGT CGGCCTGGTC GGCAGCGCCG ACCGGCTGCA GGGCGCGTTC
GACGTCGGCG GCCCGGTCGA CAAGGACAAG ACCGTGCTCT ATCGGCTCAC CGGCGTGTTA
CGGGATTCCG ACGCGCAGGT CGCGAAGTTT TCCGACAAGA TCAAGGACGA CCGCGCCTAT
ATCGCGCCGG CCATCACCTG GCGGCCGACC AACGACACCA CCCTGACGTT CCTCAGCGAC
TATCAGCACG ACGTCACCGG CATCGCCAGC CCGGTGTCGG TCGCCACCGT CCGCGGCGGC
AAGGTCGTCG ACATGCGCCC GCTGCCGCTG TATCTCGGCG ATCCGTCGTA CAACACCTTC
GATCAGACCC AGTACCGGAT CGGCTATCAG TTCGAACACC GCTTCAGCGA CGATCTGATC
GTGCGCTCGC GGGCGCGCTA CGGCCACGTC GATCTGGAGT ATCGTTCGAT CACCCTGGCC
GGCACGCCGC TCGACACCCA GACCGTGTTT GCGCGGAACG CGCGCCGCGT GCTCGAGAAC
AGCGACAGCT TCGGCACCGA CAACCACGTC ATCGCCAAGA CCACGACCGG CCCGCTGCAG
CACACCATGC TGTTCGGGAC CGACTATCAG GCGTTCAAGC TCGAAGGCGA ATCGTTCGGC
GGCCTGGCGC CGTCGCTCGA CGTGCTCAAT CCGGTCTACG GCCAGGCGGT GGCGATGCCG
ACGCTCCGGC TGCAGAGCTA CAAGCAGAAC CTGAACCAGG CCGGCGTCTA TCTGCAGGAC
CAGATCAAGC TGCAGAACTG GATCCTGACG CTCGGCGGCC GCTACGACGC GGCTCAACAG
ACCATCCTCA ACCGCCTCAC CGGCGTGCCG CAGCTCAACG ACGACACCGC CTTCACCAAG
CGCGCCGGCC TGACCTATCT GTTCGACAAC GGCCTCGCGC CCTATGTCAG CTATTCCGAA
TCGTTCCTGC CGACAGGCGG CGTCGATTTC AACTCCAACG CCTTCAAGCC CACCAAGGGC
AAGCAATACG AGGGCGGCAT CAAGTTCCAG CCGAACCGCG ATCTGCTGTT CACCGCGGCG
GTGTTCGACC TCACCCAGGA CAACGTGCTG ACTGCCGATC CGAACCATCT GAACTACAGC
ATCCAGACCG GCCAGGTGAA TTCGCGCGGC CTCGAGCTGG AGATGCTGGC CAAGCCGGTG
CCGGGACTGA ATGTTCTGGC GAGCTACACG CTGCAGAACC TGAAGAATAC CCAGAGCAAC
AACGGCGACG TCGGCAAGAT GCCGGTGCTG ATCCCCCGCC ACATGGCGTC CGCCTTCGCC
GACTACACGC TGCAGAGCGG GCCGCTCGCC GGATGGGGCT TCGGCGCCGG CTTCCGCTAC
ATCGGCGAGT CCTACATGGA CATCCTCAAC ACGTTCACCA ACGACGCCTA TACGGTGTTC
GACGCCGGGC TGCATTATCG CCAGCCGAAG GGCATCAACC TGGCGCTCAA CGTCAAGAAC
ATCGCCGACA AGGACAACGC GATGTGCACC GCCACCGGCG GCTGCCAGTA CATCGCCCCG
CGGGTGATCA CAGCGACCGC CAGCTATCGC TGGTGA
 
Protein sequence
MKSVIAGANA GRRPVLKTLW LTSALIVPIV WLPSSAAAQG AKPAATELPA VEITAPQASR 
RQAQAPRDRS RSATGRSRRS AQRAAQAPQP AQPTQRAVFE RGTDPVRGFV PSVSASGTKT
DTRLIETPQS ISVISRDNLD ARGIDTVAQA LQYTAGVAVQ TFGGDPRYDQ ARIRGFETNG
FSNFRDGLRD TANGSAYFSV FRNEPYGVER IDVVKGPSSV MYGQSPPGGL IDLISKRPTD
QAFGEVVGLV GSADRLQGAF DVGGPVDKDK TVLYRLTGVL RDSDAQVAKF SDKIKDDRAY
IAPAITWRPT NDTTLTFLSD YQHDVTGIAS PVSVATVRGG KVVDMRPLPL YLGDPSYNTF
DQTQYRIGYQ FEHRFSDDLI VRSRARYGHV DLEYRSITLA GTPLDTQTVF ARNARRVLEN
SDSFGTDNHV IAKTTTGPLQ HTMLFGTDYQ AFKLEGESFG GLAPSLDVLN PVYGQAVAMP
TLRLQSYKQN LNQAGVYLQD QIKLQNWILT LGGRYDAAQQ TILNRLTGVP QLNDDTAFTK
RAGLTYLFDN GLAPYVSYSE SFLPTGGVDF NSNAFKPTKG KQYEGGIKFQ PNRDLLFTAA
VFDLTQDNVL TADPNHLNYS IQTGQVNSRG LELEMLAKPV PGLNVLASYT LQNLKNTQSN
NGDVGKMPVL IPRHMASAFA DYTLQSGPLA GWGFGAGFRY IGESYMDILN TFTNDAYTVF
DAGLHYRQPK GINLALNVKN IADKDNAMCT ATGGCQYIAP RVITATASYR W
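
The protein sequence can be reproduced from the gene sequence, and the GC content recomputed, with a short script. The Python sketch below is one way to do this, not the method used by the database. It strips the spacing and line breaks used in the listing, then translates codon by codon until the first stop codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code. The alternative start codons that table 11 allows (for example GTG or TTG) are not handled here, which does not matter for this gene because it starts with ATG. All names in the code are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAAATCCG TTATTGCCGG CGCGAACGCC GGTCGTAGAC CTGTGCTCAA AACCCTATGG"
print(translate(first_line))  # MKSVIAGANAGRRPVLKTLW
```

Passing the complete gene sequence to `translate` and `gc_content` allows the results to be compared with the protein sequence and the GC content listed in this record.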