Gene RPB_4403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4403 
Symbol 
ID3912218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4989077 
End bp4991398 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content65% 
IMG OID637886308 
ProductTonB-dependent siderophore receptor 
Protein accessionYP_488000 
Protein GI86751504 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4774] Outer membrane receptor for monomeric catechols 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor
[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.109366 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT CAGTTGGTCC GGCTGCTCTG TTTGTCTCGA GCCTCAGCGC GCTGTCGCTT 
TCCGAAAGCG CACTGGCACA GGCGCCGTCT TCGCCGGCCG CCACGACGCT CGCACCGGTC
GAAATCATCG CCCCGCAGGT GCGGCCGCGG CCCCCGGGCC GCGTTCGCGC TTCGCAGAAT
CCCCGTCGCG GAGCGGCGAC CCGGCCACGC CCGCGTGAGG TTGCAACCCC GACGTCCGCT
CCTCCCGTGC CCGCCGTCCC CCCGCAGACC GCAACCGTCG GGCAGCCCCC GGTGCCCTAT
GTGGGCGGAC AGGTCGGAAC CGGCGCGCGG CTTGGTTTCC TGGGAAACAC CTCCGTCTTC
ACGGCGCCGT TCAGCGTCAC GGGGTACACA TCGAAGCTCA TGGAGGATCA GCAGGCGCGC
AGTGTCGCGG ACGTCGTCCT CAACGATCCC TCGGTGCGTA ACGACGCGCC GCCGTTCAGC
GAACGCGACT CGTTCTTTAT CCGCGGTTTT TCGGTGACCA ATCTCGACAC CGCCTATGAC
GGGCTGTTCT ACCTCGCGAA TCCGCGCCGC GCCTTCCTCG AAGGAATCGA GCGTGTCGAA
ATCCTCAAGG GCCCGAGCGC ATTGCTCAGC GGCGGCACCG GGCGCGTCGG CGGAACCATC
AATCTGATTC CGAAGCGCGC CACCGACGAA CCGCTGACGC GGCTGACGAC CAGCTACACC
AGCAACTCGC AAATCTGGAA CCACCTCGAT CTCGGTCGTC GCTTCGGAGA CAACAAGGAG
TGGGGCGTCC GCTTCAACGG CTCCTACCGC AACGGCGACA CGCCGCTGGA TCTCAATTCG
GCCGAGGTCG GTGTCGCCGC CCTCGGTCTC GACTATCGCA GCGAACGCTT CCGGGCGTCG
CTGGACCTCA ACGGCTCGAT TCAGAACATC ACGGCACCGA CGTCGCTGTT CAATTCCGCG
GCCGCGAACA TCGTCGTCCC ACCCGCGCCG AACGGCCGCA TCAATACGTC GAGCCGCGAC
GAATTCATCG ACAGCCGCTA CAAGATGATC GCCGGACGCG CCGAATACGA TCTCTTGCCG
GACACCACCA TGTACCTGGC CGGCGGTGGC AGCCAATACA ACGAGGACTT CCTCACGTCG
TCCTACCGAA TCACCAATTC GAACGGCACG GCCACCAACA CGCTCGCGGT TCAGCCCCAG
AAGCTCGAAG GATACACCGG CGAGATCGGC GTGCGTTCGA AATTCCGGAC CGGCGTCGTC
GGTCATCAGT TGAACGTCTC GGCGGTCGAA GCGAACAACG AGCTCTACCG CGGCGGCACT
CTGGGCTTCA CCTCCTTCAG CTACGTGACC AACATCTACG ATCCGGTCCG CCTGCCGCAG
GGCAGGTTCC AGACCAGCGG TTTCGCCACA TCCGACGACA GGCCCTTGCT GTCGCGGCTC
ACCGCTCGCA GCGCCGCGAT ATCCGACACG CTGTCGCTGC TCGACGACCG GCTGCTTGTG
ACGCTCGGCG GTCGCTGGCA GGACATCCTG CTGCGGGGAT TTGTGACGGC GTCGGGCCCC
ACCCTCGGCA CGGAATCGTC GCGCTATCAG GAGGCTCGTT TCAGCCCGGC CGTGGGCGCG
GTGATCCGCG CGACCGATCA GCTGTCGTTC TACGGAAACT ACATCGAGTC GCTCGAATCG
GGACCGACGG CGCCGGCCCT CGCGAACAAT CGCAATACGG TGTTTCCGCC GGTGGTCAGC
AAGCAGCAGG AGGTCGGCGC CAAATACGAT CTCGGAATCG TCGGGCTGAC GGCGTCGCTG
TTCCAGATCG AACAGCCGAA CGCCTTCACC GATCCGACCA CCAACATCTT CTCCGTCAGC
GGTCTGCAGC GCAACCGCGG CATCGAGCTG AGCGTTTTCG GCGAACCGGT CAAGGGCGTC
CGTCTGCTCG GCGGCGTCAC CCTCATGGAC GCCAAGCTCG TTTCCACGAT CGGCGGCCGC
TACGACGGCA ACGACGCGCC CGGCGTTCCG GTCACCGCGC TGAACCTCTA TGGCGAATAC
GATCTGCCCC ATTGGCTGGC GCAGGGCGTG ACGGTGACCG GCCGGGCGAT CTACACCGGC
GACGTGTTCT ACGATCAGGC GAACACGCAG ACCGTCTCCG ACTGGACACG GTTCGACATC
GGCGCGCGGT ATGCGTTCAC GGGGCCTTCG GGCAAGCCGG CCGTGCTGCG GGCCACCATC
GAGAACGTGG CCGATACGGC CTACTATCTC TCCGCCGCGC GTGGCTATCT CGCAGTCGGT
GCACCGAGGA CCTACATGGT GTCGGCGACG TTCAATTTCT GA
 
Protein sequence
MKKSVGPAAL FVSSLSALSL SESALAQAPS SPAATTLAPV EIIAPQVRPR PPGRVRASQN 
PRRGAATRPR PREVATPTSA PPVPAVPPQT ATVGQPPVPY VGGQVGTGAR LGFLGNTSVF
TAPFSVTGYT SKLMEDQQAR SVADVVLNDP SVRNDAPPFS ERDSFFIRGF SVTNLDTAYD
GLFYLANPRR AFLEGIERVE ILKGPSALLS GGTGRVGGTI NLIPKRATDE PLTRLTTSYT
SNSQIWNHLD LGRRFGDNKE WGVRFNGSYR NGDTPLDLNS AEVGVAALGL DYRSERFRAS
LDLNGSIQNI TAPTSLFNSA AANIVVPPAP NGRINTSSRD EFIDSRYKMI AGRAEYDLLP
DTTMYLAGGG SQYNEDFLTS SYRITNSNGT ATNTLAVQPQ KLEGYTGEIG VRSKFRTGVV
GHQLNVSAVE ANNELYRGGT LGFTSFSYVT NIYDPVRLPQ GRFQTSGFAT SDDRPLLSRL
TARSAAISDT LSLLDDRLLV TLGGRWQDIL LRGFVTASGP TLGTESSRYQ EARFSPAVGA
VIRATDQLSF YGNYIESLES GPTAPALANN RNTVFPPVVS KQQEVGAKYD LGIVGLTASL
FQIEQPNAFT DPTTNIFSVS GLQRNRGIEL SVFGEPVKGV RLLGGVTLMD AKLVSTIGGR
YDGNDAPGVP VTALNLYGEY DLPHWLAQGV TVTGRAIYTG DVFYDQANTQ TVSDWTRFDI
GARYAFTGPS GKPAVLRATI ENVADTAYYL SAARGYLAVG APRTYMVSAT FNF