Gene RPB_4401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4401 
Symbol 
ID3912216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4986141 
End bp4988411 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content66% 
IMG OID637886306 
ProductTonB-dependent receptor 
Protein accessionYP_487998 
Protein GI86751502 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.78723 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0995472 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAAAT TCGCAAGTGC TGCCGTGCTG AACTGCGGCG TATCGATGTT TGCAGTGGCG 
CTTGCCGTTG CCGATGTGCC CGAAGCGCGG GCGCAGGCAA ACAACGCTTA CAGTTTCAAT
ATCCCGGCCA AATCGCGCCT CGCCGCGCTC GCGGACTTCA CCGCGGCGAC CGGGATCCAG
GTGGTGCATC AGGGGGCGGG GGCGATCGGC GGCACCTCCC CGGTGGTGAG CGGTCGCTAC
CCTGCCGATA CGGCGCTCCG GACGATTCTG GCCGGCTCCG GCCTCAACTA TCGCTTCACG
GGGCCGCGCA CCGTCGCGAT CGAGGCGCCG GGCGGCGCAG CGGGCGCGGG CGGCGCCGTG
GCCGGCAGCG GCATCGCGCT GGATACGATC AGCGTGACCG GCGCCGGACA AGGGATCGGC
CGCGACGGCG TCAGCGAGAT CAACATCACG AGCGTGGACA TCGAGCGTCG CAATGCGACC
GACGTCAAGG GCTTGTTCCG CGGCGAGCCG AGCATCCTGG TCGGCTCCTC GCTGCCGATG
TCGCAGAAGC TCTATGTGCA GGGCATCGAG GAAACCAATC TCGCGGTTTC GATCGACGGC
TCGCGGCAGA ACAACAAGGT GTTCCATCAC AACGCGACCA CGATGATCGA TCCGAGCCTG
CTCAAGGCGG TGCGGGTCGA TGCCGGCGTG GCGCCCGCCG ACGCCGGCCC CGGCGCACTC
GGCGGCGCGA TCGCCTATGT GACCAAGGAC GCGCGGGACT ACCTGCCCAA TGACGGCTTC
GGCGGCTCGA TCAAGTCGAC CTTCAACTTC AACGGCAACA CCTCGACCAA CAATCTCACC
AGCTACACGC GCCAGGGCGG CTTCGAAGCG CTCGGCTCCT TCACCTATGC CAAGGGGAGC
GAGTTCAAGG CCGGCAACGG CCAGGACGTG CTCGGCACCG CGACCAATTT CCTCAGCGGG
CTCGGCAAGA TCGCCTATCA GAGCCTCGAG GGCCACCGCT TCTCGCTGAG CCACGAGCAG
ATGCGCGACG ACGCCCTGCG GCCGTTCCGG GCCAATGCGG TGCAGATCAT CGGCGGCAAG
CCGACGCCGC TGGTGCGCCC CTACACGCTC GACCGGCAGA ACACCGTCTT CACCTACACC
AACGTCTCCC CCGACGGATG GTGGGACCCC AAATTCGTGC TGGGCTACAA TCGCTCGAAA
GCCGCGGTGG ACCAATACAC CGGCGCCACG CTCAGTACCT ACAGCTACAC CAGCCAGGGA
ATCAGCGACA GCCTCAACGG CAAGCTGGAA AACAAGTTCG CCTTCTCCAT CGGCGACGTC
GTCGCCGGCA TCGACTTCTA CCGGGACCGC GCCGAATACA TCGACGTGAG CTATCGGACG
ATGGAGAAGG CCGACAATAT CGGCGCCTAT GCCCAGGCGC GGCTGCGGCC GTGGGAACGC
ACCAAGCTCT CCTTCGGCAT GCGCGGAGAC CACCAGAATT TCAAAGGCGT CAACGGCTTC
AGCTCCAGCG ACCAGGGCTT AAGCGGGAAC GCTTCGGGCG AATACGAGCT GACCAGCTTC
CTCAGCGCCA AGGCCGGCTA TTCGCACGTC TGGGCCGGCG TGCCGCTCGC CGAGAATTAC
GTCCAGAACC CGGCGTGGAT CTACGGTGTC GGCCCGAAAT CGGTGACCTC CGACAACTAC
ACCGCCGGTC TCGTCGCTCA CTACGGGGAT TTCCGCTTCG AGGGCGGCGT GTTCCGCACC
CAGATCAACG ATGCCCGCGT GCCGCTGTGG GCCGCCAACC AGGCGCTGCG CGCCTTCGAC
GTACAGACGC AGGGCTTCCA CGTCGGCGGC ACCTACAACT GGGGCGACGG CTTCGCGCGG
GTGCGGTTCG CGCGCACCGA CGCCGAGATC GACGGCAAGC CGGCCGATAC TTATCTCGGC
CAGTACCTCA CGGCGCCGAT CGGCGACGTG CTGACCTTTC AGCTCGCGCA CACCGTCGTG
CCGTGGAATC TGACCTTCGG AGGCGACGTC GAGATCGTGT TCGACTACGA CAAGCTGCTG
AATCCCGTGA CCGGAATCGG CAAGCTCGAA GGCTACGAAG TCCTCAACGC CTTCGTCGAG
CATCGCCCGT TCGCGCTGCC GTCGCTGACG CTGCGCGGTG AAGTCAGAAA CCTGTTCAAC
AGGAACTACG CGGCCCGCGG CACCTACGGC CTCGAATACG GCACCGGCGT CGTGCGGCCT
CTGTACGAAC CCGGCCGCTC CGTGCTGGTC AGCGCCAAGC TCGACTTCTG A
 
Protein sequence
MRKFASAAVL NCGVSMFAVA LAVADVPEAR AQANNAYSFN IPAKSRLAAL ADFTAATGIQ 
VVHQGAGAIG GTSPVVSGRY PADTALRTIL AGSGLNYRFT GPRTVAIEAP GGAAGAGGAV
AGSGIALDTI SVTGAGQGIG RDGVSEINIT SVDIERRNAT DVKGLFRGEP SILVGSSLPM
SQKLYVQGIE ETNLAVSIDG SRQNNKVFHH NATTMIDPSL LKAVRVDAGV APADAGPGAL
GGAIAYVTKD ARDYLPNDGF GGSIKSTFNF NGNTSTNNLT SYTRQGGFEA LGSFTYAKGS
EFKAGNGQDV LGTATNFLSG LGKIAYQSLE GHRFSLSHEQ MRDDALRPFR ANAVQIIGGK
PTPLVRPYTL DRQNTVFTYT NVSPDGWWDP KFVLGYNRSK AAVDQYTGAT LSTYSYTSQG
ISDSLNGKLE NKFAFSIGDV VAGIDFYRDR AEYIDVSYRT MEKADNIGAY AQARLRPWER
TKLSFGMRGD HQNFKGVNGF SSSDQGLSGN ASGEYELTSF LSAKAGYSHV WAGVPLAENY
VQNPAWIYGV GPKSVTSDNY TAGLVAHYGD FRFEGGVFRT QINDARVPLW AANQALRAFD
VQTQGFHVGG TYNWGDGFAR VRFARTDAEI DGKPADTYLG QYLTAPIGDV LTFQLAHTVV
PWNLTFGGDV EIVFDYDKLL NPVTGIGKLE GYEVLNAFVE HRPFALPSLT LRGEVRNLFN
RNYAARGTYG LEYGTGVVRP LYEPGRSVLV SAKLDF