Gene RPB_3149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3149 
Symbol 
ID3910950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3601499 
End bp3603856 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content66% 
IMG OID637885051 
ProductTonB-dependent receptor 
Protein accessionYP_486756 
Protein GI86750260 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTGG TTGCGCAGAC GAAGACCGAA ACAGCGATGG ACGCCAATCG GCGCGCGAGC 
CGCCGGGGAT CGTTGCGGAC GATGACCGCC GCCGTGCTGT TGTCGTCGAC CAGCCAGGCC
TGCCTCGCGC AGTCGGCGAC GCCCGGGTCG ACGATCACGC TGGAGCCGGT CGTCGTCGAA
GCGCCGCGGC AGCCGGTTCG ATCGGTCGTC CGTCAGCGCT CGCCGGCTCC GCGTTCGGCG
GCGACCGTGA TGCCCGCGGC GACTCCCTCG TTGGCGCTGC TTGCGGATGT CGTCCGTCAG
CGCTTCGAGA TCCTGCCGGG CGGCGTCGCG CTGGTGTCGC GGCAGGACAT GGCCAATCGC
GGCAATCCGA CGCTCGCCAA CAGCCTGAGC GGTGTCCCCG GACTGATCGT GCAGAACTTC
CTCGGCTCCA ACGATCAGCC GCGGATTCAG ATGCGCGGCT CCGCCCAGCA AAATCCGGCC
GAACGCGGCG TTCTCGTGCT CAGCAACGGG CTGCCGATCA ATCGCGCCGA CGGCTCCTAC
ATCATCGGCT TCGCCAATGC GCAACAGGCC GAGTCGATCG AGGTGTATCG CGGCTATATG
GCCAACCGTC TCGGCGCCAC GGTGCTGAGC GGCGCGATCA ACTTCGTGTC GCCGACCGGA
TCGAGCCAGC CGGGCACCCA GATCGGCGTC AGCGGCGGCA GCTTCGGCCA GATCAATAGC
AGTGGGCAGG TCGGCGGCAA GAAGGACAAT GTCGACGCGT TCATCCAGTT CGACACCAGC
CGCCGCGACG GCTATCGCGG CTACAATTCG TCGGAGCGCG TCAGCGTCAA CGGCAATGTC
GGCGTCGCGC TGTCGGAGAA CGTCAAGACC CGCTTCTTCA TGGGCTATAC CGATCTCGGC
TTCGACGTCC CGGGGCCGGT GAACAAGACC ACGCTGTACG CCAATCCGAA GCAGGTCAAC
CCGGGCCCGA CGGTGGTCGG AGGCGTTGCT GTCAATCCCG GTCCCAACGC CGTGCGCGAC
AAGCCGCGGC GCGAGGCCAG CCAGTTCATG GTCGGCAACC GCACCACGGC TGTGTTCGAC
GCGCATCTGT TCGACGTCGC GATGGGCTAC ATCTACACCG ACGACACGTT CCGCTTCCCG
ATCTCCTCGG GTGTGCGCAC CACGCAGGGC GGCGACTTCA CCGGCGTGGC CCGCTACGCC
TACAATCCGG CGGCGGCACT GCTGCCGCTG TTCGAGACCA CCGCGCAATA CACCGTTGGT
TCGGCCGATC GCGGCTACTA CCTCAACCAG AGCGGCCAGA CCGGGGCGCA ATTCGGCGCC
AACCGGCTGA ACGCCCAGAC GTTGTCGCTG TACACCGGCG CCAACATTCC GGTCTGGCAC
CAACTCGTGG TGTCGCCGTC GATCTCCTAC GCCTATGCGA CGCGGGACAA CGACGATGTG
TACGGTTCGG CGCGACGGCC GACGATCGCC TACAACCCGG CGAATCCCAC GGTGCTGCTG
CCGAACGGAT CTGTGGCGAC GCAAAGCACC AGCTATTCGC GCAACTATTC AGGCTGGAGT
CCGAGCCTGG CCTTGAGCTA TCGGCCGGAC GCGGTGCAGA CCTTCTTCAT CGCCGGCAGC
CACAGCTTCG AGCCGCCGAC TCACGACGAT CTGATCGCGA CGATCAACGG AACGCCGAAT
TCGAGCCCCG GCCGGCCGAC GCCGGGCAAC CCGTCGCTAG CGGCGGCGGC GTTCGCGACG
CCGAATCTTA GCGCGCAGAC GGCGAACACG GTGGAGGGCG GCTGGCGCGG TCGCGCCGAT
CGCTTCTCCT GGGACGTCGT GACTTACTAT TCGTGGGTCG ACAACGAGTT GCTCACGCTG
CGGGACGTCA CCGGCGCGCT GCTGGGGGCG GTCAACGCCG ACCGCACGAC GCATTTCGGT
GTCGAACTCG GGGCCGGAAT GAAGTTCACC GATCGGCTGT CCGGTCGCAT CGCCTACACC
TATCAGGATT TCCGCTTCGT CGACGATCCG AGCCGCGGCA ACAACCGGCT GGGCGGTGTG
GTGCCGCATC TGATCTATGC GCAATTGCAG TTGCAGGCGA CCGACGCCTG GATGGTGCAG
GGCGCGGTCC GCTGGAGTCC GGCCGAGGTG GCGGTCGACA ACATGAACAC GCTGTTCGCC
GACCCCTATG CGGTGGTCGA CCTGCGCAGC GAGTACCAGA TTGACAAGAC CTTCCGGGTG
TTCGGCGAGA TCACCAATCT GTTCGACAAG ACCTATGCGG CGACCACCCT GGTGGTCGAT
CAGGCGACGG CCAGCCAGGC GGCATTCCTG CCGGCCGACG GACGGGGCTT CTACGCGGGT
ATCAAGGCAA AGTTCTGA
 
Protein sequence
MNLVAQTKTE TAMDANRRAS RRGSLRTMTA AVLLSSTSQA CLAQSATPGS TITLEPVVVE 
APRQPVRSVV RQRSPAPRSA ATVMPAATPS LALLADVVRQ RFEILPGGVA LVSRQDMANR
GNPTLANSLS GVPGLIVQNF LGSNDQPRIQ MRGSAQQNPA ERGVLVLSNG LPINRADGSY
IIGFANAQQA ESIEVYRGYM ANRLGATVLS GAINFVSPTG SSQPGTQIGV SGGSFGQINS
SGQVGGKKDN VDAFIQFDTS RRDGYRGYNS SERVSVNGNV GVALSENVKT RFFMGYTDLG
FDVPGPVNKT TLYANPKQVN PGPTVVGGVA VNPGPNAVRD KPRREASQFM VGNRTTAVFD
AHLFDVAMGY IYTDDTFRFP ISSGVRTTQG GDFTGVARYA YNPAAALLPL FETTAQYTVG
SADRGYYLNQ SGQTGAQFGA NRLNAQTLSL YTGANIPVWH QLVVSPSISY AYATRDNDDV
YGSARRPTIA YNPANPTVLL PNGSVATQST SYSRNYSGWS PSLALSYRPD AVQTFFIAGS
HSFEPPTHDD LIATINGTPN SSPGRPTPGN PSLAAAAFAT PNLSAQTANT VEGGWRGRAD
RFSWDVVTYY SWVDNELLTL RDVTGALLGA VNADRTTHFG VELGAGMKFT DRLSGRIAYT
YQDFRFVDDP SRGNNRLGGV VPHLIYAQLQ LQATDAWMVQ GAVRWSPAEV AVDNMNTLFA
DPYAVVDLRS EYQIDKTFRV FGEITNLFDK TYAATTLVVD QATASQAAFL PADGRGFYAG
IKAKF