Gene Rsph17029_0089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0089 
Symbol 
ID4895258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp100998 
End bp103097 
Gene Length2100 bp 
Protein Length699 aa 
Translation table11 
GC content67% 
IMG OID640110672 
ProductTonB-dependent siderophore receptor 
Protein accessionYP_001041981 
Protein GI126460867 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4774] Outer membrane receptor for monomeric catechols 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor
[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.26085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACGA AACACCGGTC GATGCTGCTC GCAGGCGCGA GCACCATTGC TCTGGCCCCG 
GCCGCCCTCT TCGCCCAGGA GGCGATCGAG CTCGACGCGA TCGTGGTCAC GGCCACCACC
GACGTCACAA CCCAGGCCGA TGGCTACAAG GCCGACTACA ACCAGTCCGC CACCAAATCC
GACACCCCCG TGGCCGAGAC GACCCAATCG GTCTCGGTCG TGACGGCAGA GCAGATCAAG
GATCAGGGCG CGGAAACCCT CGGGCAGGCG CTCCGCTACA GCCCCGGCGT GCTCGGCGAC
CCCTACGGCG TCGATCCGCG CTTCGACAGC CCCACGATCC GCGGCTTCGA GGCGCGCGGC
TCGCAATATG TCAACGGCCT GCGGCAGCTG CGCTACATGG GTGCCCCGGC CTACGAGACC
TTCGCGCTCC AGCAGATCGA GGTGCTGAAC GGTCCCAACT CCTCGCTCTA TGGCGCGGGC
TCGCCGGCCG GGATCATCAA CCAGGTCCAG AAACGCGCGC AGGGCTTCGA CTTCGGCGAA
CTGGGCGTGG GCCTCGACGA CAACGGCTCG CGCCAGAGCT TCTTCGACTG GAACCGCACG
GTGAGCGACA CGCTCTCGTT CCGCGCGACC GGGATCGCCA AGGACTACGA GAGCCAGGTG
GAGGAGATCG GCCTCGAACG CGGGTATCTG GGCCTCGCCG CCCGCTGGAA GCCCACCGAC
CGCACCACGC TCGACATCAT CTCCTCCTAT ACCGACGACG CGCCGACTTC GCCGCCGGGC
ATCCCCTTCG CGCTCACCGG GCAGGGGAAC GACAAGTATC TGCGCGAGCT CTACACCGGC
GAGCCGGGCT GGGACGACCA TGACCGCCAG ATCTTCAACA TCGGCTACGA GCTGAGCCAC
GAGTTCGACA GCGGCTGGAC CTTCAGCCAG GGCTTCCGCT ACGAGAAGTT CGACTGGGAA
TATACCGGCC ACTATGTGAC CGGCATCGAC GCCAGCGGCA CCGGGATCAC CCGGGGCGCC
AACTATCAGC GCGAGAACAC CACCGGCCTG AGCCTCGATT CACGGCTTGC CGGAGAGGTG
CTGACCGGCG GCATGGAGCA CAAGCTGCTC TTCGGGCTCG ACCTGCGCAA ATACGACGCC
GACACCGTCA CCGAGTTCTA CAACGCCACC GGCGGCGTCA CGAACCTCGA CTGGCGTAAC
CCGATCTATG GCGGCGTTCC CACCGGCGCG CCCTGGTATG TCAGCACGCC CGACGTGACG
CAGACCCAGA TCGGCCTCTA CGCGCAGGAC GAGATCACCG CCGGGCGCTG GCGCGGGTCG
ATCGCGCTGC GCCACGACTG GTCGAAGCAG GAGGGCACGA CCTACACGAA CTTCGCGGGC
GAAGGCGAGA TCGACCAGTC GGACAAGGCG CTGTCGGGTC GCGCGGGCCT TGGCTACGAG
ATCGCGCCCG GCGCGCTCGT CTATGCCAAC TACTCCACCT CCTTCGACCC GGAGATCGGC
GTGGACGGCG CGGGCGAGCA GCTGGAGCCG ACGACCGGCA AGCAATGGGA GCTGGGCGTG
AAATATCAGC CCGACAGCTT CAACGCGCTC TTCACTGCGG CGATCTACGA TCTGCGGCAG
GAGAACCTGA CGGTGAATCT CGGCGGGGCC GAGGGCCGTC GTCAGGTCGG CGAGGTCAAG
TCCTCCGGCC TTGAACTCGG CGCGGTGGGC GAGCTGGCAC CGGGGCTGAA CCTGCGCGCA
AGCTACGCCT ACAACGACAC CGAGCAGGTC GATCCGAGCG GCGCCAACGA CGGCAACGAG
ATGCCCAACG CGCCGCGCCA TCTGGCGAGC CTCTGGCTCG ACAAGGCCTT CGACAACGGC
GTGAGCCTCG GCGGCGGCCT GCGCTACATC GGCGAGCGCG AGGGCGATCT GGCCAACCTC
TATTCGCTCG ACTCCGTGAC GCTGCTCGAC CTCGCCGTGG GCTACAGCCG CGAGAACATG
GAGGCCTCGA TCAACCTCAA CAACCTGTCC GACGAGGTCT ACCTCGCCAA CTGCGGCTCC
TTCGGCTGCT ACTACGGCGA GGGCCGGACG ATCTCGGCCA AGATCAGCTA CAAGTGGTAG
 
Protein sequence
MKTKHRSMLL AGASTIALAP AALFAQEAIE LDAIVVTATT DVTTQADGYK ADYNQSATKS 
DTPVAETTQS VSVVTAEQIK DQGAETLGQA LRYSPGVLGD PYGVDPRFDS PTIRGFEARG
SQYVNGLRQL RYMGAPAYET FALQQIEVLN GPNSSLYGAG SPAGIINQVQ KRAQGFDFGE
LGVGLDDNGS RQSFFDWNRT VSDTLSFRAT GIAKDYESQV EEIGLERGYL GLAARWKPTD
RTTLDIISSY TDDAPTSPPG IPFALTGQGN DKYLRELYTG EPGWDDHDRQ IFNIGYELSH
EFDSGWTFSQ GFRYEKFDWE YTGHYVTGID ASGTGITRGA NYQRENTTGL SLDSRLAGEV
LTGGMEHKLL FGLDLRKYDA DTVTEFYNAT GGVTNLDWRN PIYGGVPTGA PWYVSTPDVT
QTQIGLYAQD EITAGRWRGS IALRHDWSKQ EGTTYTNFAG EGEIDQSDKA LSGRAGLGYE
IAPGALVYAN YSTSFDPEIG VDGAGEQLEP TTGKQWELGV KYQPDSFNAL FTAAIYDLRQ
ENLTVNLGGA EGRRQVGEVK SSGLELGAVG ELAPGLNLRA SYAYNDTEQV DPSGANDGNE
MPNAPRHLAS LWLDKAFDNG VSLGGGLRYI GEREGDLANL YSLDSVTLLD LAVGYSRENM
EASINLNNLS DEVYLANCGS FGCYYGEGRT ISAKISYKW