Gene Rsph17029_4121 details

Gene Information

Locus tag: Rsph17029_4121
Symbol:
ID: 4894950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009040
Strand:
Start bp: 63491
End bp: 65866
Gene Length: 2376 bp
Protein Length: 791 aa
Translation table: 11
GC content: 68%
IMG OID: 640110519
Product: TonB-dependent siderophore receptor
Protein accession: YP_001041831
Protein GI: 126464855
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4773] Outer membrane receptor for ferric coprogen and ferric-rhodotorulic acid
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
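
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 63491
end_bp = 65866

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2376, matching "Gene Length"
print(protein_length)  # 791, matching "Protein Length"
```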


Plasmid Coverage information

Num covering plasmid clones: 63
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 115
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGGGC AGTCTCTTCT CCCCTGGCGG ACCGGCCGTA TGGCCCGGCT GCTGACGACC 
GCGCTTCTCT GCGGGGCGGC GCTCCCCGTG CTGGCGCAAA GCCTGCCGCA CCGGTTCGAC
ATTCCGGCGA AACCCGTGAC GCGGGCGGTC AATGACATCG GCCGCGTGGC CGGGCTCTCG
ATCGTGATGC CCGACGAGGG CGCTGTGGCG GTGCGGGGCA ATCCGGTGCA GGGCGCGATG
AGCGTCGAGG CGGCGATGGA GACGCTGCTT GCGGGCACCG GCCTCTCGTG GCGCTTCGCC
AATGCGGGCA CGATCCACGT CTTCCAGCGC ATCCCCGCGG GCGCGGCGGA CGGCTCGGCC
GGCGTGATGC TCGAGCGGAT CCGGGTGGCG GGCGACAGCA ACGGGGCCAC GAGCTTCGTC
GCGGGCGCCA GCGGCACGGC CAGCAAGAGC GGCACGCCCG TTCTGGAAGT GCCGCAGTCG
GTCAGCGTGA TCGGCCCGCG CCAGATGGAG GCGCAGGGCG CCCGCTCGGT CACCGAGGCG
CTGCGCTATG TGCCGGGCGT CAACATCGAA ACCTACGGCC CGGACCCCAA GGGCTTCGAA
TGGATCATGC TGCGCGGCTT CAACGGCCAG TCCTCCAGCG CCTATCTCGA CGGGCTGCGC
CAGATCGCCT CGAACTACAG CCATTTCCGC ACCGATCCGC ACCAGCTCGA GACGATCGAA
GTGCTGCGCG GGCCGTCCTC GGCGCTCTAC GGGCAGTCCG ATGCGGGCGG CGTGGTCGGC
AAGACCTCGA AGCGGCCGGT CACCGAACCG CTGCGCGAGG CCGAGCTCTC CTACGGCAGC
TTCGCCACGA CGCAGGCGGC GGTCGATGTG GGCGGCGCGC TCACGGAGGA CAAGACGCTC
TCCTACCGGC TGGTCGGCGT GGCGCGCGAC GGGAACACGC AATTTTCCTA CGGCGACGGC
ACGCGGATGA AGGACGACCG GCTGATGCTG GCGCCCTCGA TCACCTGGGC ACCCACCGAC
GCCACCAGCC TGACCGTCAC GGCGCAGGCG CTGCGCGACC GCTCGGGCGG CACGGCGATC
TTCTTCACGC CGACCGACAT CCTCGTGGGC GACCCGAACT TCTCGCGGAG CGAGCAGGAC
CAGAAGACGC TGGGCTACGA GTTCAGCCAC CGGATGGACA ATGGCTGGAC GGTGCGCCAG
AATCTGCGTT ACGGGCGCGT GGACTTCGAT CTCGACATGA TCGCCATGAT CGGGGCGGAT
GCCACCGGCC TCACGCGGCA GGCGCGGCGC TTCTCGGAAA GCCTCGACAG CCTTGCGGTC
GACACGAACC TCCTGGGCGA GTTCCAGACC GGGCGGCTCT CGCACAAGCT GCTGATCGGG
CTCGACCATT CGCGCAGCGA CACGGACGCG CGTCGCTGGA ACGGCACCGC GCCCTCGCTC
GATCCCTATG CGCCCGTCTA TGGCGTGGCC GTTCCCACCC CCACGACCGT GGCCTATGAC
TACACCGAAG AGTATCGCCA GACGGGCCTC TACGTTCAGG ACGAAATCCG GCTGGATCGC
TTCCTCTTCA CGCTGGGCGG GCGTCAGGAC TGGCTCAGCA CCAGAACCGA CGACCACCTG
ACCGGAACCG CGCGCGATGT CGATCTCGAC AATTTCTCGG GCCGGGTGGG GGTGAGCTAC
CTGACGGATT CGGGCCTCGT CCCCTACCTC AGCTATTCCG AGAGCTTCCT GCCGAACGCG
GGCCTCGGCT CGGACGGCCG TACCTTCGAT CCCACCCGCG GCCAGCAATG GGAGCTGGGA
GCGAAGTATC AGCCCTCGGG CACGGACGCG CTGCTGACGG TGGCGCTGTT CGACATCACC
AAGTCGAACG TGCTCACGCC CGAACTGGGG CCGGGCGGCA TCTCGACCGG CTACAACATC
GCCACAGGCG AGATCCGCTC GCGCGGGATC GAGGTCGAGG GCAAGGCCTC GCTCGGCTCG
GGCTGGGACA TCGCGGCGAA CTACAGCTAC AGCGATGTCG AGATCACCGA GGACAATGCC
GGCAACGAGG GCAACCGCCC CGCGCTGGTG CCGAAGTCGC AGGCGTCGCT CTGGTTGAAC
TACAGCTTCG GCGCGGGCGC GCTGGACGGC CTCTCGCTCG GCGGCGGGCT GCGCCATGTG
GGCACGAGCT TCGGCGACAA CGGCAATACG ATCAAGGTGG ATGGCCGGAC GGTGATCGAC
CTCGGCGCGA GCTATGCGCT GAGCGAGACC GCGCGGCTGG CGCTCAATGT GACGAACCTG
ACGGACAAGG AGTATTTCAC CACCTGCGCC AGCGCCTATT CCTGCTACGA GGGCGACCGG
CGGATGGTCA CGGCCAAGAT CGCCATCGGC TTCTGA
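
The record reports a GC content of 68% for this gene. As a rough sketch of how such a figure can be recomputed from the space-grouped listing above, the hypothetical helper `gc_percent` below strips whitespace and counts G and C bases. It is shown on the first line of the sequence only; how the database itself rounds the value is not stated in the record.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces between 10-base groups, line breaks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = ("ATGCGAGGGC AGTCTCTTCT CCCCTGGCGG "
              "ACCGGCCGTA TGGCCCGGCT GCTGACGACC")
print(gc_percent(first_line))  # 70.0 for these 60 bases
```

Passing the full listing as one string would give the value for the whole gene.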
 
Protein sequence
MRGQSLLPWR TGRMARLLTT ALLCGAALPV LAQSLPHRFD IPAKPVTRAV NDIGRVAGLS 
IVMPDEGAVA VRGNPVQGAM SVEAAMETLL AGTGLSWRFA NAGTIHVFQR IPAGAADGSA
GVMLERIRVA GDSNGATSFV AGASGTASKS GTPVLEVPQS VSVIGPRQME AQGARSVTEA
LRYVPGVNIE TYGPDPKGFE WIMLRGFNGQ SSSAYLDGLR QIASNYSHFR TDPHQLETIE
VLRGPSSALY GQSDAGGVVG KTSKRPVTEP LREAELSYGS FATTQAAVDV GGALTEDKTL
SYRLVGVARD GNTQFSYGDG TRMKDDRLML APSITWAPTD ATSLTVTAQA LRDRSGGTAI
FFTPTDILVG DPNFSRSEQD QKTLGYEFSH RMDNGWTVRQ NLRYGRVDFD LDMIAMIGAD
ATGLTRQARR FSESLDSLAV DTNLLGEFQT GRLSHKLLIG LDHSRSDTDA RRWNGTAPSL
DPYAPVYGVA VPTPTTVAYD YTEEYRQTGL YVQDEIRLDR FLFTLGGRQD WLSTRTDDHL
TGTARDVDLD NFSGRVGVSY LTDSGLVPYL SYSESFLPNA GLGSDGRTFD PTRGQQWELG
AKYQPSGTDA LLTVALFDIT KSNVLTPELG PGGISTGYNI ATGEIRSRGI EVEGKASLGS
GWDIAANYSY SDVEITEDNA GNEGNRPALV PKSQASLWLN YSFGAGALDG LSLGGGLRHV
GTSFGDNGNT IKVDGRTVID LGASYALSET ARLALNVTNL TDKEYFTTCA SAYSCYEGDR
RMVTAKIAIG F
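
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may act as start codons. The sketch below is a plain-Python illustration, not the database's own method: `translate_cds` is a hypothetical helper, it does not handle alternative start codons (not needed here, since the gene begins with ATG), and it stops at the first stop codon.

```python
# Standard codon assignments, ordered by first, second, third base over "TCAG".
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; trailing bases that do not fill a codon are skipped.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = ("ATGCGAGGGC AGTCTCTTCT CCCCTGGCGG "
              "ACCGGCCGTA TGGCCCGGCT GCTGACGACC")
print(translate_cds(first_line))  # MRGQSLLPWRTGRMARLLTT
```

The printed residues match the first twenty residues of the protein sequence listed above.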