Gene Rsph17029_3784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3784 
Symbol 
ID4899186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp909513 
End bp911447 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content68% 
IMG OID640114388 
ProductTonB-dependent receptor 
Protein accessionYP_001045636 
Protein GI126464523 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.60396 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATCC TGTCGCCCCG AACGATCCGG CTTCTGGCCG GGAGCGCCGC CCTGAGCGGG 
TCCTGCGTCA CCGCCCATGC GCAGGATCTG GCCTTCTCGC TCGATCCCAT CGTGGTGCAG
GCGCGCGACG ATTTCGCCGA AGCCGCCGAC CGCGCCACCT CGATGTATGT CGCCGATGCC
GAGCTGGAGC GCGCGCGCAC CGGAGACCTC AAGGACGTCT TCGCGGGCAT CGCCTCTGTC
TCGGTCGGAG GGGCGCTGCC CCTCACGCAG AAGATCTTCG TGAACGGCGT CGACATGCTG
AATCTCGGCG TCAGCATCGA CGGGGCCGCG CAGAACAACC GCGCCTTCCA CCATGTCTCG
GCCAATGCGA TCGATCCGGG GCTGCTCAAG CAGGTGCGCG TGGATGCCAC GATCTCGCCG
GCCGATGCCG GGCCCCATGC GCTCGCGGGC TCGGTCGTGT TCGAGACGGT CGATGCCGCC
GACGTCCTCA GCGAAGGGCA GCGCTTCGGC GGCACCCTGC GCCTGAGCTA CGGCGACAAC
GGCAAGACGG CGCAGGGCGC CCTGACCCTC GCCGGGCGGG AGGGCGGCTT CGAGTGGCTG
GCCTATGCCA AGCGCGCCAC GGGCGACGAT TACGAGGATG GAGACGGGGC CACACGGCTC
GGGACGGCGG CCAATCTCAA GAGCGCGGCG GGAAAGCTCG CCTATGAGAG CGAGGGCGGC
CACCGGTTCG AGCTGTCGGC GCTGCGCCTG AGGGACGACG AGCTGCGCCA GTTCCGCGCG
AATTTCGGCG GTCTGGGCGG GGTGGTCGAT CAGCTGCGCC TCTACGACAC GACGCGGGAA
AGCTGGTCAT TCTCCTACGA GAACACGCAA GGTGAGGGGA TGTGGGATCC GAGCCTCCGG
CTCGGCTATT CGGAAAGCGA CGTCGTCATC CCCCTGCCCT ACGACAGCAA CGGCCTGTCG
GGCACATGGT CGGCCACGCT GCAGAACGAT TTCCACCTGA ACGCCACCGA CCGGATCTCG
GCCGGCCTCG ACTGGCAGCG CCGTTTCGGG GAATATTCGA GCCCGACCTT CGGCGAGTTC
CTCGAGGAGA CATCGCACAA CCTCGGCCTT TTCGTGCAGG CGCGGCTCGA GCCCACCGAT
CGCTGGCGGC TCTCCTTCGG CGGCCGCCTC GACCGGCAGA AGTTCGAAGG CGTCGACGGC
AGCGACCTCG GCCACTCGGG GCCCTCGGGC AACCTCTCCG CAAGCTACGC CCTGACCGAG
AGCCTCACGC TGCGCGGCGG CCTGTCCTCG ATCTTCGGCG GCATCGACAT CGAGGACAAT
TACATCTTCC GGCCGAGCTG GAGCTACGAC AGCCTCCGGC CGTCGCGGGC GCGGAACGCG
AGCCTCGGCT TCGACTGGGA CGCGGGCGCC CTCAAGGTCG GAGGCGAGCT CTTCCTGACC
GAGATCAACC GCGCGCGCAG TGCCACCGAA ATGTTCGACT TCGAGAGCCG CGGCTTCAAC
CTCGGCGCGA CCTACGGCTG GGACGGTGGC TTTGCGCGCG TCACCCTCTC CCACAGCGAG
GTGAAGGTGG ACGGCGCGAA GGCCGCGGGC TACGAGGCGC TCGACTTCGG CGCCCCTCTG
GGCACGGTGG CCGCGGTCGA GCTTCAGCAG GACACGTCAG TTGCGGGGCT GCGGCTCGGC
GGCGGGCTCG ATCTCGCGCT CGACCACGAC ATGCCCGCTA CGGCCGAACG GGATCTCGCG
GGCTATTCGG TGGTGAACCT CTTTGCCGAA TATGTGCCGC CTGAGGCGCA GAACCTCACG
CTCCGGCTGC AGGTCGACAA CCTATTCGAC AAGACCTATG CGGACCGCGC GACCTACGGG
GCGGAATACA GCGACGTCAC CTCGCTGAAG GAGCCGGGCC GGACGATCAC GCTCAGCGCG
GTCACGCGCT TCTGA
 
Protein sequence
MSILSPRTIR LLAGSAALSG SCVTAHAQDL AFSLDPIVVQ ARDDFAEAAD RATSMYVADA 
ELERARTGDL KDVFAGIASV SVGGALPLTQ KIFVNGVDML NLGVSIDGAA QNNRAFHHVS
ANAIDPGLLK QVRVDATISP ADAGPHALAG SVVFETVDAA DVLSEGQRFG GTLRLSYGDN
GKTAQGALTL AGREGGFEWL AYAKRATGDD YEDGDGATRL GTAANLKSAA GKLAYESEGG
HRFELSALRL RDDELRQFRA NFGGLGGVVD QLRLYDTTRE SWSFSYENTQ GEGMWDPSLR
LGYSESDVVI PLPYDSNGLS GTWSATLQND FHLNATDRIS AGLDWQRRFG EYSSPTFGEF
LEETSHNLGL FVQARLEPTD RWRLSFGGRL DRQKFEGVDG SDLGHSGPSG NLSASYALTE
SLTLRGGLSS IFGGIDIEDN YIFRPSWSYD SLRPSRARNA SLGFDWDAGA LKVGGELFLT
EINRARSATE MFDFESRGFN LGATYGWDGG FARVTLSHSE VKVDGAKAAG YEALDFGAPL
GTVAAVELQQ DTSVAGLRLG GGLDLALDHD MPATAERDLA GYSVVNLFAE YVPPEAQNLT
LRLQVDNLFD KTYADRATYG AEYSDVTSLK EPGRTITLSA VTRF