Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3784 |
Symbol | |
ID | 4899186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 909513 |
End bp | 911447 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640114388 |
Product | TonB-dependent receptor |
Protein accession | YP_001045636 |
Protein GI | 126464523 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.60396 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCATCC TGTCGCCCCG AACGATCCGG CTTCTGGCCG GGAGCGCCGC CCTGAGCGGG TCCTGCGTCA CCGCCCATGC GCAGGATCTG GCCTTCTCGC TCGATCCCAT CGTGGTGCAG GCGCGCGACG ATTTCGCCGA AGCCGCCGAC CGCGCCACCT CGATGTATGT CGCCGATGCC GAGCTGGAGC GCGCGCGCAC CGGAGACCTC AAGGACGTCT TCGCGGGCAT CGCCTCTGTC TCGGTCGGAG GGGCGCTGCC CCTCACGCAG AAGATCTTCG TGAACGGCGT CGACATGCTG AATCTCGGCG TCAGCATCGA CGGGGCCGCG CAGAACAACC GCGCCTTCCA CCATGTCTCG GCCAATGCGA TCGATCCGGG GCTGCTCAAG CAGGTGCGCG TGGATGCCAC GATCTCGCCG GCCGATGCCG GGCCCCATGC GCTCGCGGGC TCGGTCGTGT TCGAGACGGT CGATGCCGCC GACGTCCTCA GCGAAGGGCA GCGCTTCGGC GGCACCCTGC GCCTGAGCTA CGGCGACAAC GGCAAGACGG CGCAGGGCGC CCTGACCCTC GCCGGGCGGG AGGGCGGCTT CGAGTGGCTG GCCTATGCCA AGCGCGCCAC GGGCGACGAT TACGAGGATG GAGACGGGGC CACACGGCTC GGGACGGCGG CCAATCTCAA GAGCGCGGCG GGAAAGCTCG CCTATGAGAG CGAGGGCGGC CACCGGTTCG AGCTGTCGGC GCTGCGCCTG AGGGACGACG AGCTGCGCCA GTTCCGCGCG AATTTCGGCG GTCTGGGCGG GGTGGTCGAT CAGCTGCGCC TCTACGACAC GACGCGGGAA AGCTGGTCAT TCTCCTACGA GAACACGCAA GGTGAGGGGA TGTGGGATCC GAGCCTCCGG CTCGGCTATT CGGAAAGCGA CGTCGTCATC CCCCTGCCCT ACGACAGCAA CGGCCTGTCG GGCACATGGT CGGCCACGCT GCAGAACGAT TTCCACCTGA ACGCCACCGA CCGGATCTCG GCCGGCCTCG ACTGGCAGCG CCGTTTCGGG GAATATTCGA GCCCGACCTT CGGCGAGTTC CTCGAGGAGA CATCGCACAA CCTCGGCCTT TTCGTGCAGG CGCGGCTCGA GCCCACCGAT CGCTGGCGGC TCTCCTTCGG CGGCCGCCTC GACCGGCAGA AGTTCGAAGG CGTCGACGGC AGCGACCTCG GCCACTCGGG GCCCTCGGGC AACCTCTCCG CAAGCTACGC CCTGACCGAG AGCCTCACGC TGCGCGGCGG CCTGTCCTCG ATCTTCGGCG GCATCGACAT CGAGGACAAT TACATCTTCC GGCCGAGCTG GAGCTACGAC AGCCTCCGGC CGTCGCGGGC GCGGAACGCG AGCCTCGGCT TCGACTGGGA CGCGGGCGCC CTCAAGGTCG GAGGCGAGCT CTTCCTGACC GAGATCAACC GCGCGCGCAG TGCCACCGAA ATGTTCGACT TCGAGAGCCG CGGCTTCAAC CTCGGCGCGA CCTACGGCTG GGACGGTGGC TTTGCGCGCG TCACCCTCTC CCACAGCGAG GTGAAGGTGG ACGGCGCGAA GGCCGCGGGC TACGAGGCGC TCGACTTCGG CGCCCCTCTG GGCACGGTGG CCGCGGTCGA GCTTCAGCAG GACACGTCAG TTGCGGGGCT GCGGCTCGGC GGCGGGCTCG ATCTCGCGCT CGACCACGAC ATGCCCGCTA CGGCCGAACG GGATCTCGCG GGCTATTCGG TGGTGAACCT CTTTGCCGAA TATGTGCCGC CTGAGGCGCA GAACCTCACG CTCCGGCTGC AGGTCGACAA CCTATTCGAC AAGACCTATG CGGACCGCGC GACCTACGGG GCGGAATACA GCGACGTCAC CTCGCTGAAG GAGCCGGGCC GGACGATCAC GCTCAGCGCG GTCACGCGCT TCTGA
|
Protein sequence | MSILSPRTIR LLAGSAALSG SCVTAHAQDL AFSLDPIVVQ ARDDFAEAAD RATSMYVADA ELERARTGDL KDVFAGIASV SVGGALPLTQ KIFVNGVDML NLGVSIDGAA QNNRAFHHVS ANAIDPGLLK QVRVDATISP ADAGPHALAG SVVFETVDAA DVLSEGQRFG GTLRLSYGDN GKTAQGALTL AGREGGFEWL AYAKRATGDD YEDGDGATRL GTAANLKSAA GKLAYESEGG HRFELSALRL RDDELRQFRA NFGGLGGVVD QLRLYDTTRE SWSFSYENTQ GEGMWDPSLR LGYSESDVVI PLPYDSNGLS GTWSATLQND FHLNATDRIS AGLDWQRRFG EYSSPTFGEF LEETSHNLGL FVQARLEPTD RWRLSFGGRL DRQKFEGVDG SDLGHSGPSG NLSASYALTE SLTLRGGLSS IFGGIDIEDN YIFRPSWSYD SLRPSRARNA SLGFDWDAGA LKVGGELFLT EINRARSATE MFDFESRGFN LGATYGWDGG FARVTLSHSE VKVDGAKAAG YEALDFGAPL GTVAAVELQQ DTSVAGLRLG GGLDLALDHD MPATAERDLA GYSVVNLFAE YVPPEAQNLT LRLQVDNLFD KTYADRATYG AEYSDVTSLK EPGRTITLSA VTRF
|
| |