Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0100 |
Symbol | |
ID | 4895336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 114016 |
End bp | 115590 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640110683 |
Product | extracellular solute-binding protein |
Protein accession | YP_001041992 |
Protein GI | 126460878 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.080172 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAGAC GGAGTTTCCT TCGATACGGC GCACTGGCGG GCACGGCCCT CGGGGCGGCG CGCATCAACC CCGACTTCTT CTTCAGCTCC GCCTTCGCGC AGGAGTCCCG CCCGCTGGTG TTCCTCTCGG CCGAGAACAT CACCGGCAAC TGGGACCCGA CCGCGCATAC GACGCTGTCG CAGACCAATA TCGAGGGCTT CGTCATGGGT TATCTGACCC GGGCCCCGAT GCGCCCGGAA GAGCCCGACA AGGTGGTCTA TGAACTCGCG ACCGAGATCA CCGAGCTCGA CGCCCACCGG CTGCAGATCA AGCTGCGCGA CGGTGTCACC TTCCACGACG GCAAGCCCTT CACCGCCGAG GATGTGAAGG CGACCTTCGA GTACGGCGCC AAGCTCGACC GCCCGAAACA GGTCTATCCG GGCGGCCCGG AGACCTTCTC CGTCGAGACG CCCGACGATC ATACCGTGAT CGTCGACACG TCGAAGGGCG GCTACGGCGC CTCGCTCTTC ATCTTCCTGG CCTCCTACCT GCCGATCCTG TCGGCCAAGG ACGTGGCCGA AGGCCCCAAA GGCCCGCTGT CGCAGCGGCT GAACGGCACC GGCCCCTTCC GCTTCGTCGA GCAGCGCGGC AACGACACGG TGATGGAAGC CTACGATGGC TATTTCCGCG GCGCGCCGAA GGTCACCGGC GTCACCTTCT CGTTCGTGGG CGATGCGACG ACGCGCATGC TGTCGCTGAT GAACGGGCAG GCCGATGTGA TCGAACGGCT GGAGCCCGAG CAGGTGGAAA CCCTGCAGGC GCGCGATGAC ATCAAGATCT CGCGGCTGGT CTCGGTCGAG AACAAGTATC TGTGGTTCCG CTGCTCGAAG CCGCCCTTCG ACGACTGGCG CGTGCGCAAG GCCGCCTGCC ATGCCATCGA TCGCAGCATG ATCATGGAGA TCATGGGGTC GGCGGGCGAG GCCTCGTCGA ACTTCGTCTC GCCGATCAAG TTCGGCTATA TCGATCTGGA GAACTACCCC GAATACAATC CCGAGGAATG CCAGCGCCTG CTGGCCGAAG CGGGCTACCC GAACGGCGAG GGCCTGCCCG AGCTGGAATA TATCACCTCG ACCGGCTTCT ATCCCAAGAC CAAGGAATAT GGCGAGCTGA TCGCGGCGCT TCTGCAGGAG CAGGGCTTCC CGGTCACGCT GAACGTGATG GAGGTCGCGG CCTGGAATGA GCGGCTCTAC GACCGGCCGG GCGGCGGCCC GGGCCATATG GTCGATTGCG GCTGGTCCAC CGGGTCTCCC GAGCCCGATC TGGTCCTGCG CACCCACTTC CACTCCACCG CCAAGCGGAT CTGCGGCATC GTCGATCCCG AGATCGACGC CGCCCTCGAT GCGGAGCGTG ACGCGCCCTC GCTCGAGGCG CGCAAGGAGA GCCTGCAGAC CAACCTGATG CCGATGCTGG CCGACAAGGC GCCGGCGCTG AGCCTCTTCA CCTCGGTCCT GATCCACGGG ATGCGAGCCA ATGTGGAGGG ACTGTTCATC TACCCGGATG GCCAGTCGGA CGCCTCGCAG ACGACGCTCG GCTGA
|
Protein sequence | MDRRSFLRYG ALAGTALGAA RINPDFFFSS AFAQESRPLV FLSAENITGN WDPTAHTTLS QTNIEGFVMG YLTRAPMRPE EPDKVVYELA TEITELDAHR LQIKLRDGVT FHDGKPFTAE DVKATFEYGA KLDRPKQVYP GGPETFSVET PDDHTVIVDT SKGGYGASLF IFLASYLPIL SAKDVAEGPK GPLSQRLNGT GPFRFVEQRG NDTVMEAYDG YFRGAPKVTG VTFSFVGDAT TRMLSLMNGQ ADVIERLEPE QVETLQARDD IKISRLVSVE NKYLWFRCSK PPFDDWRVRK AACHAIDRSM IMEIMGSAGE ASSNFVSPIK FGYIDLENYP EYNPEECQRL LAEAGYPNGE GLPELEYITS TGFYPKTKEY GELIAALLQE QGFPVTLNVM EVAAWNERLY DRPGGGPGHM VDCGWSTGSP EPDLVLRTHF HSTAKRICGI VDPEIDAALD AERDAPSLEA RKESLQTNLM PMLADKAPAL SLFTSVLIHG MRANVEGLFI YPDGQSDASQ TTLG
|
| |