Gene Information |
Locus tag | Rsph17029_2465 |
Symbol | |
ID | 4897342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2597676 |
End bp | 2599265 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640113063 |
Product | extracellular solute-binding protein |
Protein accession | YP_001044339 |
Protein GI | 126463225 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
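The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch: it assumes the Start bp and End bp values are inclusive 1-based coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2597676
end_bp = 2599265

# Inclusive coordinates: both end points belong to the gene.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values match the Gene Length (1590 bp) and Protein Length (529 aa) fields.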
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.352303 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCATCC GAGCGCGCTT GCTGGCGACG GCGGCCGTTG CCGCATTCGC CCTGACCACG GCTCCGCTTT CCGCCGAGAC GCCGCCCGAC ACCTTCATCC AGGCCTGGGC CATCGACGAC ATGATCACGC TCGACCCGGC CGAGGTGTTC GAGTTCACCG CAGCGGAGAT CATCGGCAAC AGCTACGAGA CGATCATCGG CTACGACGTC AACGACGTGT CGAACATCTT CGGGCGCGTG GCCGAGAGCT GGGAGCTGTC GGAGGACGGC CTGACCATGA GCTTCAAGGT GCGCGAGGGC AAGACATTCG CCTCGGGCAA CGACCTGACC GCCGAGGATG TGGTCTACAG CCTCGTGCGG GCCGTGAAGC TGGACAAGTC GCCGGCCTTC ATCCTCGGCC AGTTCGGCCT GACGCCCGAC AATGTCGAAG AGAAGATCAA GCAGACCGGC GACTATGCCT TCACCTTCGA GATGGACAAG GCCTATGCGC CGACCTTCCT GCTCTACTGC CTGACGGCGA CGGTCGCCTC GGTGGTGGAC AAGGATCTCG TGCAGTCCAA CGAAGCGGAC GGCGACTGGG GCTACAACTG GCTCAAGACG AATTACGCAG GCTCCGGCCC CTTCACCATC CGCGAATGGC GCGCGAACGA GGCCGTCGTG ATGGAGCGGA ACGACAACTG GGACGGCGAG AAGCCCGCCA TGGCGCGCGC GATCTACCGC CACATTCCCG AGGCCGCGAC CGAGCGGCTG CTGCTCGAGC AGGGCGACAT CGACATCGCG CGCAAGCTTT TGCCCGAAGA GATCGAGGCG CTGAGCCAGA ACCCCGACAT CAAGATCCAG AGCGGGGTGA AGGGCACGAT CTTCTATCTC GGCCTGAACC AGAAGAACGA GAATCTGGCC AAGCCCGAGG TGCGCGAGGC GATGAAATGG CTCGTAGACT ACGACGCCAT CGCCGAGACG CTGGTCAAGG GCATGAAGAA GAAGCACCAG ACCTTCCTGC CCGAGGGCTT CCTCGGCGCG CTGGACGAGA ACCCCTACAG CTTCGATCCG GCCAAGGCCA AGGAGCTTCT GGCGCAGGCG GGCCTGCCCG ACGGCTTCAC CGTCACGATG GACACCCGCA ACACGCCCGA GGTGACCTCG ATCGCGCAGG CGATCCAGCA GACGATGGCG CAGGCCGGGA TCACCATCGA GATCATCCCC GGCGACGGCG GCCAGACGCT CGAGAAGTAC CGGGCGCGGA CGCATGACAT CTATATCGGC CAGTGGGGCC CCGACTATCA AGACCCGCAT ACCAACGCGA CCTTCGCGCA GAACCCCGAC AATTCCGACG ATGCGGCCTC GAAGCCGCTG GCCTGGCGCA ACGCCTGGGA GATCCCCGAG CTGACCGCCC AGGCGGACGC CGCCGTGCTC GAGCGCGACA CCGAGAAGCG CGCCCAGATG TATCGCGACA TGCAGGAGGA GGTGCTGAAG ACCTCGCCCT TCGTCATCAT GTTCCAGGAA TCCGAGGTCG TCGCCATGCG CAAGAACGTC GAGGGCTACA TCATCGGCCC GTCGTTCAAC GACAACTCGT TCCGGGCCGT GACGAAGTAG
|
Protein sequence | MFIRARLLAT AAVAAFALTT APLSAETPPD TFIQAWAIDD MITLDPAEVF EFTAAEIIGN SYETIIGYDV NDVSNIFGRV AESWELSEDG LTMSFKVREG KTFASGNDLT AEDVVYSLVR AVKLDKSPAF ILGQFGLTPD NVEEKIKQTG DYAFTFEMDK AYAPTFLLYC LTATVASVVD KDLVQSNEAD GDWGYNWLKT NYAGSGPFTI REWRANEAVV MERNDNWDGE KPAMARAIYR HIPEAATERL LLEQGDIDIA RKLLPEEIEA LSQNPDIKIQ SGVKGTIFYL GLNQKNENLA KPEVREAMKW LVDYDAIAET LVKGMKKKHQ TFLPEGFLGA LDENPYSFDP AKAKELLAQA GLPDGFTVTM DTRNTPEVTS IAQAIQQTMA QAGITIEIIP GDGGQTLEKY RARTHDIYIG QWGPDYQDPH TNATFAQNPD NSDDAASKPL AWRNAWEIPE LTAQADAAVL ERDTEKRAQM YRDMQEEVLK TSPFVIMFQE SEVVAMRKNV EGYIIGPSFN DNSFRAVTK
|
| |
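The relationship between the gene sequence and the protein sequence can be illustrated by translating a fragment. The sketch below is hedged in several ways: it uses only the first 30 and last 12 nucleotides of the gene sequence listed above, it assumes the sequence is shown in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand), and it uses the standard amino acid assignments, which translation table 11 shares (table 11 differs in the start codons it permits). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace.

    Trailing bases that do not form a complete codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence, as listed in the record.
first_30 = "ATGTTCATCC GAGCGCGCTT GCTGGCGACG"
# Last four codons of the gene sequence, including the stop codon.
last_12 = "GTGACGAAGTAG"

print(translate(first_30))
print(translate(last_12))
```

The first fragment translates to the opening ten residues of the protein sequence (MFIRARLLAT), and the last fragment yields the final three residues followed by the stop.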