Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1360 |
Symbol | |
ID | 4896840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 1414095 |
End bp | 1415726 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640111947 |
Product | extracellular solute-binding protein |
Protein accession | YP_001043242 |
Protein GI | 126462128 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0151381 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACCG ACCGACTTCA TCCCGCGGCG CTCATGCACA GGACCGAAGT GGCGAGGGGC CGCATGAGCC GACGCGAGTT CCTGACGCGG ACCACGGCGC TCGGTGTCTC GGCGGGCGCG GCCTATGCGC TTCTGGGCCT CGCGCAGCCT GTCCGCGCGC AGGAGACACC CCGCAAGGGC GGCACGCTCC GGATGGAGAT GGAGACGCGC GCGCTGAAGG ATCCGCGCAC CGCCGACTGG TCGCAGATCT CGAACGTCAC GCGCGGCTGG CTCGAATATC TCGTGGAGTA TGAGGCCGAC GGCACCTTCC GGCCGATGCT TCTCGAAAGC TGGGAGGCCA ACGACGATGC CACCGAATAT CTGCTGAAGG TCCGCCCCGG CGTCACCTGG TCGAACGGCG ATCCCTTCAC CGCCGAGGAT GTGCGGCACA ATTTCGAGCG CTGGTGCGAT GCCTCGGTCG AGGGCAATGC CATGGCGGCG CAGATGACCG CCCTTCAGGC CGAGGGCAAG CTGCGCACCG ATGCCATCGA GCTGGTGGAC GACACGACCC TGCGGCTGAA ACTCTCGCAG CCCGACATCG CGCTGATTGC GAACCTCGCC GACTATCCGG CCGCCGTGGT TCACAAAAGC TACGAAGGCG GCGACCCCGC GGCCAATCCG GTGGGGACCG GCCCCTACCT TCCCGAAACG ATCGAGGTCG GCATCCGCAT GGTGCTGGTG CGCAATGAGG CCCATCCCTG GTGGGGCACG GAGGTCTATG GCGGCCCTTG GCTCGACCGG ATCGAATATC TCGATTTCGG CTCCGATCCT TCCGCCGCGG TAGCCGCCGC GGGGTCGGGC GAGATCGACG CGACCTATCA GAGCGTGGGT GAATTCATCG ACGTGCTCGA CTCGCTCGGC TGGGACAAGT CCGAGGCCCG CACCGCCACG ACGCTGGCCA TCCGTTTCAA CCAGCAGGCC GAGGAGTACA AGGACGTCCG CGTTCGCCGC GCCCTGCAGA TGGCGGTCGA CAATGAGGTG GTTCTGGAGC TCGGCTATTC CGGGCACGGG CGGGTGGCCG AGAACCACCA TGTCTGTCCG ATCCATCCCG AATATGCCGA GCTGCCGCCC CTGACCGTGG ACCGCGCCGC GGCGCTGGCG CAACTGAAGG AGGCCGGCAT GGCCGAGCAC GAGTTCGAGC TCGTCTCGCT CGACGATGCC TGGCAGGCGG CCTCCTGCGA TGCGGTGGCG GCACAGCTGC GCGATGCCGG CATCGCCATC CGCCGCACGG TGCTTCCCGG CGCAACCTAC TGGAACGACT GGCTGAAGTT TCCCTTCTCG GCCACGGAAT GGAACATGCG CCCGCTGGGC GTGCAGGTGC TGGCGCTCGC CTATCGCTCG GGTGTGCCTT GGAACGAATC CGCCTTCTCG AACAAGGCGT TCGACGCGAA GCTCGATGAG GCCATGTCGC TCGTCGACCC GGACCGGCGC CGGGTGCTGA TGGCCGATCT CGAGCGGATC CTGCAGGAGG AGGGCGTGCT GATCCAGCCC TACTGGCGCT CGATCTTCCG CCATGTCGAT CCCAAGGTGA AGGGAGCCGA GGCGCATCCG ACCTTCGAGC ATCACCATTA CAAATGGTGG ATCGACGCCT GA
|
Protein sequence | MTTDRLHPAA LMHRTEVARG RMSRREFLTR TTALGVSAGA AYALLGLAQP VRAQETPRKG GTLRMEMETR ALKDPRTADW SQISNVTRGW LEYLVEYEAD GTFRPMLLES WEANDDATEY LLKVRPGVTW SNGDPFTAED VRHNFERWCD ASVEGNAMAA QMTALQAEGK LRTDAIELVD DTTLRLKLSQ PDIALIANLA DYPAAVVHKS YEGGDPAANP VGTGPYLPET IEVGIRMVLV RNEAHPWWGT EVYGGPWLDR IEYLDFGSDP SAAVAAAGSG EIDATYQSVG EFIDVLDSLG WDKSEARTAT TLAIRFNQQA EEYKDVRVRR ALQMAVDNEV VLELGYSGHG RVAENHHVCP IHPEYAELPP LTVDRAAALA QLKEAGMAEH EFELVSLDDA WQAASCDAVA AQLRDAGIAI RRTVLPGATY WNDWLKFPFS ATEWNMRPLG VQVLALAYRS GVPWNESAFS NKAFDAKLDE AMSLVDPDRR RVLMADLERI LQEEGVLIQP YWRSIFRHVD PKVKGAEAHP TFEHHHYKWW IDA
|
| |