Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_2156 |
Symbol | |
ID | 5082934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 2184746 |
End bp | 2186377 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640483719 |
Product | extracellular solute-binding protein |
Protein accession | YP_001168351 |
Protein GI | 146278192 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.470572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGCG AGAGACTGCA TCCCGCGGCC CTGATGCATC AGGTCGAAGT TGCGGCGGGC CGGATGAGCC GACGCGAGTT CCTGAGCCGG GCCACCGCGC TCGGCGTTTC GGCGGGCGCG GCCTACGGCC TGCTGGGGCT GGCATCCCCC GCCCGCGCGC AAGAGGCGCC GCGGCCCGGC GGCACCCTGC GCATGGAGAT GGAGACCCGG GCGCTCAAGG ATCCGCGGAC GGCCGACTGG TCGCAGATCG CGAACTTCAC CCGCGGCTGG CTGGAGTATC TGGTGGAATA CCAGGCCGAC GGAACCTTCC GGCCGATGCT CCTCGAAAGC TGGGAGGCCA ACGACAACGC CACGGAATAC CGGCTGAACG TCCGCCCCGG CGTGCGCTGG AGCAACGGCG ACCCCTTCAC CGCCGAGGAT GTCCGGCACA ATTTCGAGCG CTGGTGCGAC GCCTCGGTCG AGGGGAACGC GATGGCGGCC CAGATGGTGG CCCTGCAGGC CGAGGGCAAG CTGCGGCCGG ACGCGATCGA AGTGGTGGAC GAGACGACAC TCCGGCTGAC ACTCTCGCAG CCCGACATCG CGCTGATTGC GAACCTCGCC GACTATCCGG CTGCGATCGT CCACCCCTCC TACGAGGGCG GCGACCCGGC GGCCAATCCC GTCGGAACCG GGCCCTATCG GCCGGAAACG GTGGAGGTGG GCATCCGCAT GGTGCTGGTC CGCAACGAGG CGCATCCCTG GTGGGGCAGC GACGTCCACG GCGGCCCCTG GCTCGACCGG ATCGAATATC TCGATTTCGG CAGCGATCCC TCGGCCGCCG TGGCCGCGGC GGGATCGGGC GAGATCGACG CGACCTATCA GACCGTGGGC GAGTTCATCG AGGTGCTGGA CGGGCTTGGC TGGGACAAGT CCGAAGCCCG CACCGCGACC ACCCTTGCGA TCCGGTTCAA CCAGCAGTCC GAGGAGTACC GCGACGTCCG CGTGCGCCGG GCGCTGCAGA TGGCGGTGGA CAATGCGGTG GTGCTGGAAC TGGGCTACTC GGGCCACGGG CATGTGGCCG AGAACCACCA CGTCAGCCCG ATCCATCCCG AATATGCCGA GCTGCCGCCG CTGACCGTCG ATCGCGCGCA GGCCCGGGCC CTGCTGGAAG AGGCCGGGAT GGACGGCCAC GAGTTCGAGC TGGTCTCGCT CGACGACGCC TGGCAGGCGG CCTCGTGCGA CGCCGTGGCG GCGCAGTTGC GGGACGCCGG CGTGTCGATC CGGCGGACGG TCCTGCCCGG CGCGACCTAC TGGAACGACT GGCTGAAGTT CCCCTTCTCG GCGACCGAGT GGAACATGCG CCCGCTGGGG GTGCAGGTTC TGGCGCTCGC CTATCGCTCG GGGGTGCCCT GGAACGAATC CGGCTTTGCC AACAAGGAGT TCGACGCCAA GCTTGATGAG GCCATGTCGA TCGTCGATCC CGACGGGCGC CGCACCCTCA TGGCCGATCT CGAGCGGATC CTGCAGGAGG AGGGCGTGCT GATCCAGCCC TACTGGCGCT CGATCTTCCG CCATGTCGAT CCGAAGGTGA AGGGTGCCGA GGCCCACCCG ACCTTCGAGC ATCACCATTA CAAGTGGTGG ATCGACGCCT GA
|
Protein sequence | MTSERLHPAA LMHQVEVAAG RMSRREFLSR ATALGVSAGA AYGLLGLASP ARAQEAPRPG GTLRMEMETR ALKDPRTADW SQIANFTRGW LEYLVEYQAD GTFRPMLLES WEANDNATEY RLNVRPGVRW SNGDPFTAED VRHNFERWCD ASVEGNAMAA QMVALQAEGK LRPDAIEVVD ETTLRLTLSQ PDIALIANLA DYPAAIVHPS YEGGDPAANP VGTGPYRPET VEVGIRMVLV RNEAHPWWGS DVHGGPWLDR IEYLDFGSDP SAAVAAAGSG EIDATYQTVG EFIEVLDGLG WDKSEARTAT TLAIRFNQQS EEYRDVRVRR ALQMAVDNAV VLELGYSGHG HVAENHHVSP IHPEYAELPP LTVDRAQARA LLEEAGMDGH EFELVSLDDA WQAASCDAVA AQLRDAGVSI RRTVLPGATY WNDWLKFPFS ATEWNMRPLG VQVLALAYRS GVPWNESGFA NKEFDAKLDE AMSIVDPDGR RTLMADLERI LQEEGVLIQP YWRSIFRHVD PKVKGAEAHP TFEHHHYKWW IDA
|
| |