Gene Information |
Locus tag | Rsph17025_2843 |
Symbol | |
ID | 5084221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 2897612 |
End bp | 2899219 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640484413 |
Product | extracellular solute-binding protein |
Protein accession | YP_001169034 |
Protein GI | 146278875 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
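The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the stated gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record: Start bp, End bp
length = gene_length(2897612, 2899219)
print(length, protein_length(length))  # prints "1608 535"
```

Both results agree with the Gene Length (1608 bp) and Protein Length (535 aa) fields listed in the record.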
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.430895 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAT CGCACCACCT GATGATGGAC GATCTGGTCA CGCGACTGCG ACGCGGACAG CTGTCGCGCC GCGAGTTTCT GGCCCGCAGT TCGGCGCTGC TGGCGGCCGG GGCCGTGGTC GGCCTGCCGG GTGGATTGCG CGCGCAGGAG GCCGCGCCCA AGGCCGGGGG CTTCATGCGT CTGGGTCTGC ACAATGCCTC TCAGAACGAC AATCTCGACC CCGGAAGCTG GTCGACGAGC TGGACCGGCG CCTCGTTCAA CGGCGGTGTC TATAACAACC TTGTCGAGAT CCTGCCCGAC GGCTCGGTCG CGGGCGATCT CGCCGAGAGC TGGGACGCCG AGCCCGGGGC AAAGGTCTGG CGCTTCAAGC TGCGGTCGGG CGTGACCTTC CACAACGGCA AGAGCCTCGA CGCCGAAGAT GTGCGCCAGT CGCTCGAACA CCACATGAAG CCGGACTCGA CCTCCGGGGC GCGCGCCATC GTCGAGCAGA TCGAGACGAT CGAGGTGGAG GGATCGGATA CGGTCCGCAT CACCCTCTCG GAGGGCAATG CCGATCTGCC CTATCTGCTG TCCGACTATC ACCTGTCGAT CTATCCCGCG CTCGACGGCG GCGGGATCGA CATGGAGAGC GCCAACGGCA CCGGGGCCTT CACCCTCGAG AGCTTCGAGC CGGGCATCGC CACCCGCCTC AAGCGCAACC CGAACTATCA CAAGAACAAC AAGCCCTATC TCGACGAGGT GGAGTTCATC AACATCACCG ACGCCACGGC CCGGCTGAAC GCGCTGCTGA CGGGCGAGGT CGATTTCATC CAGGATCTCG ACATCCGCAA CGTGGCGATG GTCGAGCGCA GCGGCGATTT CTCGGTGCAG CGCATCCCGA GCCTGCGCCA TTTCACCTTC GACATGGACA CGCGCGTTGC GCCCTTCGAC AATCCCGACG TGCGGCTGGC GCTGAAACAT GCGCTCGACC GGGACGATGT GATCGAGAAG GTGTTCCTGG GCGAGGCCAC GAAGGGCAAC GACAACCCGG TCGCTTCGAT CCAGAAGTTC CACCACGAAC TGCCCGCGCG CGACTACAGC GTCGAAAAGG CCAGGGAGCA TCTGGCGAAG GCCGGGCTCG ATCAGGTCAG CGTTGATCTG TCGGTCGCCG AGAATGCCTT TGCCGGCGCC ATCGAGGCGG CGACGCTCTA CCAGCGGCAT GCGGCCGAGG CCGGCATCAC GATCAATATC GTCCAGGAGG CGGCCGACGG CTACTGGGAG AATGTCTGGC GCAAGAAGCC CTTCTGCGCC GTCGATTACT TCGGCCGCGC CACGGTCGAC TGGCTCTTCT CGACGAGCTA TGTCACCGGA GCGCCGTGGA ACTCGGGCTG GTCGAACGCG CGGTTCGACG AGCTGCACCA GATGGCCCGC GCCGAGACCG ACGAGGCCAA GCGCATGGCC TGCTACGCCG AGATGCAGGA GATCCTGCGC GACGATGGCA ATGTCATCAC GGTGGCCTTC GTGAGCTGGC GCAATGCCGT CTCGAACCGC ATCGGCTTTG GCGAAGTCGG CGGGCTGATG CCGCTCGACA ACATGCGGAT GTGCGAGCGG TGGTGGGTCA AGGACTGA
|
Protein sequence | MNKSHHLMMD DLVTRLRRGQ LSRREFLARS SALLAAGAVV GLPGGLRAQE AAPKAGGFMR LGLHNASQND NLDPGSWSTS WTGASFNGGV YNNLVEILPD GSVAGDLAES WDAEPGAKVW RFKLRSGVTF HNGKSLDAED VRQSLEHHMK PDSTSGARAI VEQIETIEVE GSDTVRITLS EGNADLPYLL SDYHLSIYPA LDGGGIDMES ANGTGAFTLE SFEPGIATRL KRNPNYHKNN KPYLDEVEFI NITDATARLN ALLTGEVDFI QDLDIRNVAM VERSGDFSVQ RIPSLRHFTF DMDTRVAPFD NPDVRLALKH ALDRDDVIEK VFLGEATKGN DNPVASIQKF HHELPARDYS VEKAREHLAK AGLDQVSVDL SVAENAFAGA IEAATLYQRH AAEAGITINI VQEAADGYWE NVWRKKPFCA VDYFGRATVD WLFSTSYVTG APWNSGWSNA RFDELHQMAR AETDEAKRMA CYAEMQEILR DDGNVITVAF VSWRNAVSNR IGFGEVGGLM PLDNMRMCER WWVKD
|
| |
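The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before any analysis. The sketch below shows one way to normalize such a string, estimate its GC fraction, and check that a CDS ends in a stop codon. The helper names are hypothetical, and the stop codon set is the standard one for translation table 11; the short strings in the usage lines are the first and last blocks of the gene sequence in this record, not the full sequence.

```python
# Stop codons used by translation table 11 (bacterial, archaeal, plant plastid)
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq: str) -> str:
    """Remove all whitespace (block separators, newlines) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return normalize(seq)[-3:] in STOP_CODONS


# First two display blocks of the gene sequence
print(normalize("ATGAACAAAT CGCACCACCT"))
# Last two display blocks of the gene sequence
print(ends_with_stop("TGGTGGGTCA AGGACTGA"))
```

Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field in the record, which is reported as a rounded percentage.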