Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3168 |
Symbol | |
ID | 4899167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 192839 |
End bp | 194443 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640113770 |
Product | extracellular solute-binding protein |
Protein accession | YP_001045040 |
Protein GI | 126463927 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.114782 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTACC CCACGGGCCC CATCGGCGGG CAGATCCTTG CGCAGGCCAT GAGCCTCGCC CCCTCGCGAC GGGCCTTTCT GGGGGGCGCG GCCGCCGTAG CCGGCGCCTT CTGCCTGCCC GCCTCGCTGC GAGCCGAGGA GGGGCCGAAG CAGGGCGGCC GGCTCCGCTA CGGCGTCAAC GACGGCTCGC AGCAGGATTC GCTCGAGCCC GGCAGCTGGG CCACCGTCAT GTGCGGTGCG GCCTTCAACG GCGCGCTCTG CAACAACCTC GTCGAGCTTC TGCCGGACGG GTCGCTGGCG GGCGATCTCG CCGAAAGCTG GGAGGAGGCC GAGGGTGCCA CCCGCTGGAC CTTCACGCTC CGCAAGGGTG TCCTGTTCCA CGACGGCCGC CCCTTCACCC CGGAGGATGC CCGGCAGTCG CTGATGCATC ACATGGGCGA GGGCAGCACC TCGGGCGCGC TCGCCATCGT CAGCCAGATC AGGGAGATCG CCGTCGAGGG TGAGGACCGG CTGATCGTGA CCCTCACGCA GGGCAATGCC GACTTCCCCT ATCTGCTGTC GGATTATCAC CTCTCGATCT TCCCGGCGAA GGAGGGCGGC GGCATCGACT GGGAGAGCGG CATCGGCACC GGCGCCTTCA AGCTCGACAG TTTCGAGCCG GGCGTCGCGG TCCGACTGCT CCGCAATCCG AACTATCACA AGCCCGGCCT GCCGCATTTC GACGAGGTCG AATTCATCGC GATCCCCGAC CGGTCCGCGC GGCTGAATGC GCTGCTGACC GGCGAGGTCG ATGTGATCGA GGATGTCGAC ATCCGCAACG TCCCCCTGAT CGAGCGCAAT CCCGATCTGG CGCTGCACCG CACGCCGAGC CTGCGGCACC TGACCTTCGA CATGAACTGC CAAACGGCGC CCTTCGACAA TCCGGTCGTG CGCAAGGCCC TGAAGCTCAG CCTCGACCGC GAGGATGTGA TCGCCAAGGT GTTCCTCGGC GAGGCCGAGA CGGGGAACGA CAACCCGGTG GCGCGCATCA TGCCCTTCTG GGCCGAGACG CCGCCCGAGC ACCGCTACGA TCCCGAGGCC GCGCGGGCGC TTCTGGCCGA GGCCGGGATC GAGGGGCTGA CGGTCGATCT CTCGGTGGCC GAATCCGCCT TCCCCGGCGC GGTCGAAGCG GGGGTGCTCT TCCGCGAACA TGCCGCCAAG GCCGGCATCA CGATCAACCT CGTGCAGGAG GCCGATGACG GCTACTGGGA CAATGTCTGG CTGGTGAAGC CCTTCAACGC CGCGGACTGG TACGGGCGGG TCACGCTCGA CTGGCTGTTC GCCACCTCCT ACACCTCCGA CGCGCCCTGG AACAACACGG GGTTCAAGAA CGCCCGCTTC GACGAGCTGC ATGCGGCGGC GCGGTCGGAG ACCGATCCCG CCACGCGGGG CGGGCACTAT GCCGAGATGC AGCAGATCCT GCATGACGAC GGCGGCGTGA TCACGGTGGC CTTCGTGTCG TGGCTGCTCG CCATGTCGCG CGCCATCGGC CATGGTGAGA CCGGAGGCAT CCTGCCCGCC GACAATCACC GCTGCGCCGA GCGGTGGTGG CGCACCGACG TCTGA
|
Protein sequence | MRYPTGPIGG QILAQAMSLA PSRRAFLGGA AAVAGAFCLP ASLRAEEGPK QGGRLRYGVN DGSQQDSLEP GSWATVMCGA AFNGALCNNL VELLPDGSLA GDLAESWEEA EGATRWTFTL RKGVLFHDGR PFTPEDARQS LMHHMGEGST SGALAIVSQI REIAVEGEDR LIVTLTQGNA DFPYLLSDYH LSIFPAKEGG GIDWESGIGT GAFKLDSFEP GVAVRLLRNP NYHKPGLPHF DEVEFIAIPD RSARLNALLT GEVDVIEDVD IRNVPLIERN PDLALHRTPS LRHLTFDMNC QTAPFDNPVV RKALKLSLDR EDVIAKVFLG EAETGNDNPV ARIMPFWAET PPEHRYDPEA ARALLAEAGI EGLTVDLSVA ESAFPGAVEA GVLFREHAAK AGITINLVQE ADDGYWDNVW LVKPFNAADW YGRVTLDWLF ATSYTSDAPW NNTGFKNARF DELHAAARSE TDPATRGGHY AEMQQILHDD GGVITVAFVS WLLAMSRAIG HGETGGILPA DNHRCAERWW RTDV
|
| |