Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_2844 |
Symbol | |
ID | 5084222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 2899302 |
End bp | 2900906 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640484414 |
Product | extracellular solute-binding protein |
Protein accession | YP_001169035 |
Protein GI | 146278876 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.922612 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.415189 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTACC CCACGGGCCC CATCGGCGGG CCGATCCCGG TGCGGGCCCT GGGCCTCGCA CCCTCGCGGC GGGCCTTTCT CGGCGGCGCG GCCGTCCTGG CCGGCGCCTT CTGCCTGCCC GCCTCGCTCC GGGCCGAGGA AGGGCCGAAG CGGGGCGGCC GGCTGCGCTA CGGCGTCAAT GACGGCTCGC AGCAGGATTC GCTCGAGCCC GGAAGCTGGG CGACGGTGAT GTGCGGCGCG GCCTTCAACG GGGCGCTCTG CAACAACCTC GTCGAGCTTT TGCCGGACGG GTCGCTGGCG GGCGATCTGG CCGAACGCTG GGAGGAGGCG GAGGGGGCGA CCCGCTGGAC CTTCACGCTG CGCAAGGGCG TGACCTTCCA CGACGGACGC CCCTTCACGC CCGAGGATGC GCGGCAATCG CTGCTTCACC ACATGGGCGA AGACAGCACC TCGGGCGCGC TCGCCATCGT CAGCCAGATC AAGGAGATCG CGGTCGAGGG CGAGGATCGG CTGATCGTGA CACTCACGCA GGGCAATGCC GACTTCCCCT ATCTGCTGTC AGACTATCAC CTCTCGATCT TCCCGGCGAA GGAGGGGGGC GGCATCGACT GGGAAAGCGG CATCGGCACC GGCGCCTTCC GGCTGGACAG TTTCGAGCCC GGCGTCGCGG TCCGGCTGGT CCGCAATCCC CGCTATCACA AGCCCGGCCT GCCGCATTTC GACGAGGTCG AGTTCATCGC GATCCCCGAC CGGGCGGCCC GGCTGAATGC GCTGCTGACC GGCGAGGTCG ATGTGATCGA GGATCTCGAC ATCCGCAACG TCCCCCTCAT CGAGCGCAAC CCCGATCTGG TGCTGCACCG CACGCCCAGC CTGCGGCACC TGACCTTTGA CATGAACTGC CAGACGGCGC CCTTCGACCA TCCGGCGGTG CGTCAGGCCC TGAAGCTCAG CCTCGACCGC GAGGATGTGA TCGCCAAGGT CTTCCTCGGC GAGGGCGAGA CCGGCAACGA CAATCCGGTG GCGCGGATCA TGCCCTTCTG GGCCGAGACG CCGCCCGAGC ACCGCTACGA CCCCGAGGCC GCGCGGGCGC TTCTGGCCGA GGCCGGGATC GAGGGGCTCA CGGTCGATCT CTCGGTCGCG GAGTCGGCCT TCCCCGGCGC GGTCGAGGCG GGGGTGCTGT TCCGCGAGCA TGCGGCGCGC GCCGGCATCA CCATCAACCT CGTGCAGGAG GCCGATGACG GCTACTGGGA CAATGTCTGG CTGGTGAAGC CCTTCAACGC CGCGGACTGG TACGGGCGGG TCACGCTCGA CTGGCTGTTC GCGACCTCCT ACACGTCCGA CGCACCCTGG AACAACACGG GGTTCAGGAA CGCCCGCTTC GACGAGCTGC ACGCCAAGGC CCGGTCGGAG ACCGAACCCG CAAAGCGCGG CGCGCAGTAT GCCGAGATGC AGCAGATCCT GCATAACGAG GGCGGCGTGA TCACGGTGGC CTTCGTCTCC TGGCTCCTCG CCATGTCCCG TGCCATCGGC CATGGCGAGA CCGGAGGGAT CCTGCCCGCC GACAATCACC GCTGCGCCGA GCGATGGTGG CGCACCGACA TCTGA
|
Protein sequence | MRYPTGPIGG PIPVRALGLA PSRRAFLGGA AVLAGAFCLP ASLRAEEGPK RGGRLRYGVN DGSQQDSLEP GSWATVMCGA AFNGALCNNL VELLPDGSLA GDLAERWEEA EGATRWTFTL RKGVTFHDGR PFTPEDARQS LLHHMGEDST SGALAIVSQI KEIAVEGEDR LIVTLTQGNA DFPYLLSDYH LSIFPAKEGG GIDWESGIGT GAFRLDSFEP GVAVRLVRNP RYHKPGLPHF DEVEFIAIPD RAARLNALLT GEVDVIEDLD IRNVPLIERN PDLVLHRTPS LRHLTFDMNC QTAPFDHPAV RQALKLSLDR EDVIAKVFLG EGETGNDNPV ARIMPFWAET PPEHRYDPEA ARALLAEAGI EGLTVDLSVA ESAFPGAVEA GVLFREHAAR AGITINLVQE ADDGYWDNVW LVKPFNAADW YGRVTLDWLF ATSYTSDAPW NNTGFRNARF DELHAKARSE TEPAKRGAQY AEMQQILHNE GGVITVAFVS WLLAMSRAIG HGETGGILPA DNHRCAERWW RTDI
|
| |