Gene Rsph17025_2843 details


Gene Information

Locus tag: Rsph17025_2843
Symbol:
ID: 5084221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 2897612
End bp: 2899219
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 65%
IMG OID: 640484413
Product: extracellular solute-binding protein
Protein accession: YP_001169034
Protein GI: 146278875
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
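
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. The function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record.
length = gene_length(2897612, 2899219)
print(length, protein_length(length))  # prints: 1608 535
```

Both values agree with the Gene Length and Protein Length fields.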


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.430895
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAAT CGCACCACCT GATGATGGAC GATCTGGTCA CGCGACTGCG ACGCGGACAG 
CTGTCGCGCC GCGAGTTTCT GGCCCGCAGT TCGGCGCTGC TGGCGGCCGG GGCCGTGGTC
GGCCTGCCGG GTGGATTGCG CGCGCAGGAG GCCGCGCCCA AGGCCGGGGG CTTCATGCGT
CTGGGTCTGC ACAATGCCTC TCAGAACGAC AATCTCGACC CCGGAAGCTG GTCGACGAGC
TGGACCGGCG CCTCGTTCAA CGGCGGTGTC TATAACAACC TTGTCGAGAT CCTGCCCGAC
GGCTCGGTCG CGGGCGATCT CGCCGAGAGC TGGGACGCCG AGCCCGGGGC AAAGGTCTGG
CGCTTCAAGC TGCGGTCGGG CGTGACCTTC CACAACGGCA AGAGCCTCGA CGCCGAAGAT
GTGCGCCAGT CGCTCGAACA CCACATGAAG CCGGACTCGA CCTCCGGGGC GCGCGCCATC
GTCGAGCAGA TCGAGACGAT CGAGGTGGAG GGATCGGATA CGGTCCGCAT CACCCTCTCG
GAGGGCAATG CCGATCTGCC CTATCTGCTG TCCGACTATC ACCTGTCGAT CTATCCCGCG
CTCGACGGCG GCGGGATCGA CATGGAGAGC GCCAACGGCA CCGGGGCCTT CACCCTCGAG
AGCTTCGAGC CGGGCATCGC CACCCGCCTC AAGCGCAACC CGAACTATCA CAAGAACAAC
AAGCCCTATC TCGACGAGGT GGAGTTCATC AACATCACCG ACGCCACGGC CCGGCTGAAC
GCGCTGCTGA CGGGCGAGGT CGATTTCATC CAGGATCTCG ACATCCGCAA CGTGGCGATG
GTCGAGCGCA GCGGCGATTT CTCGGTGCAG CGCATCCCGA GCCTGCGCCA TTTCACCTTC
GACATGGACA CGCGCGTTGC GCCCTTCGAC AATCCCGACG TGCGGCTGGC GCTGAAACAT
GCGCTCGACC GGGACGATGT GATCGAGAAG GTGTTCCTGG GCGAGGCCAC GAAGGGCAAC
GACAACCCGG TCGCTTCGAT CCAGAAGTTC CACCACGAAC TGCCCGCGCG CGACTACAGC
GTCGAAAAGG CCAGGGAGCA TCTGGCGAAG GCCGGGCTCG ATCAGGTCAG CGTTGATCTG
TCGGTCGCCG AGAATGCCTT TGCCGGCGCC ATCGAGGCGG CGACGCTCTA CCAGCGGCAT
GCGGCCGAGG CCGGCATCAC GATCAATATC GTCCAGGAGG CGGCCGACGG CTACTGGGAG
AATGTCTGGC GCAAGAAGCC CTTCTGCGCC GTCGATTACT TCGGCCGCGC CACGGTCGAC
TGGCTCTTCT CGACGAGCTA TGTCACCGGA GCGCCGTGGA ACTCGGGCTG GTCGAACGCG
CGGTTCGACG AGCTGCACCA GATGGCCCGC GCCGAGACCG ACGAGGCCAA GCGCATGGCC
TGCTACGCCG AGATGCAGGA GATCCTGCGC GACGATGGCA ATGTCATCAC GGTGGCCTTC
GTGAGCTGGC GCAATGCCGT CTCGAACCGC ATCGGCTTTG GCGAAGTCGG CGGGCTGATG
CCGCTCGACA ACATGCGGAT GTGCGAGCGG TGGTGGGTCA AGGACTGA
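
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the space-separated 10-base blocks used here. The helper names are hypothetical, and only the first line of the sequence is used as a demonstration, so the printed value is not the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full sequence
# to compare against the GC content field of the record.
first_line = "ATGAACAAAT CGCACCACCT GATGATGGAC GATCTGGTCA CGCGACTGCG ACGCGGACAG"
print(f"{gc_content(first_line):.1f}% GC over {len(clean_sequence(first_line))} bp")
```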
 
Protein sequence
MNKSHHLMMD DLVTRLRRGQ LSRREFLARS SALLAAGAVV GLPGGLRAQE AAPKAGGFMR 
LGLHNASQND NLDPGSWSTS WTGASFNGGV YNNLVEILPD GSVAGDLAES WDAEPGAKVW
RFKLRSGVTF HNGKSLDAED VRQSLEHHMK PDSTSGARAI VEQIETIEVE GSDTVRITLS
EGNADLPYLL SDYHLSIYPA LDGGGIDMES ANGTGAFTLE SFEPGIATRL KRNPNYHKNN
KPYLDEVEFI NITDATARLN ALLTGEVDFI QDLDIRNVAM VERSGDFSVQ RIPSLRHFTF
DMDTRVAPFD NPDVRLALKH ALDRDDVIEK VFLGEATKGN DNPVASIQKF HHELPARDYS
VEKAREHLAK AGLDQVSVDL SVAENAFAGA IEAATLYQRH AAEAGITINI VQEAADGYWE
NVWRKKPFCA VDYFGRATVD WLFSTSYVTG APWNSGWSNA RFDELHQMAR AETDEAKRMA
CYAEMQEILR DDGNVITVAF VSWRNAVSNR IGFGEVGGLM PLDNMRMCER WWVKD
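
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is illustrative and not the database's own pipeline. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), builds the codon table by hand, and uses hypothetical names throughout. Only the first 30 bases are translated as a demonstration.

```python
from itertools import product

# Amino acids in NCBI codon order (bases T, C, A, G at each position);
# this string is shared by translation tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is ignored, a trailing partial codon is dropped, and
    codons with ambiguous bases are rendered as 'X'.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAACAAAT CGCACCACCT GATGATGGAC"))  # prints: MNKSHHLMMD
```

The output matches the first ten residues of the protein sequence listed above.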