Gene Rsph17029_3169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3169 
Symbol 
ID4898881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp194533 
End bp196140 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content66% 
IMG OID640113771 
Productextracellular solute-binding protein 
Protein accessionYP_001045041 
Protein GI126463928 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.116403 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAT CGCACCACCT GCTGATGGAC GATCTGGTCA CGCGGCTGCG GCGCGGACAG 
CTGTCGCGTC GCGAGTTTCT GGCCCGCAGT TCGGCGCTGC TGGCGGCCGG GGCCATGAGC
GGGCTGCCCG GTGCGGCGCT TGCCCAGCAG GCGGCGCCGA AGGCCGGCGG CTTCATGCGG
CTCGGCCTGC ACAATGCCTC GCAGAACGAC AACCTCGATC CCGGCAGCTG GTCGACGAGC
TGGACCGGCG CCTCGTTCAA CGGCGGCGTC TACAACAACC TGGTCGAGAT CATGCCCGAC
GGCTCCGTCG CGGGCGATCT GGCCGAGAGC TGGGAGGCGG AGCCCGGCGC GAAGGTCTGG
CGCTTCAAGC TGCGCTCGGG CGTGACCTTC CACAACGGCA AGAGCCTCGA GGCGGAAGAC
GTGCGCCAGT CGCTCGAGCA TCACATGAAG CCGGACTCGA CCTCGGGCGC GCGCGCCATC
GTCGAGCAGA TCGAGACCAT CGACATCGAA GGGTCCGACA CCGTCCGCAT CACCCTGTCC
GAGGGCAATG CCGACCTGCC CTACCTCTTG TCGGATTATC ACCTCTCGAT CTATCCGGCG
CTGGAGGGCG GCGGGATCGA CATGGAGAGC GCCAACGGCA CCGGCGCCTT CCTCCTCGAG
AGCTTCGAGC CGGGCATCGC CACCCGCCTC AAGCGGAACC CGAACTATCA CAAGAACAAC
AAGCCCTATC TCGACGAGGT CGAGTTCATC AACATCACCG ACGCCACGGC GCGGCTGAAC
GCGCTGCTGA CCGGCGAGGT CGATTTCATC CAGGATCTCG ACATCCGCAA CGTGGCGATG
GTCGAGCGCA GCGGCGATTT CTCGGTTCAG CGTGTGCCGA GCCTGCGCCA CTTCACCTTC
GACATGGACA CCCGCGTGGC GCCCTTCGAC AATCCCGACG TGCGGCTGGC GCTGAAATAT
GCGCTCGACC GGGACGACGT GATCGAGAAG GTGTTCCTCG GCGAGGCCAC GAAGGGGAAC
GACAACCCGG TCGCCTCGAT CCAGAAGTTC TACCACGACA TGCCCGCGCG CGAATACAGC
ATCGCGAAGG CCAAGGAGCA TCTGGCCAAG GCCGGGCTCG ATCAGGTGAC TGTCGATCTC
TCGGTGGCCG AGAATGCCTT CGCGGGCGCC ATCGAGGCGG CGACCCTCTA CCAGCGCCAT
GCGGCCGAGG CCGGCATCAA CATCAACATC GTGCAGGAGG CGGCCGACGG CTACTGGGAG
AACGTCTGGC GCAAGAAGCC CTTCTGCGCG GTGGACTATT TCGGCCGCGC CACCGTCGAC
TGGCTGTTCT CGACGAGCTA TGTCACCGGC GCGCCGTGGA ATTCGGGCTG GTCGAACGCG
CGGTTCGACG AGCTGCACCA GACGGCACGG GCCGAGACCG ACGAGGCCAA GCGCGCCGCC
TGCTACGCCG AGATGCAGGA GATCCTGCGC GACGACGGCA ACGTCATCAC CGTGGCCTTC
GTGAGCTGGC GCAACGCCGT CTCGAACCGC ATCGGCTTCG GCGAGGTCGG CGGGCTGATG
CCGCTCGACA ACATGCGGAT GTGCGAGCGC TGGTGGGTCA AGGACTGA
 
Protein sequence
MNKSHHLLMD DLVTRLRRGQ LSRREFLARS SALLAAGAMS GLPGAALAQQ AAPKAGGFMR 
LGLHNASQND NLDPGSWSTS WTGASFNGGV YNNLVEIMPD GSVAGDLAES WEAEPGAKVW
RFKLRSGVTF HNGKSLEAED VRQSLEHHMK PDSTSGARAI VEQIETIDIE GSDTVRITLS
EGNADLPYLL SDYHLSIYPA LEGGGIDMES ANGTGAFLLE SFEPGIATRL KRNPNYHKNN
KPYLDEVEFI NITDATARLN ALLTGEVDFI QDLDIRNVAM VERSGDFSVQ RVPSLRHFTF
DMDTRVAPFD NPDVRLALKY ALDRDDVIEK VFLGEATKGN DNPVASIQKF YHDMPAREYS
IAKAKEHLAK AGLDQVTVDL SVAENAFAGA IEAATLYQRH AAEAGININI VQEAADGYWE
NVWRKKPFCA VDYFGRATVD WLFSTSYVTG APWNSGWSNA RFDELHQTAR AETDEAKRAA
CYAEMQEILR DDGNVITVAF VSWRNAVSNR IGFGEVGGLM PLDNMRMCER WWVKD