Gene RSP_3525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_3525 
Symbol 
ID3721940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007494 
Strand
Start bp604763 
End bp606370 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content66% 
IMG OID640073190 
ProductABC peptide transporter, periplasmic binding protein 
Protein accessionYP_355028 
Protein GI77465525 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAT CGCACCACCT GCTGATGGAC GATCTGGTCA CGCGGCTGCG GCGCGGACAG 
CTGTCGCGTC GCGAGTTTCT GGCCCGCAGT TCGGCGCTGC TGGCGGCCGG CGCCATGAGC
GGCCTGCCCG GTGCGGCGCT TGCCCAGCAG GCGGCACCGA AGGCCGGCGG CTTCATGCGG
CTCGGCCTGC ACAATGCCTC GCAGAACGAC AACCTCGATC CCGGCAGCTG GTCGACGAGT
TGGACCGGCG CCTCGTTCAA CGGCGGCGTC TACAACAACC TCGTCGAGAT CCTGCCCGAC
GGCTCGGTCG CGGGCGATCT GGCCGAGAGC TGGGAGGCGG AGCCCGGCGC GAAGGTCTGG
CGCTTCAAGC TGCGCTCGGG CGTGACCTTC CACAACGGCA AGAGCCTCGA GGCGGAAGAC
GTGCGCCAGT CGCTCGAGCA TCACATGAAG CCGGACTCGA CCTCGGGCGC GCGCGCCATC
GTCGAGCAGA TCGAGACCAT CGACATCGAA GGGTCCGACA CCGTCCGCAT CACCCTCTCG
GAGGGCAATG CCGACCTGCC CTACCTCCTG TCGGATTATC ACCTCTCGAT CTATCCGGCG
CTGGAGGGCG GCGGGATCGA CATGGAGAGC GCCAACGGCA CCGGCGCCTT CCTCCTCGAG
AGCTTCGAGC CGGGCATCGC CACCCGCCTC AAGCGGAACC CGAACTACCA CAAGAACAAC
AAGCCCTATC TCGACGAGGT CGAGTTCATC AACATCACCG ACGCCACGGC GCGGCTGAAC
GCGCTGCTGA CCGGCGAGGT CGATTTCATC CAGGATCTCG ACATCCGCAA CGTGGCGATG
GTCGAGCGCA GCGGCGATTT CTCGGTTCAG CGCGTGCCGA GCCTGCGCCA CTTCACCTTC
GACATGGACA CCCGCGTCGC GCCCTTCGAC AATCCCGATG TGCGGCTGGC GCTGAAATAT
GCGCTCGACC GGGATGACGT GATCGAGAAG GTGTTCCTTG GCGAGGCCAC GAAGGGGAAC
GACAACCCGG TCGCCTCGAT CCAGAAATTC TACCACGACA TGCCCGCGCG CGAATACAGC
ATCGCGAAGG CCAAGGAGCA TCTGGCCAAG GCCGGGCTCG ATCAGGTGAG CGTCGATCTG
TCGGTGGCCG AGAATGCGTT TGCGGGCGCC ATCGAGGCGG CGACGCTCTA CCAGCGCCAT
GCGGCCGAGG CCGGCATCAA CATCAACATC GTGCAGGAGG CGGCCGACGG CTACTGGGAG
AACGTCTGGC GCAAGAAGCC CTTCTGCGCG GTCGACTACT TCGGCCGCGC CACCGTCGAC
TGGCTGTTCT CGACGAGCTA TGTCACCGGC GCGCCGTGGA ATTCGGGCTG GTCGAACGCG
CGGTTCGACG AGCTGCACCA GACGGCGCGG GCCGAGACCG ACGAGGCCAA GCGCGCCGCC
TGCTACGCCG AAATGCAGGA GATCCTGCGC GACGACGGCA ACGTCATCAC CGTGGCCTTC
GTGAGCTGGC GCAACGCCGT CTCGAACCGC ATCGGCTTCG GCGAGGTCGG CGGGCTGATG
CCGCTCGACA ACATGCGGAT GTGCGAGCGC TGGTGGGTCA AGGACTGA
 
Protein sequence
MNKSHHLLMD DLVTRLRRGQ LSRREFLARS SALLAAGAMS GLPGAALAQQ AAPKAGGFMR 
LGLHNASQND NLDPGSWSTS WTGASFNGGV YNNLVEILPD GSVAGDLAES WEAEPGAKVW
RFKLRSGVTF HNGKSLEAED VRQSLEHHMK PDSTSGARAI VEQIETIDIE GSDTVRITLS
EGNADLPYLL SDYHLSIYPA LEGGGIDMES ANGTGAFLLE SFEPGIATRL KRNPNYHKNN
KPYLDEVEFI NITDATARLN ALLTGEVDFI QDLDIRNVAM VERSGDFSVQ RVPSLRHFTF
DMDTRVAPFD NPDVRLALKY ALDRDDVIEK VFLGEATKGN DNPVASIQKF YHDMPAREYS
IAKAKEHLAK AGLDQVSVDL SVAENAFAGA IEAATLYQRH AAEAGININI VQEAADGYWE
NVWRKKPFCA VDYFGRATVD WLFSTSYVTG APWNSGWSNA RFDELHQTAR AETDEAKRAA
CYAEMQEILR DDGNVITVAF VSWRNAVSNR IGFGEVGGLM PLDNMRMCER WWVKD