Gene RSP_3524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_3524 
Symbol 
ID3721939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007494 
Strand
Start bp603069 
End bp604673 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content68% 
IMG OID640073189 
ProductABC peptide transporter, periplasmic binding protein 
Protein accessionYP_355027 
Protein GI77465524 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.413747 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTACC CCACGGGCCC CATCGGCGGG CAGATCCTTG CGCAGGCCAT GAGCCTCGCC 
CCCTCGCGAC GGGCCTTTCT GGGGGGCGCG GCCGCCGTGG CCGGCGCCTT CTGCCTGCCC
GCCTCGCTGC GAGCCGAGGA AGGGCCGAAG CGGGGCGGCC GGCTCCGCTA CGGCGTCAAC
GACGGCTCGC AGCAGGATTC GCTCGAGCCC GGCAGCTGGG CCACCGTCAT GTGCGGTGCG
GCCTTCAACG GCGCGCTCTG CAACAACCTC GTCGAGCTTC TGCCGGACGG GTCGCTGGCG
GGCGATCTCG CCGAAAGCTG GGAGGAGGCC GAGGGTGCCA CCCGCTGGAC CTTCACGCTC
CGCAAGGGTG TCCTGTTCCA CGACGGCCGC CCCTTCACCC CGGAGGATGC CCGGCAGTCG
CTGATGCATC ACATGGGCGA GGGCAGCACC TCGGGCGCGC TCGCCATCGT CAGCCAGATC
AAGGAGATCG CCGTCGAGGG CGAGGACCGG CTGATCGTGA CCCTCACGCA GGGCAATGCC
GACTTCCCCT ATCTGCTGTC GGATTATCAC CTCTCGATCT TCCCGGCGAA GGAGGGCGGC
GGCATCGACT GGGAGAGCGG CATCGGCACC GGCGCCTTCA AGCTCGACAG TTTCGAGCCG
GGCGTCGCGG TCCGACTGCT CCGCAATCCG AACTATCACA AGCCCGGCCT GCCGCATTTC
GACGAGGTCG AATTCATCGC GATCCCCGAC CGGTCCGCGC GGCTGAATGC GCTGCTGACC
GGCGAGGTCG ATGTGATCGA GGATGTCGAC ATCCGCAACG TCCCCCTGAT CGAGCGCAAT
CCCGATCTGG TGCTGCACCG CACGCCGAGC CTGCGGCACC TGACCTTCGA CATGAACTGC
CAAACGGCGC CCTTCGACAA TCCGGTCGTG CGCAAGGCCC TGAAGCTCAG CCTCGACCGC
GAGGATGTGA TCGCCAAGGT GTTCCTCGGC GAGGCCGAGA CGGGGAACGA CAACCCGGTG
GCGCGCATCA TGCCCTTCTG GGCCGAGACG CCGCCCGAGC ACCGCTACGA TCCCGAGGCC
GCGCGGGCGC TTCTGGCCGA GGCCGGGATC GAGGGGCTGA CGGTCGATCT CTCGGTGGCC
GAATCCGCCT TTCCCGGTGC GGTCGAAGCG GGGGTCCTTT TCCGCGAACA TGCCGCCAAG
GCCGGCATCA CGATCAACCT CGTGCAGGAG GCCGATGACG GCTACTGGGA CAATGTCTGG
CTGGTGAAGC CCTTCAACGC CGCCGACTGG TACGGGCGGG TCACGCTCGA CTGGCTGTTC
GCCACCTCCT ACACCTCCGA CGCGCCCTGG AACAACACGG GGTTCAAGAA CGCCCGCTTC
GACGAGCTGC ATGCGGCGGC GCGGTCGGAG ACCGATCCCG CCACGCGGGG CGAACAGTAT
GCCGAGATGC AGCAGATCCT GCACGACGAC GGCGGCGTGA TCACGGTGGC CTTCGTGTCC
TGGCTGCTCG CCATGTCGCG CGCCATCGGC CATGGTGAGA CCGGAGGCAT CCTGCCCGCC
GACAATCATC GCTGCGCCGA GCGGTGGTGG CGCACCGACG TCTGA
 
Protein sequence
MRYPTGPIGG QILAQAMSLA PSRRAFLGGA AAVAGAFCLP ASLRAEEGPK RGGRLRYGVN 
DGSQQDSLEP GSWATVMCGA AFNGALCNNL VELLPDGSLA GDLAESWEEA EGATRWTFTL
RKGVLFHDGR PFTPEDARQS LMHHMGEGST SGALAIVSQI KEIAVEGEDR LIVTLTQGNA
DFPYLLSDYH LSIFPAKEGG GIDWESGIGT GAFKLDSFEP GVAVRLLRNP NYHKPGLPHF
DEVEFIAIPD RSARLNALLT GEVDVIEDVD IRNVPLIERN PDLVLHRTPS LRHLTFDMNC
QTAPFDNPVV RKALKLSLDR EDVIAKVFLG EAETGNDNPV ARIMPFWAET PPEHRYDPEA
ARALLAEAGI EGLTVDLSVA ESAFPGAVEA GVLFREHAAK AGITINLVQE ADDGYWDNVW
LVKPFNAADW YGRVTLDWLF ATSYTSDAPW NNTGFKNARF DELHAAARSE TDPATRGEQY
AEMQQILHDD GGVITVAFVS WLLAMSRAIG HGETGGILPA DNHRCAERWW RTDV