Gene RSP_3747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_3747 
Symbol 
ID3721506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007494 
Strand
Start bp872050 
End bp873228 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content68% 
IMG OID640073417 
Productputative dipeptidase 
Protein accessionYP_355254 
Protein GI77465751 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.93962 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGTT ATTTTTCCCG ATCCGAGTAC GAGCGCCGCT GGCAGAAGGC CGAGGCGCTG 
ATGGCCGAGC GCGGCTTCGA GACGGCTGTC GTCTTCTCGC GCGGCGGCGG GACGACCGAC
AATTGCGGCG ACGTGCTCTA TCTGGCGAAC CACTATTCGG TCAGCGGGGG CACCGATTCG
ACGATCTGGT CGGCGCGGTC CTTCTCGGCG GTGATCCTGC GCCGCGGGCA GGAGCCCGAG
CTGCATATCG ACGAGCCCGA GGGACGCGCG GATCTCCTCG CCGTGGACCG GGTGGCCTGC
CACAACCATC CGTTCATCGG TGTGGCCGAA GCGCTGGTGG CGCGCGGCGT CACCGGGCGC
GTCGCGCTCT GCGGGACCCA GTTCATCCCG GTGAAATATT ACCAGCAGCT CGTGTCGCGG
ACGCCGGGGA TCGAATGGGT CGAGGCCGAT GACCTGATCC GCAGCCTGCG CCGGATCAAG
AGCGCGGAAG AACTCGACTG CTACCGGATC GCGGGCGAGG CGGCGACCGA GGCCACCACG
GTTCTGATGC AGGGCCTCCT GTCGGGGTTG TCCGAGCGCG AGGCGGCCGG CGAGGCCGCC
CGCGTGACCG TGGCGCGCGG CGGGCGGGTG CAGGCGATCG GCACCAACCA CGGCGACACG
ATGCAGTATG ACTACCGCAA CCCGCTCACG GGCTCGAGCG CCGACACGCC GGCGGTGGGC
GACATGGTGC GCGGCACGGT CCATGCGGCC TTCTTCCAGG GCTATTATCT CGATCCCGGC
CGCACCGCGG TGCGCGGCAC CCCCACTGCC GATCAGCGGC GGTTGATCGA GGCCACCAAC
GACATCGTCC AGCGGCTGAT CGGCATGATG CGCCCCGGCG CGCGTCTCCT TGACGTGGCG
GCCGAGGGGG ACCGGATGAC ACAGGCCTTC GGCGGCGAGA TCTCTCCGCT GATGAAGAAC
TTCCCCTTCT ACGGCCACGG GATCGGCCTC TCGTTCGAGC AGCCGCGGAT CTCGACCGCC
ATGTCGCTGC CGGGCGATGT GGTCGAGGAG AACATGGTCT TCGGCGTCGA GGCCTTCCTC
GCCCTCGAGG GCGTGGGGTC GGCCTTCTTC GAGGACATCG TGATCGTGAC GGCAGGCACC
CCCGAACTCC TCACCCGCAC CCCCCATTAT TTCTGGTGA
 
Protein sequence
MSRYFSRSEY ERRWQKAEAL MAERGFETAV VFSRGGGTTD NCGDVLYLAN HYSVSGGTDS 
TIWSARSFSA VILRRGQEPE LHIDEPEGRA DLLAVDRVAC HNHPFIGVAE ALVARGVTGR
VALCGTQFIP VKYYQQLVSR TPGIEWVEAD DLIRSLRRIK SAEELDCYRI AGEAATEATT
VLMQGLLSGL SEREAAGEAA RVTVARGGRV QAIGTNHGDT MQYDYRNPLT GSSADTPAVG
DMVRGTVHAA FFQGYYLDPG RTAVRGTPTA DQRRLIEATN DIVQRLIGMM RPGARLLDVA
AEGDRMTQAF GGEISPLMKN FPFYGHGIGL SFEQPRISTA MSLPGDVVEE NMVFGVEAFL
ALEGVGSAFF EDIVIVTAGT PELLTRTPHY FW