Gene Rsph17029_3478 details

Gene Information

Locus tag: Rsph17029_3478
Symbol:
ID: 4898690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 555439
End bp: 556617
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 68%
IMG OID: 640114075
Product: peptidase M24
Protein accession: YP_001045343
Protein GI: 126464230
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
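
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(555439, 556617)
print(length, protein_length(length))  # prints "1179 392"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.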


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCCGTT ATTTTTCCCG ATCCGAGTAC GAGCGCCGCT GGCAGAAGGC CGAGGCGCTG 
ATGGCCGAGC GCGGCTTCGA GACGGCTGTC GTCTTCTCGC GCGGCGGCGG GACGACCGAC
AATTGCGGCG ACGTGCTCTA TCTGGCGAAC CACTATTCGG TCAGCGGGGG CACCGATTCG
ACGATCTGGT CGGCGCGGTC CTTCTCGGCG GTGATCCTGC GCCGCGGGCA GGAGCCCGAG
CTGCATATCG ACGAGCCCGA GGGGCGCGCG GATCTCCTCG CCGTGGACCG GGTGGCCTGC
CACAACCATC CGTTCATCGG CGTGGCCGAG GCATTAGTGG CAATGGGCGT CACCGGGCGC
GTCGCGCTCT GCGGGACCCA GTTCATCCCG GTGAAATATT ACCAGCAGCT CGTGTCGCGG
ACGCCGGGGA TCGAATGGGT CGAGGCCGAT GACCTGATCC GCAGCCTGCG CCGGATCAAG
AGCGCGGAAG AGCTCGACTG CTACCGGATC GCGGGCGAGG CGGCGACCGA GGCCACCACG
GTTCTGATGC AGGGCCTGCT GTCGGGACTG TCCGAGCGCG AGGCGGCCGG CGAGGCCGCC
CGCGTGACCG TGGCGCGCGG CGGGCGGGTG CAGGCGATCG GCACCAACCA CGGCGACACG
ATGCAGTATG ACTACCGCAA CCCGCTCACG GGCTCGAGCG CCGACACGCC GGCGGTGGGC
GACATGGTGC GCGGCACGGT CCATGCGGCC TTCTTTCAGG GCTATTATCT CGATCCAGGC
CGGACCGCGG TGCGCGGCAC CCCCACTGCC GATCAGCGAC GGCTGATCGA GGCCACCAAC
GACATCGTCC AGCGGCTGAT CGGCATGATG CGCCCCGGCG CGCGTCTCCT TGATGTGGCG
GCCGAGGGGG ACCGCATGAC ACAGGCCTTC GGCGGCGAGA TCTCTCCGCT GATGAAGAAC
TTCCCCTTCT ACGGCCACGG GATCGGCCTC TCGTTCGAGC AGCCGCGGAT CTCGACCGCC
ATGTCGCTGC CGGGCGATGT GGTCGAGGAG AACATGGTCT TCGGCGTCGA GGCCTTCCTC
GCCCTCGAGG GCGTGGGGTC GGCCTTCTTC GAGGACATCG TGATCGTGAC GGCAGGCACC
CCCGAACTCC TTACTCGCAC CCCCCATTAT TTCTGGTGA
 
Protein sequence
MSRYFSRSEY ERRWQKAEAL MAERGFETAV VFSRGGGTTD NCGDVLYLAN HYSVSGGTDS 
TIWSARSFSA VILRRGQEPE LHIDEPEGRA DLLAVDRVAC HNHPFIGVAE ALVAMGVTGR
VALCGTQFIP VKYYQQLVSR TPGIEWVEAD DLIRSLRRIK SAEELDCYRI AGEAATEATT
VLMQGLLSGL SEREAAGEAA RVTVARGGRV QAIGTNHGDT MQYDYRNPLT GSSADTPAVG
DMVRGTVHAA FFQGYYLDPG RTAVRGTPTA DQRRLIEATN DIVQRLIGMM RPGARLLDVA
AEGDRMTQAF GGEISPLMKN FPFYGHGIGL SFEQPRISTA MSLPGDVVEE NMVFGVEAFL
ALEGVGSAFF EDIVIVTAGT PELLTRTPHY FW
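
The gene and protein sequences are listed in blocks of ten characters. The sketch below shows one way to strip that block formatting, compute GC content, and translate a coding sequence for comparison with the protein listing. It assumes the sequence is given 5' to 3' in the coding frame. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so the standard codon table is used here. All names in the code are hypothetical helpers.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = clean("ATGAGCCGTT ATTTTTCCCG ATCCGAGTAC")
print(translate(first_bases))  # prints "MSRYFSRSEY"
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` would allow the same comparison against the GC content field and the complete protein listing.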