Gene Rsph17029_3714 details

Gene Information

Locus tag: Rsph17029_3714
Symbol:
ID: 4898869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 830101
End bp: 832053
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 70%
IMG OID: 640114324
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_001045572
Protein GI: 126464459
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
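
The Gene Length and Protein Length fields can be cross-checked against Start bp and End bp. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the record does not state the coordinate convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(830101, 832053)
print(length, protein_length(length))  # 1953 650
```

Under these assumptions the computed values agree with the 1953 bp and 650 aa listed in the record.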


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGAGGC ATGAGCCGCG GCAGGGCAAG CTGATCCCCC GCCGCAGCCT GTTCTGCCGC 
ACAGATCCTG CAGAGATGAC CCTGTCGCCC GACGGCGCGA CCATTGCCTG GCTGAACGGC
GATGCAGGGG TCTCGCGGGT AGTCGTGGCC ACAGCGGCCC CGGTCTCGAC GCGGGTCGTG
CAGCGAGTGC TGTCGCCCAC GAGGCCGGGG GTCACGCCGG TGCTGGTTTG GGCGGGCAGT
CGGCACCTGC TGGTGTTTCG CGATGCGGGG GGCGACGAAA ACTATCGCGC CTTTGCCGTG
GATGCCGCGA CCGGGGCGGA ACGTCCCTTG ACCCCGCCGG GCGCCCGCGC GTTGATACGC
CGCTGCACGC CGGACGGTAT GGTGATGTTC TGGATGAACG ACCGGGATGC CGCCTGTTTC
GATCTGGTGC AGGTCGATGT CGTGACCGGT TCGGCGCAGC GGGTGTTCAC CAACACCCAC
GCCTTCAGCA TGGTCCATTT GGACTCTGCC CTGCGCCCGG TACTGGCCGA GGCGGTGCAG
GCCGATGGTG CCCGCGTCCT TATGCGCAGC GATGGGGACA CATGGCCAGA GGTCGCCCGG
ATCGCGCCGG AAGATGCCCT TGGCTTTCGG GTGGTGGGGG TCACCGACCG CGCGGCCCAT
CTTGTCGACA GCACTGGGCG CGATACGGCC GCGCTGGTGA TGCTGGATGT GCGGAGCGGT
GCGCGGACGG TTCTGGCTGC CGATACGCAG GCGGATATCG ATGCGGTGGT TCTGTCGCCT
TCGGGTGGTG CGCCGCTGGC GGCTGGGGCA ATGCTGCACC GACGCCGCTG GCAGGCCGTT
GATCCGGCCT TTGCCCCGGT GCTGGCGGCG CTGGCGCGTC ACGCCGGGGT GGGCCAGTTC
GACATCGTGG ATGTCAGTTC GGACAGTGCC CGAGTCCTTG CCCGGATCGA GCAAGCCGAC
CGCCCGGTCG CCCATGTGAT GGTCGACGGC GGGGCGGTCC ACCGGATCGT CATGCGCGCG
GGGACGGATG GTCTGACCGA GGCGGGCCTG CGCCCGATGG AGCCCGTGAC GCTGGCGGCC
CGCGATGGCT TGCCGCTGCA TGGCTATGTG ACCCGGCCAA AGGCGGGCCA CGGCCCCGCG
CCGCTAGTGC TGTTGGTCCA TGGCGGACCC TATGACCGGG ACCGCTGGGG GTTCAGCCCC
ACGCATCAAT GGCTCGCCAG CCGAGGGTTT GCAGTGTTGT CGGTCAACTT CCGCGGTTCC
ACGGGCTTCG GCAAGGCCTT CATCGCCGCT GGCGACCGGG AATGGGGCGG ACGGATGCAG
GACGATCTGG CCGATGCCGT GGGGTGGGCC GTGGCCCAGG GGATTGCCGA TCCGGCCCGG
GTTCAGGTGA TGGGCAGCAG TTATGGCGGG TATGCCGCCC TGATGACCGC CGGTCTGCAT
CCTGACCTGT GCGCGGGCGT GGTCAGCATC GGCGGGCCAT CGTCGCTTGC GGGTTTCATG
GACGCCATCC CGCCCTACTG GCAAAGCTGG TTCGCGATGA TCCGCCAGCG GCTGGCCGAC
CCCGCGATCG CCGAGGGGAG GGCATGGCTC GATGCCCGCT CGCCGTTGGC GCATGTCGCG
ACGATCCACT GTCCCGTTCT CATGATACAT GGTCGACAGG ACGTGCGGGT GCCGCTTGTG
CAGGCCCGGG CGATGGCGGC GGCGCTGGCG GCGGCGGGCC GCCCGGTCAC GCTGGCGGTG
TTGCCCGACG AGGGGCATTT TATCAGCGGA CAGGCGAACC GGGTGGCGCT TGCAGCGCTG
GTGGAGGCGT TCCTGCAGGA TCAGGCGGGC GGGCCGGTCG AGGATGTGGC CGAAGACCTG
GTCGCATCGC GGATGGCCAT CCTGCAGGGC GGGGATTTCC TGCCCGGCGC GGTTCTGGCC
CGGCTGCGCG CGCGGAATGC CTCGCCGGCG TGA
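
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such a block; the helper names are hypothetical and not part of the source database.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Example on the first row of the gene sequence only
first_row = "GTGCCGAGGC ATGAGCCGCG GCAGGGCAAG CTGATCCCCC GCCGCAGCCT GTTCTGCCGC"
print(f"{gc_fraction(first_row):.0%}")
```

Applied to all rows of the gene sequence, the result can be compared with the GC content value listed under Gene Information.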
 
Protein sequence
MPRHEPRQGK LIPRRSLFCR TDPAEMTLSP DGATIAWLNG DAGVSRVVVA TAAPVSTRVV 
QRVLSPTRPG VTPVLVWAGS RHLLVFRDAG GDENYRAFAV DAATGAERPL TPPGARALIR
RCTPDGMVMF WMNDRDAACF DLVQVDVVTG SAQRVFTNTH AFSMVHLDSA LRPVLAEAVQ
ADGARVLMRS DGDTWPEVAR IAPEDALGFR VVGVTDRAAH LVDSTGRDTA ALVMLDVRSG
ARTVLAADTQ ADIDAVVLSP SGGAPLAAGA MLHRRRWQAV DPAFAPVLAA LARHAGVGQF
DIVDVSSDSA RVLARIEQAD RPVAHVMVDG GAVHRIVMRA GTDGLTEAGL RPMEPVTLAA
RDGLPLHGYV TRPKAGHGPA PLVLLVHGGP YDRDRWGFSP THQWLASRGF AVLSVNFRGS
TGFGKAFIAA GDREWGGRMQ DDLADAVGWA VAQGIADPAR VQVMGSSYGG YAALMTAGLH
PDLCAGVVSI GGPSSLAGFM DAIPPYWQSW FAMIRQRLAD PAIAEGRAWL DARSPLAHVA
TIHCPVLMIH GRQDVRVPLV QARAMAAALA AAGRPVTLAV LPDEGHFISG QANRVALAAL
VEAFLQDQAG GPVEDVAEDL VASRMAILQG GDFLPGAVLA RLRARNASPA
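
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below assumes the bacterial code (table 11) uses the standard amino acid assignments and that an initiator codon such as GTG is read as M in the first position; the start codon set and all names are illustrative assumptions, not taken from the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...)
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Assumed initiator codons for table 11
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(text: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE.get(codon, "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_row = "GTGCCGAGGC ATGAGCCGCG GCAGGGCAAG CTGATCCCCC GCCGCAGCCT GTTCTGCCGC"
print(translate_cds(first_row))  # MPRHEPRQGKLIPRRSLFCR
```

The first row of the gene sequence translates to the first twenty residues of the protein sequence, with the leading GTG read as M.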