Gene Rsph17029_3525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3525 
Symbol 
ID4898831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp604950 
End bp606431 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content69% 
IMG OID640114129 
Productcarboxypeptidase Taq 
Protein accessionYP_001045389 
Protein GI126464276 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2317] Zn-dependent carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG ACGTCGCCTT CACCGAACTG ATGGACTATC AGCGCCAGAC CGAGGCGCTG 
GCGCAGGTCA TGGGCCGGCT CTCCTGGGAC CAGGAGACGG TGATGCCGCG CGGCGCCGCC
GAGCAGCGGG CCGAGGAGAT GGCGGCGCTC GAAGGCGTGC TCCATGCCCG CCGCACCGAT
CCGCGTCTCG CCGAATGGCT GGAACTGGCC GAGCCCGAGG ACGAGGAGGA TGAGGCGCAG
CTTCGCCTGA TCCGTCGCAG CCACGAGCGG GCCACCAAGG TGCCCGGCCG TCTCGCGCAG
GAGATCGCCC GCGTGACCTC GGCCGCGCAG GGCATCTGGG CCGAGGCCCG GGCGGCGGAC
GATGTCTCGA TGTTCCTGCC GACCCTCACC GAGGTGATCC GCCTCAAGCG CGAAGAGGCG
GCGGCGCTGG CCGCGGGCCG CGACCGCTAC GATGCGCTCA TCGACGATTA CGAGCCCTCC
GCCACGGCGG CCTCGATCTC GGCCATCTTC GACCGGATGC GCCCGCGCCT CGTCGCGCTG
CGCGAGGCGG TGCTCGGGGC CGAGGTTCAG CCCCAGCCGC TGACGGGGCA TTTCGGGCTC
GAGGCGCAGG TCCGCATGGC GCGCGATCTG GCGGCCACCT TCGGCTACGA CTGGACCCGC
GGCCGGATGG ACATGGCGGT CCATCCCTTC TCGTCCGGCT CGGGCAGCGA CGTGCGCATC
ACCACCCGCG TGGTCGAGGC CGACCCGTTC AACTGCTTCT ATTCGACGAT CCATGAGGTC
GGCCACGCGG CCTACGAGCT CGGGATCGAC CCGGACTATG CGCTGACCCC CATCGGCGCA
GGCGTTTCGA TGGGTGTCCA CGAGAGCCAG AGCCGGATCT ACGAGAACCA GCTCGGCCGG
TCGCGCGCCT TCACCGGCTG GCTCTACGGC CGCATGAGCG AGCGGTTCGG CGATTTCGGC
ATTGCCGATG CCGAGGCCTT CTATGCCACC GCGAACCGCG TCCAGTCGGG CTACATCCGC
ACCGAGGCCG ACGAGGTGCA TTACAACCTG CACATCATGA TGCGCTTCGA TCTCGAGCGC
GGGCTGATCC GCGGCACGCT CGAACCCGAG GATCTGGAAG AAGCCTGGAA CGCCCGCTTC
CTCGAGGATT TCGGCGTGGC GGTGGACCGG CCCTCGCACG GGATGCTGCA GGATGTGCAT
TGGTCGGTGG GGCTCTTCGG CTATTTCCCG ACCTATGCTC TGGGGAACGT CTATGCGGGC
TGCCTGATGA AGGCGATCCG CGCGGCGGTG CCCGATCTCG ACGACCAGCT CGCCCGCGGC
GACACGTCGG GCGCGACCGG CTGGCTGCGC GAGAACCTGC AGCGGCACGG CGGGCTCTAC
CGGCCGCACG AGACCGTCAC CCGCGCCTGC GGCTTCGAGC CGACCGAGGA GCCGCTCCTC
GATTATCTCG AAGAGAAGTT CCGCGGCATC TACCGGCTCT GA
 
Protein sequence
MSADVAFTEL MDYQRQTEAL AQVMGRLSWD QETVMPRGAA EQRAEEMAAL EGVLHARRTD 
PRLAEWLELA EPEDEEDEAQ LRLIRRSHER ATKVPGRLAQ EIARVTSAAQ GIWAEARAAD
DVSMFLPTLT EVIRLKREEA AALAAGRDRY DALIDDYEPS ATAASISAIF DRMRPRLVAL
REAVLGAEVQ PQPLTGHFGL EAQVRMARDL AATFGYDWTR GRMDMAVHPF SSGSGSDVRI
TTRVVEADPF NCFYSTIHEV GHAAYELGID PDYALTPIGA GVSMGVHESQ SRIYENQLGR
SRAFTGWLYG RMSERFGDFG IADAEAFYAT ANRVQSGYIR TEADEVHYNL HIMMRFDLER
GLIRGTLEPE DLEEAWNARF LEDFGVAVDR PSHGMLQDVH WSVGLFGYFP TYALGNVYAG
CLMKAIRAAV PDLDDQLARG DTSGATGWLR ENLQRHGGLY RPHETVTRAC GFEPTEEPLL
DYLEEKFRGI YRL