Gene Rsph17029_1578 details

Gene Information

Locus tag: Rsph17029_1578
Symbol:
ID: 4895458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1658633
End bp: 1659865
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 71%
IMG OID: 640112169
Product: imidazolonepropionase
Protein accession: YP_001043460
Protein GI: 126462346
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID: [TIGR01224] imidazolonepropionase
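
The length fields above follow from the coordinates by simple arithmetic. A minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon (both assumptions are consistent with the values in this record, but the record does not state them); the function names are hypothetical:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1658633, 1659865)
print(length, protein_length_aa(length))  # 1233 410
```

Both results match the Gene Length and Protein Length fields reported in the record.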


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0729988
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0660872
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGCACAC AAATTAAGAT CAAGCGCGCT CAGAAGGCAA GATGCATGAT GATTCTCGGC 
AATCTGCGGG TGGCAACGCT GAGCGACGGC TACGGGCTGA TCCCGGACGC CGCTATCCTG
ATCGAGGGCG GCCGCATCCA ATGGGTCGGT CCCGAGGCCC ACCTGCCGCC CTCCGCCGCG
CCCCGGCACG ACATGGGCGG GCGGCTCTGC ACGCCCGCGC TGATCGACTG CCATACCCAT
GCGGTCTTTG CGGGCACCCG CGCGGCCGAG TTCGAGATGC GGCTGAAGGG CGCCTCCTAT
GCCGAGGTGG CGGCCGCGGG CGGCGGCATC GTCTCGACGG TCACGGCCAC CCGCGCCGCC
AGTGCCGACG AACTTCTGGC CGCGAGCCTG CCCCGGATCG ATGCGATGCT GGCGGGCGGC
GTCGGCACGG TCGAGATCAA GTCGGGCTAC GGGCTCGACA TCGAGACCGA GCTTCGGATG
CTGCGCGTCG CGCGCCGGAT CGGTGAGTTG CGCAAGGTCC GGGTCCGCAC GAGCTTTCTC
GGCGCCCATG CCGTGCCGCC GGACTATCGC GGCCGTCCCG ACGCCTATCT GGCCGAAGTC
GTGCTGCCCG CGCTGAAGGT GGCGCAGGAC GAGGGCCTCG TCGATGCGGT GGACGGGTTC
TGCGAGGGCA TCGCCTTCTC GCCCGCCCAG ATCGCCCATC TCTTCGCACA GGCGCACAAG
CTGCGGCTGC CGGTGAAACT TCATGCCGAG CAGCTTTCGA ACCTCGGCGG CGCGGCGCTG
GCCGCACGCC ACGATGCGCT CTCGGCCGAT CATCTCGAAT ATCTCGATGC CGAGGGCGTG
GCGGCGCTCG CCGCGGCAGG CACCGTGGCC GTGCTGCTGC CCGGCGCGTT CTATGCGCTG
CGCGAGACGC AGGCGCCCCC GGTGGCGGCG CTCCGCGCGG CGGGCGTGCC GATGGCAGTG
GCGACCGACC TGAACCCCGG CACCTCTCCG CTGGGCGCGC TGGGGCTCGC CATGAACATG
GCCTGCACCC TCTTCCGCCT CACGCCCGAG GAGGCTCTGG CCGGCACCAC GATCCATGCC
GCTCGGGCGC TCGGGCTCTC CGACACCGGC CGCATCGCGC CGGGCTTTCG CGCCGATCTC
GCCATCTGGG AGGCCGAGCA TCCGGCAGAA CTCAGCTGGC GCATCGGCCC CGCGCCCCTC
CATGCCCGCC TCCACGAGGG AGAGTTCGTC TGA
 
Protein sequence
MCTQIKIKRA QKARCMMILG NLRVATLSDG YGLIPDAAIL IEGGRIQWVG PEAHLPPSAA 
PRHDMGGRLC TPALIDCHTH AVFAGTRAAE FEMRLKGASY AEVAAAGGGI VSTVTATRAA
SADELLAASL PRIDAMLAGG VGTVEIKSGY GLDIETELRM LRVARRIGEL RKVRVRTSFL
GAHAVPPDYR GRPDAYLAEV VLPALKVAQD EGLVDAVDGF CEGIAFSPAQ IAHLFAQAHK
LRLPVKLHAE QLSNLGGAAL AARHDALSAD HLEYLDAEGV AALAAAGTVA VLLPGAFYAL
RETQAPPVAA LRAAGVPMAV ATDLNPGTSP LGALGLAMNM ACTLFRLTPE EALAGTTIHA
ARALGLSDTG RIAPGFRADL AIWEAEHPAE LSWRIGPAPL HARLHEGEFV
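
The sequences above are laid out in space-separated blocks of 10 characters, 60 per row. A minimal sketch of how such a block could be flattened and its GC content computed, for comparison with the GC content field; the helper names are hypothetical, and only the first row of the gene sequence is used here as sample input:

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGTGCACAC AAATTAAGAT CAAGCGCGCT CAGAAGGCAA GATGCATGAT GATTCTCGGC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length equals the Gene Length field, which is a quick way to confirm that nothing was lost when copying the blocks.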