Gene Rsph17025_1072 details

Gene Information

Locus tag: Rsph17025_1072
Symbol:
ID: 5083364
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 1100499
End bp: 1101677
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 70%
IMG OID: 640482630
Product: aminodeoxychorismate lyase
Protein accession: YP_001167278
Protein GI: 146277119
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family
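
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1100499, 1101677)
print(length)                  # 1179, matching "Gene Length"
print(protein_length(length))  # 392, matching "Protein Length"
```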


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.232656
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGCGCT CTGTCGCCTC CAACGCGCTC ACCCTGTTCA TCGTGGTGCT GGTGGCGGCG 
GCAGGGTTGC TGGCCTGGGG GCGGCAGGAA TACATCGGCC CCGGGCCGCT GACCGAGGCC
GTCTGCCTGC GGGTCGAGCG GGGAGACTCG CTGTCGGTCG TGAGCCGGCG GCTCGAGTCG
CAGGGCGCGG TGAGCGATGC GCGCATCTTC CGCATCGGGG CGGACTATTC CGACAAGGCC
GCGGGGCTCA AGTTCGGCAG CTATCTGCTG CCGCCGGGCG CCTCGATGGC CCAGATCCTC
GACATTCTGA CGGCGGGCGG GCAATCGACC TGCGGGCGCG AGGTGAATTA CCGGATCGGC
GTGGTGGCGG CCGAGATCAT CCTGCGGGAG TTCGACGCGG GCGAGGGCCG CTATGTCGAG
GTGGCGAAGT TCGTGCCGGG GGAGGGTGAG GCGCCCGAAG CCTACCGGGA GGCGGCCGGG
GAAGGCGACC TGCGCTGGCG TGTGACGCTG GCGGAGGGCG TGACGAGCTG GCAGGTGGTC
GAGAGCCTGC GCAAGGCCGA GTTCCTCGAG GGCGAGATCA GGGAGGTTCC GCCGGAGGGT
TCGCTGGCCC CGGACAGTTA CGAGGTGGCG CGAGGGGATG ACCGGGCGGC GCTTCTGGCG
CAGATGCAGG AGCGGCAGGC GCGCATCATG GCCGAGCTGT GGGCCGCGCG GTCCCCGAGC
GTGCCCTATG GATCGCCGGA AGAGGCGATG ATCATGGCGA GCATCGTCGA GAAGGAGACC
GGCATTTCCT CGGAGCGGCC GCAGGTGGCG AGCGTCTTCG TCAACAGGCT GGCGCAGGGG
ATGCGGCTGC AGACGGACCC GACGGTGATC TATGGCATCA CCGAGGGCAA GGGCGTCCTC
GGCCGCGGTC TGAGGCAGAG CGAGCTGCGC CGCCGCACCG ACTACAATAC CTATGTGATC
GACGGGCTGC CGCCCACCCC CATTGCCAAT CCGGGCCGGC TGTCGATCGA GGCGGCGCTG
AACCCGGCCG AGACGGATTA CCTTTATTTC GTGGCCGATG GCAGCGGGGG CCATGCCTTC
GCGCGGACCC TGGCCGAGCA CAACCGCAAT GTGGCTGCCT GGCGGCGGAT CGAGGCCGAG
CGCGGGATGC CGCCCCCGGT GGGGATCCAG GGCGAGTGA
 
Protein sequence
MWRSVASNAL TLFIVVLVAA AGLLAWGRQE YIGPGPLTEA VCLRVERGDS LSVVSRRLES 
QGAVSDARIF RIGADYSDKA AGLKFGSYLL PPGASMAQIL DILTAGGQST CGREVNYRIG
VVAAEIILRE FDAGEGRYVE VAKFVPGEGE APEAYREAAG EGDLRWRVTL AEGVTSWQVV
ESLRKAEFLE GEIREVPPEG SLAPDSYEVA RGDDRAALLA QMQERQARIM AELWAARSPS
VPYGSPEEAM IMASIVEKET GISSERPQVA SVFVNRLAQG MRLQTDPTVI YGITEGKGVL
GRGLRQSELR RRTDYNTYVI DGLPPTPIAN PGRLSIEAAL NPAETDYLYF VADGSGGHAF
ARTLAEHNRN VAAWRRIEAE RGMPPPVGIQ GE
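
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, plus a GC fraction and a stop-codon check. The helper names are hypothetical, and the stop codons listed are those of translation table 11, the table named in the record.

```python
# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq):
    """True if seq is a whole number of codons and the last one is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# The last line of the gene sequence, copied from the record.
tail = clean_sequence("CGCGGGATGC CGCCCCCGGT GGGGATCCAG GGCGAGTGA")
print(len(tail))
print(gc_fraction(tail))
print(ends_with_stop(tail))
```

Running the same helpers on the full gene sequence, pasted as one string, gives a way to compare against the reported gene length and GC content.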