Gene Rsph17029_0677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0677 
Symbol 
ID4897876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp682128 
End bp683174 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content70% 
IMG OID640111261 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_001042562 
Protein GI126461448 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.595735 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAC AGCAGAAGGG TCTGACCTAC GCGGACGCAG GGGTGGACAT CGACGCCGGC 
AACGCGCTCG TCGAGCGGAT CAAGCCCGCC GCCAAGCGCA CGGCGCGCCC GGGCACGGTC
TCGGGCCTCG GCGGGTTCGG CGCGCTCTTC GACCTCAAGG CCGCAGGCTA TCACGACCCG
GTGCTGGTCG CTGCCACCGA CGGGGTCGGC ACCAAGCTGC GCATCGCCAT CGACACGGGC
GAAGTGGACA CGATCGGCAT CGACCTCGTG GCCATGTGCG TGAACGATCT CGTCTGCCAG
GGCGCAGAGC CGCTGTTTTT CCTCGACTAT TTCGCGACGG GCAAGCTCGA GGTCGCGCAG
GCTGCGCGGA TCATCGAGGG AATCGCGGAA GGCTGCGCCG CCTCGGGCTG CGCGCTGATC
GGCGGCGAGA CCGCCGAGAT GCCCGGCATG TATCACAAGG GCGACTTCGA TCTCGCGGGC
TTCGCCGTGG GCGCGATGGA GCGCGGTGCC GACCTGCCGC AGGGGGTCGC AGAGGGCGAC
GTGCTGCTGG GCCTCGGGTC GAACGGGGTC CATTCGAACG GCTATTCCTT CGTGCGCAAG
GTGGTCGAGC TCTCGGGGCT CGGCTGGGAT GCGCCCGCGC CCTTCGGCGG CGACAGCCTC
GGGCGGGCGC TTCTCGCGCC GACGCGCCTC TATGTGAAGC AGGCGCTGGC GGCGGTGCGG
GCGGGGGGCG TGCATGCGCT GGCCCATATC ACCGGCGGCG GCCTCACCGA GAACCTGCCG
CGCGTTCTGC CCGAGGGTCT GGGCGCGCGC ATCGACCTTT CCGCCTGGGA GCTGCCGCCG
GTGTTCCGCT GGCTGGCCGA GACTGCTTCG ATGGCCGAGC CCGAGCTCTT GAAGACCTTC
AACTGCGGCA TCGGTATGAT CGTCGTGGTC GCGGCCGATC GCGCCGACGA GATTGCGGCC
CTGCTCGCGG CCGAGGGCGA GACGGTCACG CGGATCGGCG AAGTGATCGC AGGCGAGGGC
GTGAGCTACG ACGGCCGCCT TCTGTGA
 
Protein sequence
MAEQQKGLTY ADAGVDIDAG NALVERIKPA AKRTARPGTV SGLGGFGALF DLKAAGYHDP 
VLVAATDGVG TKLRIAIDTG EVDTIGIDLV AMCVNDLVCQ GAEPLFFLDY FATGKLEVAQ
AARIIEGIAE GCAASGCALI GGETAEMPGM YHKGDFDLAG FAVGAMERGA DLPQGVAEGD
VLLGLGSNGV HSNGYSFVRK VVELSGLGWD APAPFGGDSL GRALLAPTRL YVKQALAAVR
AGGVHALAHI TGGGLTENLP RVLPEGLGAR IDLSAWELPP VFRWLAETAS MAEPELLKTF
NCGIGMIVVV AADRADEIAA LLAAEGETVT RIGEVIAGEG VSYDGRLL