Gene RoseRS_2554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2554 
Symbol 
ID5209523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3166918 
End bp3167943 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content65% 
IMG OID640596158 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_001276880 
Protein GI148656675 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.339877 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTGT ATCGTGAAGC AGGCGTCGAT ATCGATGCGG CTGCGCGCGC CAGACAGTTG 
ATGGCGGAAG CCGTCCGCTC GACCTACTCA GCCGCAGTCC TGGCAGGTAT CGGAGCGTTC
GGCGGATGTT TCGACCTGCA CGCCGCTGTG GGGGACCAGA CCGACGATGT TGTTCTGGTA
GCATCGACGG ACAGCGTCGG CACCAAAACG CTCGTGGCGG CAGCGCTCGG TCGCTATGAA
ACCCTCGGCT ACGATCTGGT CAACCACTGC GTCAACGATA TTCTGGTGCA GGGAGCGCGT
CCGTTGTTCC TGCTCGACTA TCTGGCAGTT GATCGCCTCG ATCCGCAGCG TGCGGCAACG
CTGGTGGCGA GTGTTGCAGC GGCGTGCCGT GAGGTCGGAT GCGTCCTGCT TGGCGGCGAA
ACCGCTCAGA TGCCGGATGT GTACCGTGAG GGAGCATTCG AACTGGCAGG AACGATCGTC
GGTGTTGTCC GACGGGCGCA GATGCTCCCG CGCAACGTTG CGGCGGGCGA TGTGATCCTG
GCGCTGCCGT CGAGCGGATT GCACACAAAC GGGTACTCCC TGGCGCGGCG GGTGCTGGGT
CGCGGCAGCG CCTGGGGCTA CGATGCACGA CCGGCTGAAC TGAACGGCAG AAGCATCGGC
GAAGCGTTGC TCGAACCGCA CCGCGTCTAC CTGCGCGCCT TCGAGCAACT TGAAGCGGCT
GGCGTCGCGG TGCATGCAAT GGCGCACATC ACCGGCGGCG GCATCTACGA GAACCTGCCG
CGTGTGCTTC CAGAGGGATG TGGCGCCGTC ATCCGACGCC GAACCTGGAC GATCCCGCCG
ATCTGCACCC TCGTGGTGCA GGCTGCCGGT CTCGATGAAC GCGAAGCATT CCGCACCCTG
AACATGGGAC TCGGCATGCT GGTGATCGTC CCATCGGACG CCGCCGACGC CGCGCGACGC
GCCGTTCCCG AAGCATCGCC GGTCGGTGAA GTCGTCACCG GCGGCGATGT CCGGTTGATT
GACTGA
 
Protein sequence
MTVYREAGVD IDAAARARQL MAEAVRSTYS AAVLAGIGAF GGCFDLHAAV GDQTDDVVLV 
ASTDSVGTKT LVAAALGRYE TLGYDLVNHC VNDILVQGAR PLFLLDYLAV DRLDPQRAAT
LVASVAAACR EVGCVLLGGE TAQMPDVYRE GAFELAGTIV GVVRRAQMLP RNVAAGDVIL
ALPSSGLHTN GYSLARRVLG RGSAWGYDAR PAELNGRSIG EALLEPHRVY LRAFEQLEAA
GVAVHAMAHI TGGGIYENLP RVLPEGCGAV IRRRTWTIPP ICTLVVQAAG LDEREAFRTL
NMGLGMLVIV PSDAADAARR AVPEASPVGE VVTGGDVRLI D