Gene Rru_A2167 details

Gene Information

Locus tag: Rru_A2167
Symbol:
ID: 3835594
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 2516259
End bp: 2517425
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 67%
IMG OID: 637826269
Product: phosphoribosylformylglycinamidine cyclo-ligase
Protein accession: YP_427254
Protein GI: 83593502
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase
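
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2516259
end_bp = 2517425
reported_gene_length = 1167      # bp
reported_protein_length = 388    # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length matches:", gene_length == reported_gene_length)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions all three checks agree with the values in the record.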


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCCGGG GAACCGCACC GGCTGCGGGC GCCCACCCGC CGCGACGAAC ACCAGTGAAG 
AACGAGGTTG CCATTTCCAC TTCCCATCTC GACGCCGGCG CCACGGGCAC CGGCCTGACC
TATAAAGATG CCGGTGTTGA CATAGACAGC GGCAATGCTC TCGTGCAAGC GATCAAACCG
CTTGCCGCAT CGACAAAGCG CCCGGGGGCC GACGCGTCCC TCGGCGGCTT CGGCGCGATC
TTCGATCTTG CCGCCGCCGG CTATTCCGAT CCCCTTCTTA TCACGGCGAC GGACGGTGTT
GGCACAAAGC TGAAGATCGC CCTCGACTCG GGCATCCATG ATAGCGTCGG CATCGACCTC
GTGGCCATGT GCGTCAATGA TCTGGTCGTC CAGGGCGGCG AGCCGCTGCT GTTTCTCGAC
TACTTCGCCA CCTCGCGCCT GCAGGTGCCG GTGGCCAGCG CCGTGGTCAA GGGCATCGCC
GAGGGCTGCC TTCAGGCCGG TTGCGCCCTG GTCGGCGGCG AGACCGCCGA AATGCCCGGC
ATGTATGGCA ATAACGACTA TGATCTGGCC GGCTTCGCCG TTGGCGCCGT CGAGCGCTCG
CAGCTTCTGA CCGATGACCG CATCGGCCTG GGCGACGTTC TGCTCGGCCT CGCCAGCTCG
GGCGTCCATT CCAACGGCTT CTCGCTGGTC CGGCGCATCG TCGAGCGCAG CGGCTTGGCC
TGGGACGCCC CGGCGCCCTT CGCCCCCGAA ACCACCCTGG CCCGCGCCCT GCTGACGCCC
ACGCGCATCT ATGTGAAATC CTGTCTGGCC CTGCACCGCG CTGGGCTGGT TCATGGCTTC
GCCCATATCA CCGGCGGCGG CTTCTGGGAG AATATCCCGC GTGTTCTGCC CCAGGGGGCT
TGCGCCCACC TTGACGGCCT GTCCTGGCCC TTCCCGCCGG TCTTCCGCTG GCTGATGGAT
CAGGGCGGCG TCAGCGCCCA TGAAATGGCC CGCACCTTCA ACTGCGGCAT CGGCATGGTG
GTTGCCGTTC CCGCCGACAA GGCCGAAGCC GCCATCGCTT TGCTTGGCGA ACACGGTGAA
ACCGTTCATC GCCTGGGCAC CATCGCCGCG CGCGGCGAGG GCGAGGCGGT GATCATCGAT
CACCTGGACG AAGCCTTCGC CCGATGA
 
Protein sequence
MVRGTAPAAG AHPPRRTPVK NEVAISTSHL DAGATGTGLT YKDAGVDIDS GNALVQAIKP 
LAASTKRPGA DASLGGFGAI FDLAAAGYSD PLLITATDGV GTKLKIALDS GIHDSVGIDL
VAMCVNDLVV QGGEPLLFLD YFATSRLQVP VASAVVKGIA EGCLQAGCAL VGGETAEMPG
MYGNNDYDLA GFAVGAVERS QLLTDDRIGL GDVLLGLASS GVHSNGFSLV RRIVERSGLA
WDAPAPFAPE TTLARALLTP TRIYVKSCLA LHRAGLVHGF AHITGGGFWE NIPRVLPQGA
CAHLDGLSWP FPPVFRWLMD QGGVSAHEMA RTFNCGIGMV VAVPADKAEA AIALLGEHGE
TVHRLGTIAA RGEGEAVIID HLDEAFAR
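
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It uses only the first 60 bases of the gene sequence (the first line of the listing) and compares the result with the first 20 residues of the protein sequence. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The codon table is the standard one, which is assumed to be adequate here because translation table 11 uses the same codon-to-amino-acid assignments and differs only in permitted start codons.

```python
# Build the standard codon table in the conventional TCAG order.
# Assumption: translation table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence and first two blocks of the protein.
gene_prefix = (
    "ATGGTCCGGG GAACCGCACC GGCTGCGGGC GCCCACCCGC CGCGACGAAC ACCAGTGAAG"
)
protein_prefix = "MVRGTAPAAG AHPPRRTPVK"

print(translate(gene_prefix) == clean(protein_prefix))
print(round(gc_fraction(gene_prefix), 2))
```

Passing the complete gene sequence to `gc_fraction` instead of the 60-base prefix would give a value to compare against the GC content reported in the record.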