Gene Rru_A3655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A3655 
SymbolpurH 
ID3837111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp4194420 
End bp4196000 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content68% 
IMG OID637827779 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_428736 
Protein GI83594984 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTCCATT CCCTGCCCAT CCGCCGCGCC CTGATCAGCG TTTCCGACAA GGGCGGGCTT 
GTGCCCTTCG CCCGTTTCCT CGCCGATCAC GACATCGAGA TCTTGTCGAC CGGAGGCAGC
GCCAAGGCGC TGGCCGATGC CGGCATTCCG GTGACCGAGG TCGCCGATTT CACCGGTTTC
CCGGAAATGC TCGATGGCCG GGTCAAGACC CTGCATCCGA AGATCCACGG CGGCATCCTG
GGCATCCGCG ACAATCCCGA GCACCAGCGG GCGATGGCCG CCCATGAGAT CTTGCCGATC
GATCTGGTGG TGGTGAACCT CTATCCCTTC GAAGCCACGG TGGCCAAGGG CGCCGCCTTC
GAGGACTGCG TCGAGAACAT CGACATCGGT GGGCCGGCCC TGATCCGCGC CGCCGCCAAG
AACCACGAGG CGGTCACCGT CGTCGTCGAT CCCGAGGATT ACCAGCCGGT GATGGACGCC
ATGACCGCCG AGGGCGGCGC CACCACGCTG GAGCTGCGGC GCAAGCTGGC TTCGGCCGCC
TTCGCCCGCT GCGGCGCCTA TGACGGCGCC ATCAGCCGCT GGTTCCAGGG GCAGGTCGGC
GACGAGACTC CGCGTCATAT CGTTTTCGCC GGCCGCCTGC GCCAGACCCT GCGCTATGGC
GAGAACCCCC ATCAGAAGGC GGCGTTCTAT GGTCACGGCA TCGCCCGCCC GGGGGTGGCC
AGCGCCGAGC AGCTTCAGGG CAAGGAGCTG AGCTACAACA ACATCAATGA TACCGACGCC
GCCTTTGATC TGGTCTGCGA ATTCGCCGAG CCGGCGGTGG CGATCATCAA GCACGCCAAC
CCCTGCGGCG TCGCCCAGGG CGCCAGCGTC GTCGAAGCCT ATAAGGCCGC CCTCGCCTGC
GATCCGGTCA GCGCCTTTGG CGGCATCGTC GCCCTCAACC GGCCGATCGA TCGCGACTCG
GCGGTGGAAA TCACCAAGAT CTTCACCGAG GTGGTCATCG CCCCCGATGC CGACGCCGAG
GCGCGGGCGA TTTTCGCGGC CAAGAAAAAC CTGCGCCTGC TGCTGACCGG CGTGGTCGCC
GATACCACGG CGCCCGGGCT GACCGTGCGC TCGGTCGCCG GCGGCATGCT GGTCCAGGAC
CGCGACGCCG CCGATCTGCT GTCGGCCGAT CTCAAGGTGG TCAGCAAGCG CACGCCGACC
GAACGCGAAC TGGCCGACAT GCTGATCGCC TTCAAGGTCT GCAAGCACGT CAAATCCAAC
GCCATCGTCT ATGTCAAGGA TGGCGCCACG GTGGGCATCG GCGCCGGCCA GATGAGCCGG
GTCGACAGCG CCCGCATCGC CTCGTGGAAG GCCGATGAGG CCGCCGAGGC GGCCGGGCTC
GCCCAATCGC CGACCCAGGG GTCGGTCGTC GCCTCCGACG CCTTCTTCCC CTTCGCCGAT
GGCCTGCTGG CCGCGGCCAA GGCCGGGGCA ACGGCGGTGA TCCAGCCCGG CGGCAGCATG
CGCGACGACG AGGTCATCAA AGCCGCCGAC GAGGCCGGCT TGGCGATGGT CTTCACCGGT
TTGCGCCACT TCCGCCATTA G
 
Protein sequence
MLHSLPIRRA LISVSDKGGL VPFARFLADH DIEILSTGGS AKALADAGIP VTEVADFTGF 
PEMLDGRVKT LHPKIHGGIL GIRDNPEHQR AMAAHEILPI DLVVVNLYPF EATVAKGAAF
EDCVENIDIG GPALIRAAAK NHEAVTVVVD PEDYQPVMDA MTAEGGATTL ELRRKLASAA
FARCGAYDGA ISRWFQGQVG DETPRHIVFA GRLRQTLRYG ENPHQKAAFY GHGIARPGVA
SAEQLQGKEL SYNNINDTDA AFDLVCEFAE PAVAIIKHAN PCGVAQGASV VEAYKAALAC
DPVSAFGGIV ALNRPIDRDS AVEITKIFTE VVIAPDADAE ARAIFAAKKN LRLLLTGVVA
DTTAPGLTVR SVAGGMLVQD RDAADLLSAD LKVVSKRTPT ERELADMLIA FKVCKHVKSN
AIVYVKDGAT VGIGAGQMSR VDSARIASWK ADEAAEAAGL AQSPTQGSVV ASDAFFPFAD
GLLAAAKAGA TAVIQPGGSM RDDEVIKAAD EAGLAMVFTG LRHFRH