Gene Daro_3174 details


Gene Information

Locus tag: Daro_3174
Symbol:
ID: 3567174
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3411835
End bp: 3412878
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 62%
IMG OID: 637681645
Product: phosphoribosylformylglycinamidine cyclo-ligase
Protein accession: YP_286374
Protein GI: 71908787
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase
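
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3411835, 3412878)
print(length, protein_length(length))  # 1044 347
```

Under these assumptions the result agrees with the listed gene length (1044 bp) and protein length (347 aa).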


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value: 0.0315286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGA ACACTTCCCT CTCCTACCGT GATGCCGGCG TCGATATCGA TGCGGGCGAT 
GCCCTTGTCG AACGTATCAA GCCTTTGGCC AAGAAGACCC TGCGCGAAGG CGTGCTCGGC
GGCATCGGCG GCTTCGGCGC GCTGTTCGAA GTGCCGAAGC GCTACAAGGA GCCGGTGCTG
GTCTCCGGTA CCGATGGTGT CGGCACCAAG CTGCGCCTGG CTTTCGACCT GAACCGCCAC
GATACCGTTG GCCAGGATCT GGTCGCCATG AGCGTCAATG ACATTCTGGT GCTCGGCGCC
GAGTCGCTGT TCTTCCTCGA TTACTTTGCC TGCGGCAAGC TGGATGTCGA TACCGCGGCA
GCAGTCGTCG GCGGCATCGC CAAGGGTTGC GAACTGGCCG GCTGCGCGCT GATCGGTGGC
GAAACCGCCG AAATGCCGGG CATGTACCCG GCTGGTGAAT ACGATCTGGC CGGCTTTGCG
GTCGGTGTCG TTGAAAAATC CAAGGCCATC GACGGCAAGG CCTCGATTAC CCCGGGCGAT
GTCGTGCTCG GTCTGGCTTC CTCTGGCGCC CACTCGAACG GTTACTCGCT GGTCCGCAAG
ATCATCGAGC GTTCCAAGCC GGACATGAAT GCCAAGTTCG ACGGTGAGCG CACGCTGGCC
GACGTCGTCA TGGCGCCGAC CCGCATCTAC GTCAAGCAGG TGCTGGCGAC GATGCAGAAG
GTCACGATCA AGGGCATGGC CCACATTACC GGTGGCGGCC TGCTTGAAAA CGTGCCGCGC
GTGTTGCCGG AAAACACCGT GGCCGAGCTG GAAAAGGCTG CCTGGCCGCG TCCGAAGCTG
TTCGACTGGA TGCAGGCCGA AGGCAATGTC GCCGAAAACG AAATGCATCG CGTCTTCAAC
TGCGGTATCG GTCTGGTCAT CGTGGTTGCT GCGGCCGATG CCGATGCCGC CATGGCCGAA
CTGAAGGCGC AGGGCGAAGC GGTTTATCGC ATTGGCAAGA TCCGGGCGCG TTCGGGTGAC
GAAGCGCAGA CCCTGGTGGT CTAA
 
Protein sequence
MTQNTSLSYR DAGVDIDAGD ALVERIKPLA KKTLREGVLG GIGGFGALFE VPKRYKEPVL 
VSGTDGVGTK LRLAFDLNRH DTVGQDLVAM SVNDILVLGA ESLFFLDYFA CGKLDVDTAA
AVVGGIAKGC ELAGCALIGG ETAEMPGMYP AGEYDLAGFA VGVVEKSKAI DGKASITPGD
VVLGLASSGA HSNGYSLVRK IIERSKPDMN AKFDGERTLA DVVMAPTRIY VKQVLATMQK
VTIKGMAHIT GGGLLENVPR VLPENTVAEL EKAAWPRPKL FDWMQAEGNV AENEMHRVFN
CGIGLVIVVA AADADAAMAE LKAQGEAVYR IGKIRARSGD EAQTLVV
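
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the usual compact encoding of the codon table in T, C, A, G order; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 bases and the final 6 bases of the gene sequence are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # tolerate the 10-base spacing used above
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence
print(translate("ATGACCCAGA ACACTTCCCT CTCCTACCGT"))  # MTQNTSLSYR
# Last two codons of the gene sequence: valine followed by the TAA stop
print(translate("GTCTAA"))  # V
```

The first output matches the opening ten residues of the protein sequence, and the second matches its final residue.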