Gene TM1040_1118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1118 
Symbol 
ID4077239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1199601 
End bp1201766 
Gene Length2166 bp 
Protein Length721 aa 
Translation table11 
GC content61% 
IMG OID638006422 
Productphosphoribosylformylglycinamidine synthase II 
Protein accessionYP_613113 
Protein GI99080959 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain 
TIGRFAM ID[TIGR01736] phosphoribosylformylglycinamidine synthase II 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAAC CAGCCATTAC GCCTGAAGTG ATCGAAAACC ACGGACTGAA GCCCGATGAA 
TACGATCTGA TCCTCGAGAT CATCGGCCGC GAGCCGACCT TCACCGAGCT CGGCATTTTC
TCCGCCATGT GGAACGAGCA CTGTTCCTAT AAATCATCGA AAAAATGGCT GCGCACCCTG
CCGACCTCCG GCCCGCAGGT GATCTGCGGC CCCGGCGAGA ACGCCGGCAT CGTGGACATC
GGCGATGGCG ATGCGGTCGT CTTCAAGATG GAAAGCCACA ACCACCCCTC CTACATCGAG
CCCTATCAAG GTGCCGCCAC TGGTGTTGGC GGCATTTTGC GTGATGTCTT CACCATGGGC
GCGCGGCCTA TCGCGTCGAT GAATTCGCTG TCCTTTGGTG AGCCTGCGCA TCACAAGACC
CGCCAATTGG TAAACGGCGT GGTTGAGGGC GTCGGCGGCT ACGGCAACTG TTTTGGTGTG
CCTTGTGTCG GCGGCGAAGT GCGCTTTCAC CCCGCCTACA ATGGCAACTG CCTGGTGAAC
GCCTTTGCGG CAGGCCTCGT AAAAACCGAC ATGATCTTCT ACTCCGCCGC CTCCGGTGTG
GGCATGCCCG TTGTGTACCT CGGCGCCAAG ACCGGCCGCG ACGGGGTTGG TGGTGCAACC
ATGGCGTCGG CAGAATTCGA CGACACCATC GAAGAAAAGC GCCCCACCGT GCAGGTTGGT
GACCCGTTCA CCGAAAAACG CCTGATGGAA GCCACGCTGG AGCTGATGCA GACCGGTGCC
GTGATCTCCA TTCAGGACAT GGGCGCGGCA GGCCTCACCT GCTCCGCTGT GGAAATGGGT
GACAAGGGCG GCCTTGGCGT GCGTCTGGAT CTGGAAAAGG TTCCGCAGCG CGAAGAAAAC
ATGACCGCTT ACGAGATGAT GCTCTCGGAA TCGCAAGAGC GCATGCTGAT GGTGCTGAAG
CCCGAGCTGG AGGCCGAGGC CAAAGCCGTC TTTGAGAAAT GGGACCTCGA TTTTGCCATA
GTGGGTGAGA CCATTGCCGA AGATCGCTTC CTCATCATGC ACAACGGCGA GGTCAAAGCG
GATCTGCCGC TGTCAAAGCT TTCGTCCTCG GCGCCGGAAT ACGACCGCCC GTGGATCGAG
GTCGAGGCCC CCGCGGCGCT TACGGATGCG GATGTGCCGA CCATTGACCC GATCGACGGC
CTGAAGGCGC TGATCTCCAG CCCCAACTAT GCCGGCAAAC AGTGGGTCTA TGAGCAATAT
GACACCACCG TGATGGGCGA CACCGCGCGT CGTCCGGGTC TGGGTGGCGG CATGGTCCGC
GTGCATGGCA CCGACAAGAA ACTGGCCTTT ACCTCTGACG TGACCCCGCG TTACGTCAAG
GCGAACCCGG TTGAGGGCGG CAAACAGGCC GTGGCCGAAG CCTATCGCAA CCTCTGCGCT
GTCGGGGCCA AGCCATTGGC GACCACCGAC AACCTCAATT TCGGCAACCC CGAAAAGCCC
GAGATCATGG GCCAATTTGT CGGCGCACTC AAAGGCATCG GCGAGGCGGT CTCGGCGCTT
GATATGCCAA TCGTCTCGGG CAACGTCTCG CTTTACAACG AAACAGACGG CCAGGCGATC
CTGCCGACAC CGACCATTGG CGCGGTAGGT CTGGTTGCGG CGGGCGAAGA GCCGATCCTG
GGCGAGGCCC GCGACGGTCA TGTGCTGCTG CTGGTGGGCG AAACCATTGG TCATCTTGGC
CAGTCGGCGC TTCTGCACGA GGTCTTCAAC CGCGAGGACG GCGACGCCCC CGCAGTGGAT
CTTGAGATCG AAAAGCGCAA CGGCGAGTTC ATCCGCAACA ACCGCGATTT CATCAAGGCC
TGCACCGACA TCAGCGACGG TGGGCTTGCG CTCGCGGCGT TTGAGCTTGC AGAGGCCGCA
GGCGTAGGGG TGCAGATCGA CGCCAGCGAC ACCCCGACGC TCTTTGGTGA GGATCAGGCA
CGCTATCTGG TGGCCTGCAA CTTCGACCAG GCCGAGGCGC TGATGATCGC TGCCGGTCAG
GCCGGGGTGC CGCTTGAGAC CGTTGGCAAG TTTACCGGCG ACACGGTGAA GATGGGCGGA
TCGGAGGCAA CGCTCGAAGA GCTGAGCCAG ATCTTCCGCA CGAGCTTTGC CGAAGCAGTC
GCCTAA
 
Protein sequence
MQEPAITPEV IENHGLKPDE YDLILEIIGR EPTFTELGIF SAMWNEHCSY KSSKKWLRTL 
PTSGPQVICG PGENAGIVDI GDGDAVVFKM ESHNHPSYIE PYQGAATGVG GILRDVFTMG
ARPIASMNSL SFGEPAHHKT RQLVNGVVEG VGGYGNCFGV PCVGGEVRFH PAYNGNCLVN
AFAAGLVKTD MIFYSAASGV GMPVVYLGAK TGRDGVGGAT MASAEFDDTI EEKRPTVQVG
DPFTEKRLME ATLELMQTGA VISIQDMGAA GLTCSAVEMG DKGGLGVRLD LEKVPQREEN
MTAYEMMLSE SQERMLMVLK PELEAEAKAV FEKWDLDFAI VGETIAEDRF LIMHNGEVKA
DLPLSKLSSS APEYDRPWIE VEAPAALTDA DVPTIDPIDG LKALISSPNY AGKQWVYEQY
DTTVMGDTAR RPGLGGGMVR VHGTDKKLAF TSDVTPRYVK ANPVEGGKQA VAEAYRNLCA
VGAKPLATTD NLNFGNPEKP EIMGQFVGAL KGIGEAVSAL DMPIVSGNVS LYNETDGQAI
LPTPTIGAVG LVAAGEEPIL GEARDGHVLL LVGETIGHLG QSALLHEVFN REDGDAPAVD
LEIEKRNGEF IRNNRDFIKA CTDISDGGLA LAAFELAEAA GVGVQIDASD TPTLFGEDQA
RYLVACNFDQ AEALMIAAGQ AGVPLETVGK FTGDTVKMGG SEATLEELSQ IFRTSFAEAV
A