Gene TM1040_3004 details

Gene Information

Locus tag: TM1040_3004
Symbol:
ID: 4078034
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 3172003
End bp: 3173220
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 63%
IMG OID: 638008333
Product: imidazolonepropionase
Protein accession: YP_614998
Protein GI: 99082844
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID: [TIGR01224] imidazolonepropionase
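
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so it does not contribute a residue.
    return length_bp // 3 - 1


length = gene_length_bp(3172003, 3173220)
print(length, protein_length_aa(length))  # 1218 405
```

Under these assumptions the computed values agree with the reported gene length (1218 bp) and protein length (405 aa).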


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.783832
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.525072
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGATA CATACGTGTT GAGCGGTGCT CGGTTGGCGA CCATGGAGGC CGAGGGGTCT 
TATGGGCTTG TGGAAGACGG GGCGATCGCC ATTCAGGGGG ATGAAATTCT CTGGTGCGGC
GCGCGGGGCG CGCTACCGGA CACCTACGCA GCTTGCCCCA GCACCGACCT CAATGGACGT
CTGGTCACGC CGGCCTTCAT TGATTGTCAC ACGCACATCG TTTTTGGCGG AGACCGTGCG
GGCGAGTTCG AGATGCGCCT CGAGGGGGCA ACCTATGAAG AGGTGGCCAA GGCTGGCGGC
GGGATTGTGT CGACCGTCAC GGCCACGCGT GCCGCAAGCC TGGATGCGCT TGTGACCGGG
GCCTTGCCGC GGCTGGATGC ACTGATTGCG GAAGGGGTGA GCACGGTAGA AGTCAAATCC
GGCTACGGGC TCGACCGCGA GACCGAGCTC AATATGCTGC GCGCGGCGCG TCGTCTGGCT
GAGCACAGAG ATGTTACGGT CAAAACCACC TTTCTGGGCG CGCATGCAGT CCCTGCAGAG
TATGCGGGTC GGGCAGATGC CTATCTTGAT GAGGTCTGCC TGCCGACGCT GCGCGCAGCG
CATGCCGAGG GGTTGGTCGA TGCGGTAGAT GGCTTTTGTG AAGGGATCGC TTTTTCGGCG
GCGCAGATCG CAAAGGTTTT TGACGTGGCA GCAGAGTTGG GTTTGCCGGT GAAGCTCCAT
GCCGAACAGC TGTCGCATCA GGGCGGCACC AAATTGGCGG CAGAGCGCGG CGCACTGTCT
GTGGATCATG TGGAATACGC CACAGAAGCG GACGCACGGG CAATGGCGGC TTCGGGGTCC
GTTGCGGTCC TGTTGCCCGG CGCGTTCTAC ACAATCCGCG AAACGCAAGT CCCGCCCGTG
GCAGAGTTTC GCATGCATGG CGTGCCGATG GCACTGGCGA CAGACTGCAA CCCCGGATCA
TCGCCGCTCA CGTCGCTTCT GCTGACGCTC AATATGGGCT GCACATTGTT TCGCCTGACC
CCGGAAGAGG CGCTTGCGGG TGTGACCCGT AATGCCGCCC GCGCGCTGGG CATGCAGGAT
CGTGGGCGCA TCGCCCCCGG GCTTCGTGCG GATCTGGCGG TCTGGGATGT CTCTCGCCCG
GCAGAGCTGG CTTATCGCAT TGGTTTCAAC CCGCTCTACG CGCGTGTCAT GGGCGGCAAA
ATGGAGGTTC GTACATGA
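
The reported GC content (63%) can be recomputed from the gene sequence as the share of G and C bases. The following is a minimal sketch, assuming the sequence is pasted in the whitespace-grouped layout shown above; `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Example call on the first line of the gene sequence only; pass the
# full 1218 bp sequence to compare against the reported value.
first_line = "ATGCGCGATA CATACGTGTT GAGCGGTGCT CGGTTGGCGA CCATGGAGGC CGAGGGGTCT"
print(round(gc_percent(first_line), 1))
```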
 
Protein sequence
MRDTYVLSGA RLATMEAEGS YGLVEDGAIA IQGDEILWCG ARGALPDTYA ACPSTDLNGR 
LVTPAFIDCH THIVFGGDRA GEFEMRLEGA TYEEVAKAGG GIVSTVTATR AASLDALVTG
ALPRLDALIA EGVSTVEVKS GYGLDRETEL NMLRAARRLA EHRDVTVKTT FLGAHAVPAE
YAGRADAYLD EVCLPTLRAA HAEGLVDAVD GFCEGIAFSA AQIAKVFDVA AELGLPVKLH
AEQLSHQGGT KLAAERGALS VDHVEYATEA DARAMAASGS VAVLLPGAFY TIRETQVPPV
AEFRMHGVPM ALATDCNPGS SPLTSLLLTL NMGCTLFRLT PEEALAGVTR NAARALGMQD
RGRIAPGLRA DLAVWDVSRP AELAYRIGFN PLYARVMGGK MEVRT