Gene TM1040_3003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3003 
Symbol 
ID4078033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3170563 
End bp3171927 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content62% 
IMG OID638008332 
ProductN-formimino-L-glutamate deiminase 
Protein accessionYP_614997 
Protein GI99082843 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02022] formiminoglutamate deiminase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.54016 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.775669 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACAA TTTTTGCCGC CACCGCGTGT TTGCCTGAGG GCTGGGCGAA GGATGTCCGC 
CTCACCATCA AGGACGGCCA AATCTCTCGG ATCGAGACCG GCTCAGCGCC CGTGCCCGGC
GACACGCGCG TCGATGTGCT CTTGCCAGCG CTTGCCAACC TGCATTCACA TTCGTTCCAG
CGCGCGATGG CGGGCCGCAC CGAATTTCGG GGGGCCGGTC AAGACAGTTT CTGGACCTGG
CGAGAGCTGA TGTATCGCTT TCTCGACCAT CTCACCCCCG ATCAATACGA GGCCATTGCG
GCACTTACAT TCATGGAAAT GATGGAGGCC GGGTATGCCT CTGTGGGTGA GTTCCACTAT
GTTCACCATC AGCCGGGCGG TACCGCCTAC CAGAGCCTCA GCGAACTCAG TCAACGCGTG
ATGGCCGGCG CCCAGCAAAC CGGCATTGGC CTCACCCATC TGCCGGTGCT CTACACCTAC
GGAGGCGCAC AACAGCAACC ACTTACCGGT GGCCAGATGC GCTTTGGCAA TGATGTGGAG
CGCTTCTCGC GTCTTGTCAC CGAAGCCCGC GATGCGGCGC AAGAGCTTGG GCGCGACACG
CGTGTCGGTA TCGCGCCGCA TTCCCTGCGC GCCACCAGCC CCGAGGACCT CGCCGCCGTA
TTGCCGCTCG CCGCGGACAG CCCCATCCAC ATCCATATCG CCGAACAGCC CCGCGAAGTG
GCCGAGATCA AGGTCTGGCT GGGAGCGCGC CCCGTAGAGT GGCTGCTCGG GAACGCCCCC
GTAGACAATC AATGGTGCCT GATCCACGCC ACCCACATGA CCGAGACCGA AACCCGGCAC
ATGGCCCATT CCGGCGCGGT TGCCGGGCTT TGCCCCATCA CCGAGGCAAA CCTCGGAGAT
GGCCCGTTTA ACGGCGCGCA CTATCTGCGC GAGGGAGGAC GCTTTGGTGT GGGATCGGAC
TCAAATGTAC GGATCTCCCT CGTTGAAGAG CTGCGCACGC TGGAATACTC CCAACGCCTT
CGGGATCTCG CCCGCAACGT TCTGGTCCCG GCAGAGGGAT CTGTCGGTGA AACCCTCTAC
CTTGGCGCGG CAAGAGGGGG TGCGCAGGCT TTGGGCCGCG ATGCCGGTCG GCTCGAGATT
GGCGCCCTTG CTGATCTGGT GGCGATTGAT TGCGCACGTC CTGCTCTTTT TGGGCTTCCA
GAGCATCAAA TCCTGGATGG GCTGTGTTTT GCAGCGGATG ATCATAGCGT CACCGACGTC
TGGGCCGCAG GGCGTCATAT GGTACAAACA GGTCGTCACA TCGCGCGAGA CAGCATTCTT
GCCAGCTATC GCAAGGCGAT CACGTCTCTT TTGGCGGAAC TCTAA
 
Protein sequence
MQTIFAATAC LPEGWAKDVR LTIKDGQISR IETGSAPVPG DTRVDVLLPA LANLHSHSFQ 
RAMAGRTEFR GAGQDSFWTW RELMYRFLDH LTPDQYEAIA ALTFMEMMEA GYASVGEFHY
VHHQPGGTAY QSLSELSQRV MAGAQQTGIG LTHLPVLYTY GGAQQQPLTG GQMRFGNDVE
RFSRLVTEAR DAAQELGRDT RVGIAPHSLR ATSPEDLAAV LPLAADSPIH IHIAEQPREV
AEIKVWLGAR PVEWLLGNAP VDNQWCLIHA THMTETETRH MAHSGAVAGL CPITEANLGD
GPFNGAHYLR EGGRFGVGSD SNVRISLVEE LRTLEYSQRL RDLARNVLVP AEGSVGETLY
LGAARGGAQA LGRDAGRLEI GALADLVAID CARPALFGLP EHQILDGLCF AADDHSVTDV
WAAGRHMVQT GRHIARDSIL ASYRKAITSL LAEL