Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3003 |
Symbol | |
ID | 4078033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 3170563 |
End bp | 3171927 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638008332 |
Product | N-formimino-L-glutamate deiminase |
Protein accession | YP_614997 |
Protein GI | 99082843 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02022] formiminoglutamate deiminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.54016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.775669 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACAA TTTTTGCCGC CACCGCGTGT TTGCCTGAGG GCTGGGCGAA GGATGTCCGC CTCACCATCA AGGACGGCCA AATCTCTCGG ATCGAGACCG GCTCAGCGCC CGTGCCCGGC GACACGCGCG TCGATGTGCT CTTGCCAGCG CTTGCCAACC TGCATTCACA TTCGTTCCAG CGCGCGATGG CGGGCCGCAC CGAATTTCGG GGGGCCGGTC AAGACAGTTT CTGGACCTGG CGAGAGCTGA TGTATCGCTT TCTCGACCAT CTCACCCCCG ATCAATACGA GGCCATTGCG GCACTTACAT TCATGGAAAT GATGGAGGCC GGGTATGCCT CTGTGGGTGA GTTCCACTAT GTTCACCATC AGCCGGGCGG TACCGCCTAC CAGAGCCTCA GCGAACTCAG TCAACGCGTG ATGGCCGGCG CCCAGCAAAC CGGCATTGGC CTCACCCATC TGCCGGTGCT CTACACCTAC GGAGGCGCAC AACAGCAACC ACTTACCGGT GGCCAGATGC GCTTTGGCAA TGATGTGGAG CGCTTCTCGC GTCTTGTCAC CGAAGCCCGC GATGCGGCGC AAGAGCTTGG GCGCGACACG CGTGTCGGTA TCGCGCCGCA TTCCCTGCGC GCCACCAGCC CCGAGGACCT CGCCGCCGTA TTGCCGCTCG CCGCGGACAG CCCCATCCAC ATCCATATCG CCGAACAGCC CCGCGAAGTG GCCGAGATCA AGGTCTGGCT GGGAGCGCGC CCCGTAGAGT GGCTGCTCGG GAACGCCCCC GTAGACAATC AATGGTGCCT GATCCACGCC ACCCACATGA CCGAGACCGA AACCCGGCAC ATGGCCCATT CCGGCGCGGT TGCCGGGCTT TGCCCCATCA CCGAGGCAAA CCTCGGAGAT GGCCCGTTTA ACGGCGCGCA CTATCTGCGC GAGGGAGGAC GCTTTGGTGT GGGATCGGAC TCAAATGTAC GGATCTCCCT CGTTGAAGAG CTGCGCACGC TGGAATACTC CCAACGCCTT CGGGATCTCG CCCGCAACGT TCTGGTCCCG GCAGAGGGAT CTGTCGGTGA AACCCTCTAC CTTGGCGCGG CAAGAGGGGG TGCGCAGGCT TTGGGCCGCG ATGCCGGTCG GCTCGAGATT GGCGCCCTTG CTGATCTGGT GGCGATTGAT TGCGCACGTC CTGCTCTTTT TGGGCTTCCA GAGCATCAAA TCCTGGATGG GCTGTGTTTT GCAGCGGATG ATCATAGCGT CACCGACGTC TGGGCCGCAG GGCGTCATAT GGTACAAACA GGTCGTCACA TCGCGCGAGA CAGCATTCTT GCCAGCTATC GCAAGGCGAT CACGTCTCTT TTGGCGGAAC TCTAA
|
Protein sequence | MQTIFAATAC LPEGWAKDVR LTIKDGQISR IETGSAPVPG DTRVDVLLPA LANLHSHSFQ RAMAGRTEFR GAGQDSFWTW RELMYRFLDH LTPDQYEAIA ALTFMEMMEA GYASVGEFHY VHHQPGGTAY QSLSELSQRV MAGAQQTGIG LTHLPVLYTY GGAQQQPLTG GQMRFGNDVE RFSRLVTEAR DAAQELGRDT RVGIAPHSLR ATSPEDLAAV LPLAADSPIH IHIAEQPREV AEIKVWLGAR PVEWLLGNAP VDNQWCLIHA THMTETETRH MAHSGAVAGL CPITEANLGD GPFNGAHYLR EGGRFGVGSD SNVRISLVEE LRTLEYSQRL RDLARNVLVP AEGSVGETLY LGAARGGAQA LGRDAGRLEI GALADLVAID CARPALFGLP EHQILDGLCF AADDHSVTDV WAAGRHMVQT GRHIARDSIL ASYRKAITSL LAEL
|
| |