Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0998 |
Symbol | |
ID | 5704680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1121802 |
End bp | 1123160 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641270513 |
Product | N-formimino-L-glutamate deiminase |
Protein accession | YP_001535900 |
Protein GI | 159036647 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02022] formiminoglutamate deiminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.648434 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.167136 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCCGTT GGCTCACCGA GTACGCGTGG CTTCCCGAGC ACCCTGAACC GACCCCGGAC GTGCTGATCG AGACCACCGG CGGCCGGATC ACCGGAATCA CCCCGCTGGC GGCCGAAAGC CGGCCGACCG CCGGGGTCGA GGTCCTCGCC GACGCGGTCC GCCTGCCCGG ACTCACCCTG CCGGGGCTGG CCAACACGCA CTCGCACGCC TTCCACCGCG CGTTGCGGGG CCGCACCCAC GGCGGTCGCG GGGACTTCTG GACCTGGCGC GACCGGATGT ACGAGGTGGC CACCCGGCTG GACCCGGACA GCTACCTCGC CCTCGCCCGC GCCGCCTACG CGGAGATGGC GCTGGCCGGA ATCACCTGCG TCGGCGAGTT CCACTACCTG CACCACGGCC CGGACGGCAC CCCGTACGCC GACCCGAACG CGATGGGATC CGCCCTGGTC GAGGCGGCGG CGCAGGCCGG GATCCGGCTG ACCCTGCTGG ACGCCTGCTA CCTGACCGCC ACCGTGGCCG GCGATCCGCT GGTCGGACCA CAACGGCGCT TCGGGGACGG TGACGCCCAC CGCTGGGCGG AGCGGGCGGC GGCGTTCGCC CCCACCGGCG CGCACCTCCG GGTCGGCGCC GCGATCCACT CGGTGCGCGC CGTGCCCGCC GACCAACTGG CGACGGTGGC CGCCTCCGCG AACGACCGGG ACATGCCGCT CCACGCGCAC CTCTCCGAGC AGCCGGCCGA GAACGACGCC TGCCGAGCCG AGCACGGCTG CACCCCCACC CGGCTGCTGG CCGACCGGGG AGCGCTCGGC CCACACACCA CCGTCGTCCA CGCCACGCAC CCCACCAGCT CGGACATCAC CGTGCTCGGG GACAGCCGTA CCCGGGTCTG CCTCTGCCCC ACCACCGAGC GGGACCTCGC CGACGGGATC GGACCGGCCC GGCGAATGGC CAACGCCGGC AGCGCACTGA GTCTCGGCAG CGACAGCCAC GCGGTGGTCG ACCTCTTCGA GGAGGCGCGC GCGGTGGAGC TGGACGAACG CCTGCGCACC CGGCAACGCG GCCACTTCAC CGCCGGCGAG TTGGTCACCG CGGCCACCGT CGCCGGACAC GTCGCCCTCG GATGGGGCGA CGCCGGCCGG CTGGCCGTCG GCGACCGGGC CGACCTGGTC ACCGTCCGGC TGGACAGCCC CCGGACCGCG GGCGTACCAG CGGCCGGAGC GTTCTTCGCC GCCACCGCGG CGGACGTCAG CCAGGTGGTG GTGGACGGCC AGGTGGTGGT GCGAGACGGG CGGCACCAGA TGGTGGACGT GCCCGCCGAA CTGGCCACGT CGATCGCGGA GGTGACCGGG ACACCATGA
|
Protein sequence | MTRWLTEYAW LPEHPEPTPD VLIETTGGRI TGITPLAAES RPTAGVEVLA DAVRLPGLTL PGLANTHSHA FHRALRGRTH GGRGDFWTWR DRMYEVATRL DPDSYLALAR AAYAEMALAG ITCVGEFHYL HHGPDGTPYA DPNAMGSALV EAAAQAGIRL TLLDACYLTA TVAGDPLVGP QRRFGDGDAH RWAERAAAFA PTGAHLRVGA AIHSVRAVPA DQLATVAASA NDRDMPLHAH LSEQPAENDA CRAEHGCTPT RLLADRGALG PHTTVVHATH PTSSDITVLG DSRTRVCLCP TTERDLADGI GPARRMANAG SALSLGSDSH AVVDLFEEAR AVELDERLRT RQRGHFTAGE LVTAATVAGH VALGWGDAGR LAVGDRADLV TVRLDSPRTA GVPAAGAFFA ATAADVSQVV VDGQVVVRDG RHQMVDVPAE LATSIAEVTG TP
|
| |