Gene Sare_0998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0998 
Symbol 
ID5704680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1121802 
End bp1123160 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content75% 
IMG OID641270513 
ProductN-formimino-L-glutamate deiminase 
Protein accessionYP_001535900 
Protein GI159036647 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02022] formiminoglutamate deiminase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.648434 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.167136 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCGTT GGCTCACCGA GTACGCGTGG CTTCCCGAGC ACCCTGAACC GACCCCGGAC 
GTGCTGATCG AGACCACCGG CGGCCGGATC ACCGGAATCA CCCCGCTGGC GGCCGAAAGC
CGGCCGACCG CCGGGGTCGA GGTCCTCGCC GACGCGGTCC GCCTGCCCGG ACTCACCCTG
CCGGGGCTGG CCAACACGCA CTCGCACGCC TTCCACCGCG CGTTGCGGGG CCGCACCCAC
GGCGGTCGCG GGGACTTCTG GACCTGGCGC GACCGGATGT ACGAGGTGGC CACCCGGCTG
GACCCGGACA GCTACCTCGC CCTCGCCCGC GCCGCCTACG CGGAGATGGC GCTGGCCGGA
ATCACCTGCG TCGGCGAGTT CCACTACCTG CACCACGGCC CGGACGGCAC CCCGTACGCC
GACCCGAACG CGATGGGATC CGCCCTGGTC GAGGCGGCGG CGCAGGCCGG GATCCGGCTG
ACCCTGCTGG ACGCCTGCTA CCTGACCGCC ACCGTGGCCG GCGATCCGCT GGTCGGACCA
CAACGGCGCT TCGGGGACGG TGACGCCCAC CGCTGGGCGG AGCGGGCGGC GGCGTTCGCC
CCCACCGGCG CGCACCTCCG GGTCGGCGCC GCGATCCACT CGGTGCGCGC CGTGCCCGCC
GACCAACTGG CGACGGTGGC CGCCTCCGCG AACGACCGGG ACATGCCGCT CCACGCGCAC
CTCTCCGAGC AGCCGGCCGA GAACGACGCC TGCCGAGCCG AGCACGGCTG CACCCCCACC
CGGCTGCTGG CCGACCGGGG AGCGCTCGGC CCACACACCA CCGTCGTCCA CGCCACGCAC
CCCACCAGCT CGGACATCAC CGTGCTCGGG GACAGCCGTA CCCGGGTCTG CCTCTGCCCC
ACCACCGAGC GGGACCTCGC CGACGGGATC GGACCGGCCC GGCGAATGGC CAACGCCGGC
AGCGCACTGA GTCTCGGCAG CGACAGCCAC GCGGTGGTCG ACCTCTTCGA GGAGGCGCGC
GCGGTGGAGC TGGACGAACG CCTGCGCACC CGGCAACGCG GCCACTTCAC CGCCGGCGAG
TTGGTCACCG CGGCCACCGT CGCCGGACAC GTCGCCCTCG GATGGGGCGA CGCCGGCCGG
CTGGCCGTCG GCGACCGGGC CGACCTGGTC ACCGTCCGGC TGGACAGCCC CCGGACCGCG
GGCGTACCAG CGGCCGGAGC GTTCTTCGCC GCCACCGCGG CGGACGTCAG CCAGGTGGTG
GTGGACGGCC AGGTGGTGGT GCGAGACGGG CGGCACCAGA TGGTGGACGT GCCCGCCGAA
CTGGCCACGT CGATCGCGGA GGTGACCGGG ACACCATGA
 
Protein sequence
MTRWLTEYAW LPEHPEPTPD VLIETTGGRI TGITPLAAES RPTAGVEVLA DAVRLPGLTL 
PGLANTHSHA FHRALRGRTH GGRGDFWTWR DRMYEVATRL DPDSYLALAR AAYAEMALAG
ITCVGEFHYL HHGPDGTPYA DPNAMGSALV EAAAQAGIRL TLLDACYLTA TVAGDPLVGP
QRRFGDGDAH RWAERAAAFA PTGAHLRVGA AIHSVRAVPA DQLATVAASA NDRDMPLHAH
LSEQPAENDA CRAEHGCTPT RLLADRGALG PHTTVVHATH PTSSDITVLG DSRTRVCLCP
TTERDLADGI GPARRMANAG SALSLGSDSH AVVDLFEEAR AVELDERLRT RQRGHFTAGE
LVTAATVAGH VALGWGDAGR LAVGDRADLV TVRLDSPRTA GVPAAGAFFA ATAADVSQVV
VDGQVVVRDG RHQMVDVPAE LATSIAEVTG TP