Gene Smal_2348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_2348 
Symbol 
ID6476831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp2639974 
End bp2641167 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content69% 
IMG OID642731529 
Productcytosine deaminase 
Protein accessionYP_002028734 
Protein GI194366124 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.10962 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.523388 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCCAG TCTTGGCTTT CCGGGAATTC CCGATGACCC TGCAGTTCAT CCACGGCGGT 
GTCGACCGCG ACGGTGCACC GCTGCACTTC CACATCCGTG ATGGCCGCAT CAGCGGGCTC
AACGGCCATA CCGGCCCGGC TGAAGGGGCG GAAAGCGTCG ATCTGCAGGG CTTTGCGGTG
CTGCCGGGGT TGGTGGATGG CCACATCCAC CTGGACAAGA GCTTCGTCGG CGACCGTTGG
CACCCGCATC AGCCGGTGAA CAGCCTGCGT GAGCGGTTGG CGGTGGAAAA GGCGGCGATG
GCGGCTGCGG CGCCGATGGT TGATCGCGCC GAGGCGCTGA TCCGCCAATG CAGCGGCTTC
GGCACAGTGG CAATGCGCTG CCATGTCGAT ATCGATGGCA GCACCGGCCT GCGTCACCTG
GAAGCGGTGC GCGAGGCCGC GCTGCGCTGC GCCGGCATCA TGCGCATCCA GCTGGTGGCG
TTCCCGCAGG CCGGCGTGAT GTCCTGCGCT GGTACCGCAG CGGTGCTTGA GCAGGCCCTT
GCAGCGGGTG TGGACGTGCT GGGCGGCATC GATCCGACCA CGCTCGATGG CGATGCCGAG
GGCCAGCTGG CACTGTTGTT CGGCCTGGCC GAGCGCTATG GCGTGCAGCT GGACATCCAC
CTGCATGAGC CCGGCGAAAC CGGACTGGCC CAGCTGCTGC GCATCGCCGC GCGGACCCGG
GCCGCCGGCC TGCAGGGAAG GGTGGCGGTC AGTCATGCTT ATTCGCTGGG CGAGGTGCCG
CTGGCACGCG CGCTGCAGGT GGGCGAGGCG CTGGCGACGG CGGGCGTGGC GATCATGAGC
AACGCGCCGG GCGATCATCC ATTCCCGCCG CTGCGCGCGC TGCATGATGC CGGTGTGCAC
GTGTTCGCGG GCAACGACAA CATCCGTGAC TGCTGGTGGC CGTATGGCAA TGGCGATCTG
TTGCAGCGGG CGATGCTGCT GGGCTACCGC TCCGGCTTCA ATACCGATGC CGACCTGATG
CTGGCGCTGG ACATGGTGAC CACCCATGCT GCACAGGTGA TCGGGCTGCC ACAGTACGGC
CTTGCAGAAG GACATCCGGC CACCTTCGTG GCGGTGCGTG CCGACCACGG CCCGGCCGCC
GTGGCCGGGG TGCCGGTAGA GCGCCGCGTG GTGGTCGACG GCCGCTGGCT GTAG
 
Protein sequence
MAPVLAFREF PMTLQFIHGG VDRDGAPLHF HIRDGRISGL NGHTGPAEGA ESVDLQGFAV 
LPGLVDGHIH LDKSFVGDRW HPHQPVNSLR ERLAVEKAAM AAAAPMVDRA EALIRQCSGF
GTVAMRCHVD IDGSTGLRHL EAVREAALRC AGIMRIQLVA FPQAGVMSCA GTAAVLEQAL
AAGVDVLGGI DPTTLDGDAE GQLALLFGLA ERYGVQLDIH LHEPGETGLA QLLRIAARTR
AAGLQGRVAV SHAYSLGEVP LARALQVGEA LATAGVAIMS NAPGDHPFPP LRALHDAGVH
VFAGNDNIRD CWWPYGNGDL LQRAMLLGYR SGFNTDADLM LALDMVTTHA AQVIGLPQYG
LAEGHPATFV AVRADHGPAA VAGVPVERRV VVDGRWL