Gene Smal_4032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_4032 
Symbol 
ID6474926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp4557800 
End bp4559041 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content66% 
IMG OID642733245 
ProductMembrane dipeptidase 
Protein accessionYP_002030414 
Protein GI194367804 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.771689 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.591323 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTCGT TGCGTCGCTT GTCCGTAGTG CTTGCCCTGG CGCTTTGCGC GCCGCTGTCC 
GCCAACGCCA TTGAGTTCAG CGCGCAGGAG CTGGCCCGTG CCAAGGCACT GCAGCAGCGC
CTGGTCACTC TCGACAGCCA CCTGGATACA CCGGCCAACT TCGGTCGCAG CGGCTTTGAC
ATCGAGCAGC GCCACGACCG CAACGCGCTT TCGCAGGTGG ACTACCCGCG CATGGTCGAA
GGTGCGCTGG ATGGTGGCTT CTGGGCGATC TACACCGACC AGGGCGATCG CAGTGCCGCC
GCGCACTTGG CCGAGCGTGA CCACGGCCTG CAGCGGCTGC TGCAGATCCG CGAGATGCTG
GCGGCCAATC CCGATCGCTT CGCGCTGGCG CTGACCGCCG ATGATGCGGC GCGGATCAAG
GCCGCGGGCA AGCGCGTGGT CTACATCAGC ATGGAAAATG CCAGCCCGCT GGTGGCCGAT
CCCAGCCTGC TGTCCTTCTA CCATCGCGCC GGCCTGCGCC TGCTCAGCAC CGTGCACTTT
GCCAACAACG AGTTCGCCGA TTCGGCCACC GACCCCAAAG GCGCGGAGTG GAAGGGCCTG
AGCCCGGCCG GCAAGGATCT GGTACGGCAG GCAGTGAAGC TGGGCATCGT GATCGACCAG
TCGCATGCGT CCGATGCAGT GTTCGACGAC CTGCTGGCGA TGATGCCGGT GCCGTTCGTG
CTGTCGCACA GTTCGGCCAA GGCGGTCTAC AACCATCCAC GCAACCTTGA TGATGCGCGG
TTGCGCTCGC TGGCCAAGGC CGGCGGCGTG ATCCAGGTGA ATGCCTATGG CGGCTACCTG
ATCGACACGG CAAAGACGCC CGAGCGCAAG CAGGCCGAAG AAGCGCTGAG CAAGCAGCTC
GGCGGCTGGG AAGGCATGGG CATCGAACAG GGCGTTGCGC TGTTGAAGGC CGAGCAGGCG
CTGGACCACG AGCATCCGGT GCGGCATGCC AGCCTGGACG ATTTCTTCGC CCACTTCGAG
CACATCCTCA AGGTGGTCGG TCCCGAGCAC GTGGGCATCG GCCTGGACTG GGACGGCGGC
GGTGGCCTGA GTGACCTGCC CGATGTCAGC CAGCTGCCGA AGATCACCGC GTGGCTGCTG
CGCAAGGGCT ATACCGAGAA GCAGATTGCC GGCATCTGGG GCGGCAACCT GCTGCGGGTG
ATGCGCCAGG CGCAGGATTA CGCTGCAAAG CAGGGCGGTT GA
 
Protein sequence
MPSLRRLSVV LALALCAPLS ANAIEFSAQE LARAKALQQR LVTLDSHLDT PANFGRSGFD 
IEQRHDRNAL SQVDYPRMVE GALDGGFWAI YTDQGDRSAA AHLAERDHGL QRLLQIREML
AANPDRFALA LTADDAARIK AAGKRVVYIS MENASPLVAD PSLLSFYHRA GLRLLSTVHF
ANNEFADSAT DPKGAEWKGL SPAGKDLVRQ AVKLGIVIDQ SHASDAVFDD LLAMMPVPFV
LSHSSAKAVY NHPRNLDDAR LRSLAKAGGV IQVNAYGGYL IDTAKTPERK QAEEALSKQL
GGWEGMGIEQ GVALLKAEQA LDHEHPVRHA SLDDFFAHFE HILKVVGPEH VGIGLDWDGG
GGLSDLPDVS QLPKITAWLL RKGYTEKQIA GIWGGNLLRV MRQAQDYAAK QGG