Gene Smal_3933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_3933 
Symbol 
ID6474817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp4426541 
End bp4428910 
Gene Length2370 bp 
Protein Length789 aa 
Translation table11 
GC content67% 
IMG OID642733136 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_002030315 
Protein GI194367705 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGAG TGCTGCCCCT GTCCCTCCTG CTGTCGGCCG TGCTGGCCGC ACCGGCTGCG 
CATGCCGCGC CGACGCCGAT CACCATCGAA CAGGCCATGG CCGACCCGGA CTGGATCGGT
CCGCCGGTCG AGAAGGCCTG GTGGTCGTGG AACAGCCAGC AGGTGGAATA CCAGCTCAAG
CGCAGCGGCA GCCCGGTGCG CGATACCTTC CGCCAGCCGG TGGCGGGTGG CGTGGCCGCA
CAGGTGGCCG ATGACCAGCG CGGCACCCTG GACGTGGCCG AGCCGGTCTA CGACCGCAGC
CGCAGCCGCA GCGCCTTCGT CCGCAATGGC GACGTGTTCG TGCGCGACCT GCGCAGCGGC
GCGCTGACGC AGCTGACCCG CAGCAACGAG CGCGCGGGCA ACGTGAACTT CGCCGCCGAC
AATGGCGTGA TCTGGCGCGT CGGCCAGAAC TGGTTCCACT GGACCGCCGC CAGCGGCGTG
CAGCAGGTGG CCAGCCTGAA GGCCGAAAAG GACCCACGTA CGCCGCCGAA GGCCGACGTG
CTGCGCGACC AGCAGCTGCG CACGCTGGAA ACCCTTCGCC GCGACCGTGA CCAGCGCGAA
GCACTGAAGG ACCAGGACCA GCGCTGGCGC CAGGCTGACC CGACCCGCGC GCCGGGCCCG
GTGTACCTGG GCGCCGACGT GGAGATCGCC GACAGCGTGC TGTCGCCTGA CCTGACCCAC
CTGGTCGTGG TGACCAAGCC GAAGGACTTC GATGACGGCC GTGGCGGCAA GATGCCGCTG
TACGTGACCG AGTCGGGCTA CGAGGAAACC GAAGACACCC GCACCCGCGT CGGCCGTAAC
GGCTTCGAGC CGCACACGCT GTGGTACGTG GACGTGCGCA CCGGCAAGGC CGAGAAAGTC
TCGCTGGCCG GCCTGCCGGG CATCAGCACC GATCCGCTGG CCGAGCTGCG CCGCAAGGCC
GGCAAGGACG CGCTGAAGGG CGAGCGCAGC CTGCAGGTGA TGAGCGACTT CATGGGCGGT
GGCATCCGCT GGAGCCCCGA TGGCCAGCAG GCCGCGGTGA TGCTGCGTGC CAACGACAAC
AAGGACCGCT GGATCATCAG CGTCAACGCT ACCGATGGCC GCGTGCAGAA CCGCCATCGC
CTGACCGACA GCGGCTGGAT CAACTGGGGC TTCAACGATT TCGGCTGGAT GGCCGATGGC
CGCACGCTGT GGCTGCTGTC CGAGGAATCG GGCTTCTCGC ACCTGTACAC CCAGGCCGGC
ACAGCCAAGC CGCAGGCGCT GACCAGCGGC AAGTGGGAAA CCTCGGCGCC GGTGCCGTCG
GCCGATGGCA AGGGCTTCTA TTTCCTGTGC AACCAGCAGG CACCGCATGA CTACGAAGTC
TGCGCGGTGG ACACCGGCAA CCGCCAGGTG CGCGAGCTGA CCAGCCTCAA CGGCGTGGAA
GACTTCTCGT TGTCGCCGGA CGGCCAGCAG CTGCTGGTGC GCTATTCCGG TGCCTACCTG
CCGCCGCAGC TGGCCGTGCT GCCGAGCACC GGTGGCCAGG CCCGCGTACT GACCGACACC
CGCACTGCCG ACTACAAGGC GCGCCAGTGG ATCCAGCCGA AGCTGGTGGC GGTGCCGTCC
AAGCACGGTG CCGGCGTGGT CTGGGCCAAG TACTACGAAC CGGAAAACAA GGAACCCGGC
AAAAAGTACC CGATCGTGAT GTTCGTGCAC GGTGCCGGCT ACCTGCAGAA CGTGCACCAG
CGCTACCCGG CCTACTTCCG CGAGCAGATG TTCCACAACC TGCTGGTGCA GAAGGGCTAC
ATCGTGCTGG ACATGGATTA CCGTGGCAGC GAGGGCTACG GCCGCGACTG GCGAACGGCG
ATCTACCGCA ACATGGGCCA CCCGGAACTG GAAGACTACA AGGACGGCCT GGACTGGCTG
GTCGATACCC AGCAGGGTGA CCGCGATCAT GCCGGCATCT ACGGCGGTTC CTACGGCGGC
TTCATGACCT TCATGGCCCT GTTCCGCTCG CCGGGCACGT TCAAGGCCGG CGCCGCGCTG
CGCCCGGTGG TCGACTGGCA CCAGTACAAC CACGGCTATA CCAGCAACAT CCTCAACACC
CCGGACATCG ATCCGGAGGC GTACCGCGTG TCCTCGCCGA TCGAGTACGC GCAGAACCTG
CAGGACAACC TGCTGATCGC CCACGGCATG ATGGATGACA ACGTGTTCTT CCAGGACTCG
GTGAACCTCA CCCAGCGCCT GATCGAACTG CACAAGGACA ACTGGTCGAT CGCACCGTAC
CCGCTGGAGC GCCACGGCTA CGTGCGCGCC GATTCCTGGC TGGACCAGTA CAAGCGCATC
CTCAAGCTGT TCGAGCAGAA CCTGAAGTGA
 
Protein sequence
MSRVLPLSLL LSAVLAAPAA HAAPTPITIE QAMADPDWIG PPVEKAWWSW NSQQVEYQLK 
RSGSPVRDTF RQPVAGGVAA QVADDQRGTL DVAEPVYDRS RSRSAFVRNG DVFVRDLRSG
ALTQLTRSNE RAGNVNFAAD NGVIWRVGQN WFHWTAASGV QQVASLKAEK DPRTPPKADV
LRDQQLRTLE TLRRDRDQRE ALKDQDQRWR QADPTRAPGP VYLGADVEIA DSVLSPDLTH
LVVVTKPKDF DDGRGGKMPL YVTESGYEET EDTRTRVGRN GFEPHTLWYV DVRTGKAEKV
SLAGLPGIST DPLAELRRKA GKDALKGERS LQVMSDFMGG GIRWSPDGQQ AAVMLRANDN
KDRWIISVNA TDGRVQNRHR LTDSGWINWG FNDFGWMADG RTLWLLSEES GFSHLYTQAG
TAKPQALTSG KWETSAPVPS ADGKGFYFLC NQQAPHDYEV CAVDTGNRQV RELTSLNGVE
DFSLSPDGQQ LLVRYSGAYL PPQLAVLPST GGQARVLTDT RTADYKARQW IQPKLVAVPS
KHGAGVVWAK YYEPENKEPG KKYPIVMFVH GAGYLQNVHQ RYPAYFREQM FHNLLVQKGY
IVLDMDYRGS EGYGRDWRTA IYRNMGHPEL EDYKDGLDWL VDTQQGDRDH AGIYGGSYGG
FMTFMALFRS PGTFKAGAAL RPVVDWHQYN HGYTSNILNT PDIDPEAYRV SSPIEYAQNL
QDNLLIAHGM MDDNVFFQDS VNLTQRLIEL HKDNWSIAPY PLERHGYVRA DSWLDQYKRI
LKLFEQNLK