Gene Smal_4003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_4003 
Symbol 
ID6474897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp4522966 
End bp4526346 
Gene Length3381 bp 
Protein Length1126 aa 
Translation table11 
GC content67% 
IMG OID642733216 
Producthypothetical protein 
Protein accessionYP_002030385 
Protein GI194367775 
COG category[S] Function unknown 
COG ID[COG4913] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.417635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.020693 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATCT CCAAGCACAC TCCCGCCCTG TTCAACGAAG GCCTGCCGGA TCCGCGCCTG 
CAGCAGTTCC GCATGCGCCG CCTGCAGGTG CACAACTGGG GCACCTTCAA CGGCCTGACC
GAAGTGCCGA TCGCCGAGCG CGGCTTCCTG TTCGTCGGCC GTTCCGGCTC GGGCAAGTCG
ACCCTGCTCG ATGCCATGTC CGCGCTGCTG ACGCCGCCGG CCATCGTCGA CTTCAACGCT
GCCGCGCGTG AAGCCGAGCG CAGTGGCCGC GACCGCAACC TGGTGTCCTA TGTGCGTGGC
GCGTGGGCCG ACCAGCAGGA CAGCGGTACC GGCGAAATCG CTACGCAGTA CCTGCGCAAG
GGCGCGACCT GGACCGCACT GGTGCTGGAA TACCGCGCCG GCGACGGCCG CGTGGTCAGC
CTGGTGCGCC TGCTGTGGAT CTCCGGCAAC GGCACCTCGG CCGGCGACGT GCGCAAGCAC
TACATGATCG CCGAGCGTCC GTTCGACATC GCCAAGGATC TCGGCGGCTT CGACCTGGAC
CTGCGCAAGC TCAAGCAGAA GCTGGCCGAC CTGCACCACT TCGACACGTT CTCCGGCTAC
GCCGAGCGCT TCCGCGACCT GCTCGGCATC GACAACGAGA TGGCGCTGCG CCTGCTGCAC
AAGACGCAGT CGGCGAAGAA CCTGGGCGAC CTCAATGTCT TCCTGCGCGA CTTCATGCTC
GACACGCCGA AGACCTTCGA CGCCGCCGAG CGCCTGGTCA GCGACTTCGC CGAACTCGAT
GGCGCGCACC AGGCCGTGGT CACCGCTCGC CGCCAGGTGG AAACCCTGCT GCCGGCACGC
GCCTACTACG GCGACCTGAA GCAGATGCAC CGCCAGCGTG GCGACGATGA AGCGCTGAAG
CTCGGCGTCG ACAGCTTCCG CGAAAGCCGC CGCCAGGCAC TGATCGAAGC ACGCCTGCGC
GAGATGGACG TGCGTGACCG TGGCCTGCTC GGCGAAGAAG CCCAGCGCCG TGCCGCGCTC
GACAACCACA CCGAGCGCCT GGCCGAACTG GAGCTGCAGC GCCGTCAGCA GGGCGGTGAG
CGCATCGAGG AACTGGAACG TGAGCAGGGC CGTGCCGAAG CCGAGCGCGA CCGCCGCAAG
GCCAAGCGTG ACCAGGCCCA GGAAGCGGCA CAGCAGCTGC AGGCCGAACT GCCCGATGAC
GCCCATGGCT TCGCCGAACT GGTCGAGCGC GCGCAGAACG AACTGCAGGA CCGCCAGCGT
GCCTCGGCGG CACTGGACGA TGCGATCAGC GATCGCCTCG GCGGCAAGCG CGACGACGAG
CGCCGCTTCG GTGAAGTGCG CAGCGAACTC GAAGCGATGC AGCGCACCCC CTCCAACATT
CCGGCGCCGA TGCAGAAGCT GCGCGCACGC CTGGCCGAGG AAACCGGCAT CGCCGAAGCG
GCGCTGCCGT TCGTCGGTGA GCTGATCCAG GTCCGCTCGG AAGAGCAGGG CTGGCAGGGT
GCCATCGAGC GCGTGCTGGG CGGTTTCGCG TTGTCGCTGC TGGTCGATGA CAAGCACTAC
AACGAGGTCG CCGAGTGGGT GAATCGTACC CACCTGGGCA TGCGCTTCAC CTATTACCGC
GTGCGCCGCA ACGATGATGC GTTCGCCCGT GAGCCATCGG CGAAGTCGCT GCTGCACAAG
CTGGAACTGC GCGACCACGT GTTCGAAAGC TGGCTGCGGC GTGAGCTCGG CAAGCGCTTC
GACTACGAGT GCGTGGATGC CAAGCAGCTG CGCAACGTCG ACCGCGGCAT CACCCGCGAA
GGCCAGGTCA AGCATCCGGG CGACCGCTTC GAGAAGGACG ACCGCAGCGC GGTGGGTGAC
CGCCGTCGCT GGATCCTTGG CTTCAACAAC CACGACAAGG TGGGCGCCTT CGAGCGCGAA
GCACAGGAAC TGGCCAAGCG CATCGCCAGC TGCGAGACCG ATATCGCGCG GCTGCGCGGC
CAGCGTGACC GCGACAATGA ACGTCGCCTG GCCTGCCACG AACTGGTCAG CATCAGCTGG
AACGAGATCG ACATTGCCGC CCCGCAGCAG CGCCTGAGCG ACATCGAGGC CACCCTGCGC
GACCTGCGCG AAGGCAATGC CGATCTGGCC AAGCTGGCCA AGCAGATCGA CGCCGTGCGC
GCCGACATCG AGCAGTCCCG CCGCACCTAT GAGGACGTCC GCGTCGAGCG CGGCCAGCTG
GTCAAGGAAC GAGATCGCTT GGACCGTGCA CGCCAGCAGA GCCGTGCACT GGTGCTGCCG
AGCCTGGCCC AGGAGCACGA GGCTGGACTT GCCGAGCGCC TGCAGGAGCA GGGCCCGCTC
AGCCTGGAAA CGCTGGAAGC GCACATGCGC CAGGTCAGCA ACGCACTGAA CGAGCAGCTG
TCGTCTTCGC AGCAGGACCT CAACCGCATC GAGAACCAGC TGATCGGCTG CTTCCGCCGC
TTCATCCAGC AGTGGCCGGA GGAATCGGGC GACTTCACCG TGTCGGTGGC CTCGGCCGAA
GACTTCCTGG CCCGCCTCGA ACGCCTGGAG CGTGATGGCC TGCCGCAGCA CGAAGAGCGC
TTCTTCGACC TGCTGCAGAA CCAGAGCAAG AACAACCTGC TGGCGCTGCA GCGTCACAGT
GCCGAGGCCC GCAAGTCGAT CGGCCAGCGC CTGGACGAAG TGAACGCGAG CCTGGAACAG
GTGCCGTTCA ACCGCGGCAC GCTGCTGACC ATCGAACTGA ACGATCGCCG CCTGCCGGAA
GTCGGCGAGT TCCACCTGCA GCTGCGCGAG GTGCTGTCGC AGCAGCAGAC CGAGCAGCGT
GAGCTGGCCG AATCGCAGTT CACCGTGCTG CGCCAGCTGG TCAACCGCCT GGGCTCACAG
GAAGGCGAAG ACAAGCGCTG GCGCGAGCTG GTGCTGGACG TGCGCATGCA CGTGGAGTTC
ATCGGCGTCG AGCTGGATGC GGAGACGCGC CAGCAGGTCG AGATCTACCG CAGCGGCGCT
GGCAAGTCCG GCGGCCAGCG CCAGAAGCTG GCCACCACCT GCCTGGCCGC GGCCCTGCGC
TACCAGCTCG GTGGCGCCGA CAGCCAGCTG CCCAGTTACG CCGCGGTCGT GCTCGACGAA
GCCTTCGACA AAGCCGACAA CGAGTTCACC GCACTGGCGA TGAACATCTT CGACAACTTC
GGCTTCCAGA TGGTGGTCGC CACCCCGCTG AAGTCGGTGA TGACGCTGGA ACCGTTCATC
GGTGGCGCCT GCTTCGTCGA AATCAGCGGC CGCCACGACT CCGGTGTGCT GCTGATCGAG
TACGACGAAG AAGGCAAGCG CCTGAAGCTG CCCGAGCGCA GCCGCCAGCA GGCCAACGAA
CCGGAAGAAG CGGAGGCCTG A
 
Protein sequence
MAISKHTPAL FNEGLPDPRL QQFRMRRLQV HNWGTFNGLT EVPIAERGFL FVGRSGSGKS 
TLLDAMSALL TPPAIVDFNA AAREAERSGR DRNLVSYVRG AWADQQDSGT GEIATQYLRK
GATWTALVLE YRAGDGRVVS LVRLLWISGN GTSAGDVRKH YMIAERPFDI AKDLGGFDLD
LRKLKQKLAD LHHFDTFSGY AERFRDLLGI DNEMALRLLH KTQSAKNLGD LNVFLRDFML
DTPKTFDAAE RLVSDFAELD GAHQAVVTAR RQVETLLPAR AYYGDLKQMH RQRGDDEALK
LGVDSFRESR RQALIEARLR EMDVRDRGLL GEEAQRRAAL DNHTERLAEL ELQRRQQGGE
RIEELEREQG RAEAERDRRK AKRDQAQEAA QQLQAELPDD AHGFAELVER AQNELQDRQR
ASAALDDAIS DRLGGKRDDE RRFGEVRSEL EAMQRTPSNI PAPMQKLRAR LAEETGIAEA
ALPFVGELIQ VRSEEQGWQG AIERVLGGFA LSLLVDDKHY NEVAEWVNRT HLGMRFTYYR
VRRNDDAFAR EPSAKSLLHK LELRDHVFES WLRRELGKRF DYECVDAKQL RNVDRGITRE
GQVKHPGDRF EKDDRSAVGD RRRWILGFNN HDKVGAFERE AQELAKRIAS CETDIARLRG
QRDRDNERRL ACHELVSISW NEIDIAAPQQ RLSDIEATLR DLREGNADLA KLAKQIDAVR
ADIEQSRRTY EDVRVERGQL VKERDRLDRA RQQSRALVLP SLAQEHEAGL AERLQEQGPL
SLETLEAHMR QVSNALNEQL SSSQQDLNRI ENQLIGCFRR FIQQWPEESG DFTVSVASAE
DFLARLERLE RDGLPQHEER FFDLLQNQSK NNLLALQRHS AEARKSIGQR LDEVNASLEQ
VPFNRGTLLT IELNDRRLPE VGEFHLQLRE VLSQQQTEQR ELAESQFTVL RQLVNRLGSQ
EGEDKRWREL VLDVRMHVEF IGVELDAETR QQVEIYRSGA GKSGGQRQKL ATTCLAAALR
YQLGGADSQL PSYAAVVLDE AFDKADNEFT ALAMNIFDNF GFQMVVATPL KSVMTLEPFI
GGACFVEISG RHDSGVLLIE YDEEGKRLKL PERSRQQANE PEEAEA