Gene Smal_2065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_2065 
Symbol 
ID6476243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp2316733 
End bp2319624 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content64% 
IMG OID642731247 
ProductTonB-dependent receptor 
Protein accessionYP_002028452 
Protein GI194365842 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCGAC CCGCCGGAGC AGTGCACAGC ATCTCGTTGT CCCCCACACC CCTGATCGTG 
GCGCTGCTCG CCGTGCTGGC GGGCCTGCCC ACGGCCCAGG CGCAATCCAG CAAGGACGCG
GCAGGGCAGG ATCCCACCAC GCTGGACCAG GTGCAGGTCA CCGGCATCCG CGAGTCCATG
CAGAGTTCGA TCAACAAGAA GCGCGACGAT ACGGTCATTG CCGATGTGCT CTCCGCCGAC
GACATCGGCG ACCTGCCGGC GCCGTCGCTG GCCGATGCCA TCGAGACCCT GACCGGTGCC
GCCTCGACCC GCGACAAGAC CGGTGCCTCG GAAATCTCGA TCCGTGGCCT GGGCGCCTTC
CTCAGCAGCA CCAATTTCAA CGGCCGCGAG ATCACCAACG GCAGTGGCGA CCGCTCGGTG
AACTTCAATA TGTTCCCGGC TGAGCTGATC AACACCGTGG CCATCTACAA GAGCCAGCGC
GCCGACATCA TCGAAGGTGG CGTGGCTGGC ACCATCGGCC TGGAAACGGT ACGTCCATTG
GAGTACGGCA AGCGCTCGGC GCAGATCGAT ATGCGTGGCA GCTGGGCCGA GTATGACAAG
AAGTACCGCG ACGACGACGG CATCGGCTAC CGTGGCACCG CCAGCTACAT CGACCAGTTC
GAGTTCGGCA ACGGGCAGAA GCTGGGCATA TCGCTGGGCT TCCAGCGCCT GGAAGGTACC
GATCCGGAAG AAAGCATCAC CAGCGGTTCC ACCTGGTATG CCTGCGATGG CACGCAGAAC
GTGGCCAATG CCAACTGTGG CGAAGTCAGT GCACAGGCCA TCGCCAACGG TGCGCCGTAT
TACCTGGTGC CGAGCAGCCG CATCTACCGC CTCAAGCAGG AGCGCAACGA TCGGCAGAGC
GAGTTCGCCG CACTGCAGTG GCGGCCCAAT GACGTGGTCG AATTGAATGT CGACTTTGAG
CACACCCAGC GCAACTGGTA CGAGAACCGC AGTGACCTGA GCCTGTCCAA TGCGCGCCGC
GGCATTACCC AGCGCGAGGT GGACGACGAG GGTATCGTGC GCCACCTGCA TGGCAGCACT
TCGATCGATT CCACCTCCAA CCGCTATTGG CGCGGTGAGG AATACACCGG TGGTGGCCTG
AATCTGATCC TGCGGCCAAG CGCGGCGTGG GAACTGTCCA CCGATCTTTC CTACTCGCAC
ACCAATCGTC TCGACAGCGA GCGCATGACC CGGCTGCGCG CCAACCAGCG CGACGTCAAC
AATGCCATCG TTCCCGGCAT CAGCAGTGGC GCCACCGGGT ACGTGGACTA CGACTGGGAC
TGGCATGGCG AAGTGCCCAG TGTTGCCCTG GCGCCGAACT TCGACCCCAA CAACTGGGAC
GCCTACACCG GCGCGGCGCG CGTCACCTCC AGCGCAACGG AGAACGACCA CAGGATCAAG
GCCGGCCGCT TCGATGCCAG CTTCATGCCG GAGTCCGGTT TCTTCACCCG CATCAAAGGC
GGCGTGCGCG CCAGCCAGGC CGACTACCGC CTGCGCGACA ACACCCTGGT AACCGACTAC
GACCAGCGCG TGGCGGCCGA CAAAGCGAAG ATCATCGCCG CCAACCAGGC CTGCCGCGCG
CCGTTCCCGC AGGATGACTT CATGGATGCG GCCAGCGGCA ACACGATTTC GTCCTGGGCG
TACTTCGATC CGAACTGCCT GTACCAGTCG TTCCGTGGCA GCCTGGACAG CGGCCTCGAT
CCGGGCTTCC AGGATCCGAA CAATGTCGAT ATCACCGAGA AGACCCGCGC GCTCTACCTG
ATGGGCGAGT TCAGCAGCAC GCTGTTTGGG CTGCCGGTGA CCGGCAACCT TGGCCTGCGC
TGGGTGAAGA CCGATGTGCG TTCCGAAGGG GTGCGTACCG GCCTGCGCAT TGAAGACAAC
GGAGACGGTA CGATCCGTCT GCAGCCCACC GGCGACTACA GCACCCAGGT GTTCAAGGCG
GGCAACGACA AGCTGCTGCC CAGCCTGAAC GCGGCTTTCG AACTGCGGCC GGACCTGTTG
CTGCGGGTCG GCGCCTACCG CGCGATGTCA CGGCCGGATA TCGCCGCGCT GGGCGCAGGG
CGCACCATCA ACGTCAGCAG CGATGCCACC TACGGCAATC TGGCCGATGC GCTGGACGAC
ATCAGCGCCA GTGGCAACCC AGCTGCACAA CCCCTGATGT CCTGGAACGG AGACCTGTCG
CTGGAGTGGT ACCCGAACCC GGACACCCTG CTGGCCGGTG CGGTGTACTG GAAACAGTTC
AACGGCGGCA CCGCCACGGC GCTGGTGCCG GAGACCTACA CCATCGATGG ACAGAGCGTG
ACCGTGCCGG TGCGGCAGCA GGTGACCACC GACGAAGACA GCACGCTGAC CGGCTTCGAG
CTGAGCGCTA CCCATCGACT GTCGTACCTG CCCAAGCCGT TCGACGGGCT GGGCTTCAAG
GTCAGCTACA ACTACGCCGA TGCCGACTAC CACACCCAGG ATTCGCGGCT GGGCGAGCAG
CTGGCCAGTG ATGGCACGAT CATTCCGGCC ATCGTGCCAC CGGCGGGTCT GAGCGGTTTC
TCGCGGCACG TGCTGTCCGG TTCGATCTAC TGGGACGTTG GGCGCTTCAA CATCCAGGCC
ATCGGCAAGT TCCGCTCGCA CTATTACCAG GACTTCACTG GCAATACCGC TCAGCAGAAC
CGCTACTACG ACGACAACAC CAGCGTCGAC CTGCGCCTGC GTTACCGGGT GAACAAGCAG
TTGTCGCTGT CGCTGGAACT GATGAACCTG ACCAACGAGC CGCGCGTGGC CTACCAGCCG
CTTTATGGGA ACTTCCGCGA AGTGGTGACC TACGGGCGGC GTGCGTATTT CGGTGTACGA
TACAAGTTCT GA
 
Protein sequence
MRRPAGAVHS ISLSPTPLIV ALLAVLAGLP TAQAQSSKDA AGQDPTTLDQ VQVTGIRESM 
QSSINKKRDD TVIADVLSAD DIGDLPAPSL ADAIETLTGA ASTRDKTGAS EISIRGLGAF
LSSTNFNGRE ITNGSGDRSV NFNMFPAELI NTVAIYKSQR ADIIEGGVAG TIGLETVRPL
EYGKRSAQID MRGSWAEYDK KYRDDDGIGY RGTASYIDQF EFGNGQKLGI SLGFQRLEGT
DPEESITSGS TWYACDGTQN VANANCGEVS AQAIANGAPY YLVPSSRIYR LKQERNDRQS
EFAALQWRPN DVVELNVDFE HTQRNWYENR SDLSLSNARR GITQREVDDE GIVRHLHGST
SIDSTSNRYW RGEEYTGGGL NLILRPSAAW ELSTDLSYSH TNRLDSERMT RLRANQRDVN
NAIVPGISSG ATGYVDYDWD WHGEVPSVAL APNFDPNNWD AYTGAARVTS SATENDHRIK
AGRFDASFMP ESGFFTRIKG GVRASQADYR LRDNTLVTDY DQRVAADKAK IIAANQACRA
PFPQDDFMDA ASGNTISSWA YFDPNCLYQS FRGSLDSGLD PGFQDPNNVD ITEKTRALYL
MGEFSSTLFG LPVTGNLGLR WVKTDVRSEG VRTGLRIEDN GDGTIRLQPT GDYSTQVFKA
GNDKLLPSLN AAFELRPDLL LRVGAYRAMS RPDIAALGAG RTINVSSDAT YGNLADALDD
ISASGNPAAQ PLMSWNGDLS LEWYPNPDTL LAGAVYWKQF NGGTATALVP ETYTIDGQSV
TVPVRQQVTT DEDSTLTGFE LSATHRLSYL PKPFDGLGFK VSYNYADADY HTQDSRLGEQ
LASDGTIIPA IVPPAGLSGF SRHVLSGSIY WDVGRFNIQA IGKFRSHYYQ DFTGNTAQQN
RYYDDNTSVD LRLRYRVNKQ LSLSLELMNL TNEPRVAYQP LYGNFREVVT YGRRAYFGVR
YKF