Gene Smal_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_2047 
Symbol 
ID6476225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp2293406 
End bp2295817 
Gene Length2412 bp 
Protein Length803 aa 
Translation table11 
GC content64% 
IMG OID642731229 
ProductTonB-dependent receptor 
Protein accessionYP_002028434 
Protein GI194365824 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTGA TGGACCGGGG AATGACGAAC GTATCCGCTG CCATGCCACG CGCGCTCATG 
GCCCTCAGCA TTGCCGCTGC ACTGCTGCCC AGCCTGCCCG CACTGGCCCA GAACTCCGGC
AATGACCGCG GTTCGAAAGC GGCTGGCGCA GCGCCCCAGG CACCGACTGC GACCCTGGAT
CAGGTACTGG TGGTCGGCAG CAACATCCGC GATGCCATTG GCGGCGGCGC ATCCCCGATC
ATCGTGATCG ACAAGGAGGC CATCGACCGC ACCGGCGTGG CTACCGTGCA GCAGCTGTTC
GAGAAACTGC CACAGAACTT CGGCGGAGGC GCCAATGGTG CGAACGTCGC CAACCTTGGC
GTCGACCGCG ATACCGGCAA CAACTTCGGC CAGGGCACCG CCATCAACCT GCGTGGCCTG
GGAACCGGCA CCACGTTGAC CCTGATCAAC GGCCATCGCG TGACCTCGTC GAACCGCTAC
CAGTACGTAG ACGTCTCGCT GATACCGCTG AGTGCGGTGG AACGCGTTGA GATCCTCACC
GATGGTGCCT CGGCCATCTA CGGCACCGAC GCCGTCGGTG GTGTGGTCAA CATCATCCTG
CGCCGCGACT TCACCGGGTA TGAGACCGCT GTGCGCTACG GCACCGTCAC CGCCGGTGGC
ATGGAGGAGT ACCAGGCATC GCAGTCGGCA GGCTGGTCGT GGGATGGCGG CCATGTGCTG
GCCAGCTACG AGTTCCTGAA GCAGAGCAAC CTGCCGGCGG TGGACAAGGA CTTCTCGAAG
AATGTGCGGG TCAAACCGTA CGACCTTTAC CCCGGTTCGA AGCGCCACAG CCTGTATGTG
GACGGCGTCC AGCAGCTCAG CGATGTGCTG ACCCTGAACG TGACCGGCTC GTTCGCCAAG
CGCGAGATGG ATACCACCAT TTCCGGCACC GCCGACGAAA CCCGGCTGTT CCCGCACACG
CGGCAGTTCG ACCTGTTCGC CGGACTCACG CTGGATCTTC CGCGCCAGTG GCAGGCGCGC
CTGAATTCCG GGTTCGGCAG GAGTGACGTT TCCTATCAGC GCACCACCAT CACCGGCAGT
TCGGCCAGTA CGGCGCCACC GACCGATACG AACTCCGAAT CCCGCTACCT TGATGTGGTC
GCCGACGGCG AGTTGTTCAG CCTGCCCGCC GGGCCGATAC GCGCGGCGTT CGGCGCGGGC
TACCGCCGCG ATGGCTATGA GCTGATCGAT CATCGCGGCC TGGAAAAGCC ACTCGACCTG
CATCGAACGA TCAGGTCCGC CTTCGGCGAG CTCAACATCC CGCTGCTGAA GGACATGCCG
GGCGTGCGCA GCCTGTCCTT CACTGCGGCA GCGCGCTATG ACGATTACAG CGATTTCGGT
TCCACGCTGA ATCCCAAGCT CGGCCTGCTG TGGGAGGCGA CCCAGGGGCT TTCGTTCCGT
ACCAGCTATG GTCGTTCCTA CCGGGCGCCG GTGTACCAGG ACATGCAGTT GAACAACACC
GTGGTGGTGG CGAATGTGCC CAACCCGAGC GCAGCCAATG GCAACACCAT CCTGATGATG
CTCTCCAATG GCAACCCCGA CCTGGGCCCC GAACGGGCCA AGACCTGGAC CGGCGGTTTC
TCATTCGCGC CGCCGTCGCT GCCGGGGCTC AAGATCGATG CGAACTACTA CCACATCGAA
TACGCCGACC GGATCGGCAG TGGCTTCGGC GGCAGCTTCC CGTCGCTGTT CCTGCAGTCC
ACGGCACCCT ACGCGGACAT CCTGACCAGC AATCCGACCC AGCAGCAGAT CCAGCAGGCG
CGCCAGCTGG GCATCTCGGG GCTGGGCCTG TTCGTCTCGC GCGTGGGGCC GTATGCGCTG
CCACCGGGAA CGGACGAAAC CAACAGCCGG GTGATCCTGG ACAACCGCTT CCGCAACAAC
GCCTTCACCC AGCAGCGCGG TGTCGATTTC AGCGCCGGCT ATGACGTTGA TGCCGGCCAG
ACCCGCATCG CGATGAGCCT GGCCGGGCAG TACATCATCG AATCAAAGCG TCGTGTGACC
AGTACGTCGC CCGAGGTGGA TGCGGTGAAT TCGGTGTACT ACCCGGTGGA TCTCAAGATG
CGCGGTGGCA TCGCGCTGAG CCGGCAACAG GCCGCGGCGG GGATCTTCGT GAACTACGTG
GACAGTTACC GCGACCCCGC CAATGTCGCC CGGCCCCACG TCAGCTCCTG GACGACGGTC
GATCTGAACC TCGCCTACCA CTTCGGTGCG TCAGCGGACC CGCAGCGTGG CACCTCGCTT
GCGTTCAACG TGCAGAACCT GTTCGACCGG GATCCGCCGT TCATCGTCAA CAGCATCAAC
ACCGGCTACG ACCCCACCAA TGCGACAGCG CTGGGCCGCT TCCTGTCCAT GTCGCTGACC
CACCGCTGGT AG
 
Protein sequence
MGLMDRGMTN VSAAMPRALM ALSIAAALLP SLPALAQNSG NDRGSKAAGA APQAPTATLD 
QVLVVGSNIR DAIGGGASPI IVIDKEAIDR TGVATVQQLF EKLPQNFGGG ANGANVANLG
VDRDTGNNFG QGTAINLRGL GTGTTLTLIN GHRVTSSNRY QYVDVSLIPL SAVERVEILT
DGASAIYGTD AVGGVVNIIL RRDFTGYETA VRYGTVTAGG MEEYQASQSA GWSWDGGHVL
ASYEFLKQSN LPAVDKDFSK NVRVKPYDLY PGSKRHSLYV DGVQQLSDVL TLNVTGSFAK
REMDTTISGT ADETRLFPHT RQFDLFAGLT LDLPRQWQAR LNSGFGRSDV SYQRTTITGS
SASTAPPTDT NSESRYLDVV ADGELFSLPA GPIRAAFGAG YRRDGYELID HRGLEKPLDL
HRTIRSAFGE LNIPLLKDMP GVRSLSFTAA ARYDDYSDFG STLNPKLGLL WEATQGLSFR
TSYGRSYRAP VYQDMQLNNT VVVANVPNPS AANGNTILMM LSNGNPDLGP ERAKTWTGGF
SFAPPSLPGL KIDANYYHIE YADRIGSGFG GSFPSLFLQS TAPYADILTS NPTQQQIQQA
RQLGISGLGL FVSRVGPYAL PPGTDETNSR VILDNRFRNN AFTQQRGVDF SAGYDVDAGQ
TRIAMSLAGQ YIIESKRRVT STSPEVDAVN SVYYPVDLKM RGGIALSRQQ AAAGIFVNYV
DSYRDPANVA RPHVSSWTTV DLNLAYHFGA SADPQRGTSL AFNVQNLFDR DPPFIVNSIN
TGYDPTNATA LGRFLSMSLT HRW