Gene Smal_3314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_3314 
Symbol 
ID6476442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp3713924 
End bp3716845 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content66% 
IMG OID642732512 
ProductTonB-dependent receptor 
Protein accessionYP_002029696 
Protein GI194367086 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01785] TonB-dependent heme/hemoglobin receptor family protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.510297 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGACT CCCATCGTCC GGGCCGCGTG CTGCGCCCGT CCCGCCTGTG CATCGCGTTG 
CTGGCTGTCG GCCTCGCCTG CGCCGCACCA TCGTCCGTAC TGGCGCAGAG TGCTGCCAGC
GGCCATGCGG CCCTGCAGCG TTTCGACATT CCCGCGCAGC CGCTTGATGA AGCGCTGCGC
AGCTACATGC GCCAGTCCGG TGTGCAGGTG GTGTATCCGG CCACGCTGGC CCAGGGCGTG
AGCTCGCACG CGGTCAGCGG TAACCTGTCT TCCGATGAGG CCCTGGCTCG CCTGCTGCAG
GGCAGTGGCC TGGCGGTGCG CCGGGTCGGC ACCGATGCGG TCACCCTGGA ATCGGCAACT
CCCGTGCAGG CCGACAGCGG CGTGATCGTC ACCGACACGC TGAGCGTGGC CGGCGACCGC
GTCGACAGTG GCGCAACCAG CGACGAGGCG CGCCTGCTCG ACGGCTATCG CAGCGTCGGC
TCCACCACCA CGCTCAACCG CACCCACCTG GAGCGATTCC GCGGTACGTC CAACGGCGAC
ATCGTCAAGG GCGTGGCTGG CGTCACCGCC GGTGATCCGC GCGTGGGCAA CGGCTTTGAC
GTCAACATCC GTGGCATCCA GGGCCAGGGC CGCGTGCCGG TGATCATCGA CGGTGGCCAG
TCCAGCATGG ATACCTACCG TGGTTATGCC GGCCAGTCGC AGCGCACCTA CCTCGACCCG
GACCTGATCT CCAGCCTGAC CATCACCAAG GGCCCGAGTC TGCAGGCCAA TGCCTCCGGT
GGCATCGGCG GCGTGGTCGA GATGGAGACG CTGAAGATCG GTGACGTGCT GCGCGAAGGG
CGGGACTTCG GCGTGCGCGT GCGTGGCGGC CTGGCCAACA ACAGCGCCAA CAACCTGCCT
TCCTACAGTG CCGCGCCGCG CAACGATCGC AGCGCCACCG GCAGCCAGTT CTTCAACGTG
GCCGCGGCCG GTCATTGGGA TCGTTTCGAT CTCGTGGCCG CCTATGCCTA CCGTGACAAT
GGAAACTACT TCTCCGGCAA GCACGGCTAC GACGATTTCC CGCAGACCCG GCGCACGCTG
GCACCGCTGA ATCCGCCGCG TACCGAGGTC TTCAATACCT CGGCGCGCTC GAAGTCGGCG
CTGCTCAAGG GCACCTGGCG CATCGACGAT GCACAGATGC TGGAAGCGGG CTACCGCCGC
TACGAGGGCA CCGCCGGCGA GATCATGGCC TCGCAGATCA TCCGCGTCGA TCGCGACCGC
ATTCCGCAAT GGGATCCGGG CCACGTCGAC ATGGACAGCT ACACCCTGCG CTATCGCTTC
AACCCGGACA GCGAGCTGGT CGACCTGCGC GTCAATGCCG CGTACACCGA CACCGACAGC
GTGATGTACA ACAGCCTGAC CGGCATCACC CCGTGGTACT TCGATCGCCG CACCGAGTGG
TACGACGCGC CCAGCTTCAG TGGCGACCCC GGCTACAAGG ATGCGTACCG CAACCCGCTG
CGACAGAAGC GCTTCGGCCT GGATGCCAGC AATACCTCGA AGTTCGACAC TCGTGCCGGC
GCGTTCACCC TCGACTATGG CCTGTCCTAC AGCGACGAAG ACATCGCGCC GGGCAGTTCG
GGGCCGGTCA TGCACGACGA TCTGATCAAC AACCGTTTCC TGCGCAACGC CGAGCGCAAG
GAATACAGTG CCGTGGCCTC GCTGAAGTGG CAGCCCGATG AGCATTGGGA ACTGCTGGCC
GGCGGCCGCT GGAACCGCGT GGATGTGCAT GACCGCAACC GCCTGGCCAC GCCCGATGCC
TATGAGGTGC AGGGCCAGTA CCGCTACACC GAACTGCTCA ACGGCAACCC GGCGTTGCCG
TCGTGGCGGG CCAAGCGCAT CGCGCTGCTG AACTGGTATC CGGATGCCAA CGGCAACTTC
ACCCAGGAAT CGCTGCTGGC CTCACCGTAC AAGAAGGGTA CGGTGGGCGA CATCGGTGGC
TGGAACTTCT ACGACGCCGG CAAGGCGCAG GACCTGGAAG TGCCGGTCAG CTGGACCTGG
TCGCAGCCGA TCCGCCGTCG TGACACCGCG TTCTCGCCGA CCGCCAGCGT GGCTTACCGC
TTCAGCGAAG ACACCATGGT CTATGTGAAG TACGCCGAAG GCACCAAGCT GCCGAGCCTG
TTCGAGACCA CGCTGGGCCT GTTCACTGCC GCCAAACCGG TGGGCGAGCT GAAGCCGGAG
CGCGCGCGCA GCTGGGAAAT CGGCGCCAGT ACCATCCGCT ACGACCTGTT CACTGCCGGC
GACCGCCTGG CACTGAAGCT GGCCTACTTC GATACCCGCA TCGATGACCT GATCACCCGC
GACTACCGCA CGCTGTCGGC CGGCCTGATC CGCAACGTCG ACCAGTTCAA GGTCTCGGGC
ATGGAGTTCC AGTCCAGCTA TGACAGCGGC AAGGTATTTG CGGATCTGTC CGCGCACTAC
TACTTCAAGG CCAAGACCTG TGCACCGGAC ATCGCCGCCG AGCGCCGTGC CTATGGGGCA
CAGCGCAGGA ACGATGAGCT GGCCAACACG CCCAACTGCG TGGACGGTGG CTTCGAGGGC
TCGTACACCA ATACCCAGAA TCCGCCGCGC TACATGGTCA ACCTGACCCT GGGCTCGCGC
CTGTTCGACG AGCGCCTGAC CTTCGGCACC CGCGTGGTCC ACAACGCCGG CCCGATCAGC
AAGCTGGACA AGGACTGGAA CGTGGGCCTG TCGGCGATCC AGCAGCTGTA CCGGCCGACT
ACCCTGGTCG ACCTGTTCGC CAGCTGGAGC TTCAACGACC AGCTGTCGGT GGAAGCGGGC
GTGGACAACG TCACCGACCG CTACTACCTG GATCCGCTGG CGCTGGGCGT GATGCCGGCC
CCAGGTCGCA CCGCGCGGCT GGCATTGACC TGGAGGTACT GA
 
Protein sequence
MFDSHRPGRV LRPSRLCIAL LAVGLACAAP SSVLAQSAAS GHAALQRFDI PAQPLDEALR 
SYMRQSGVQV VYPATLAQGV SSHAVSGNLS SDEALARLLQ GSGLAVRRVG TDAVTLESAT
PVQADSGVIV TDTLSVAGDR VDSGATSDEA RLLDGYRSVG STTTLNRTHL ERFRGTSNGD
IVKGVAGVTA GDPRVGNGFD VNIRGIQGQG RVPVIIDGGQ SSMDTYRGYA GQSQRTYLDP
DLISSLTITK GPSLQANASG GIGGVVEMET LKIGDVLREG RDFGVRVRGG LANNSANNLP
SYSAAPRNDR SATGSQFFNV AAAGHWDRFD LVAAYAYRDN GNYFSGKHGY DDFPQTRRTL
APLNPPRTEV FNTSARSKSA LLKGTWRIDD AQMLEAGYRR YEGTAGEIMA SQIIRVDRDR
IPQWDPGHVD MDSYTLRYRF NPDSELVDLR VNAAYTDTDS VMYNSLTGIT PWYFDRRTEW
YDAPSFSGDP GYKDAYRNPL RQKRFGLDAS NTSKFDTRAG AFTLDYGLSY SDEDIAPGSS
GPVMHDDLIN NRFLRNAERK EYSAVASLKW QPDEHWELLA GGRWNRVDVH DRNRLATPDA
YEVQGQYRYT ELLNGNPALP SWRAKRIALL NWYPDANGNF TQESLLASPY KKGTVGDIGG
WNFYDAGKAQ DLEVPVSWTW SQPIRRRDTA FSPTASVAYR FSEDTMVYVK YAEGTKLPSL
FETTLGLFTA AKPVGELKPE RARSWEIGAS TIRYDLFTAG DRLALKLAYF DTRIDDLITR
DYRTLSAGLI RNVDQFKVSG MEFQSSYDSG KVFADLSAHY YFKAKTCAPD IAAERRAYGA
QRRNDELANT PNCVDGGFEG SYTNTQNPPR YMVNLTLGSR LFDERLTFGT RVVHNAGPIS
KLDKDWNVGL SAIQQLYRPT TLVDLFASWS FNDQLSVEAG VDNVTDRYYL DPLALGVMPA
PGRTARLALT WRY