Gene Smal_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_2021 
Symbol 
ID6476199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp2263683 
End bp2264873 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content59% 
IMG OID642731203 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002028408 
Protein GI194365798 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.104034 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGCCA CGTTGAAACA GGTCCCCTGG CGGCCCGACT ATGTGGGCGT CGCACTACCT 
TTCGAATACG TGATGGCCCC TATCCCCAAG AATTTGCCGG CATTCAAGGA CTGGCCTGTC
GCCGGCGAGG ATCCCGCGCT GGATGCTGCC GGTCTCCAGG CTTCTGAAGA TCCAAACGCC
GCAAGCGAAT CGCAGATTGC CGAGATAGTG CCTACCGGGG CGAGGGCAAT CCTCGGCTTG
GTCGAGGACC GGGGGTTCTC TCCGGAAGAG CTATGCCGCG GGCTCGGCTT CACTTATCGC
GATCTATCAA TGAGGGACGT GAGGCTGTCC TACCGCCAGA TGCGCCAGTT GTTCATGCGG
GCTGAGCGCT TGCTGGGCGA ACCTGCGCTG GGCCTGGCAC TGGGAGCCAG ACAGACGCCC
ATTTCTTGGG GCGTACCCGG ATTGGCAATG CTCACCTGCG AGACCTACGG CGATGCACTG
ACGTACGGCC TCACCCATCA GCAAGCCATC GGCTCGATGC TGATCCACAC GGTGGAGGAG
GTGGGAAGGG AAGTGAGGAT GGAAGTCCGG TTCAAACGAT TCGACATTCA ACTGGAGTCG
GTGCTGGTCG AGGACGCGTT CGCCGGATTC GTTGCGGTCA GCAGATACGT GATTGGACCA
TCATTCGCGC CACTCAGGGT TGATTTCTCA CTCCCCAAGC CCTCTGATCC TGAAGTCTAT
CGACGATTCT TCCAGTGCCC TGTCCGCTTC GATGCTGGCG TCAATCGCCT GACCATAGAC
TCACATTGGC TGAGCGCGCG CTTGCCCGGT TTCGATCGGG TCAATTCAAG AATTGTCCGA
GAGCAGTTGG ATTCACTTCT TCCAACGCGA GGGGGCCGCA ATGAGATTGT TGAATCCCTG
TCGAGCCACC TTCGATGTGA TATCGAATCA ACGACCAAGC AGAGCGAACT TGCGAGCCTG
ATCAATGTCA GTGAAAGAAC GCTCCGCCGC CGCCTGAGCC GCCAGGATTC CAGTTATAGG
GAGATTCGGG ACGAAGCGAG GTATGAGCGC GCCCGCGATC TCCTGCTGAA CTCAGAGTTG
AGCATCGCCG AAATTGCAGA CGCGGTTGGA TATTCCGACG CCCGTGCATT CCGCCGCGCA
TTCAAGCGTT GGGCGGGTTG CCTGCCAACC GAGTTCCGGG AATCCAGGTA G
 
Protein sequence
MSATLKQVPW RPDYVGVALP FEYVMAPIPK NLPAFKDWPV AGEDPALDAA GLQASEDPNA 
ASESQIAEIV PTGARAILGL VEDRGFSPEE LCRGLGFTYR DLSMRDVRLS YRQMRQLFMR
AERLLGEPAL GLALGARQTP ISWGVPGLAM LTCETYGDAL TYGLTHQQAI GSMLIHTVEE
VGREVRMEVR FKRFDIQLES VLVEDAFAGF VAVSRYVIGP SFAPLRVDFS LPKPSDPEVY
RRFFQCPVRF DAGVNRLTID SHWLSARLPG FDRVNSRIVR EQLDSLLPTR GGRNEIVESL
SSHLRCDIES TTKQSELASL INVSERTLRR RLSRQDSSYR EIRDEARYER ARDLLLNSEL
SIAEIADAVG YSDARAFRRA FKRWAGCLPT EFRESR