Gene Smal_0847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_0847 
Symbol 
ID6478044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp983834 
End bp986848 
Gene Length3015 bp 
Protein Length1004 aa 
Translation table11 
GC content65% 
IMG OID642730009 
Productouter membrane autotransporter barrel domain protein 
Protein accessionYP_002027235 
Protein GI194364625 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCCAT TTCCCGCCCT GCCCGCCCAC CGCAAAGCCA TCCTCGGCAG CGCCCTTCTT 
GCAGCGCTCC TCGGCATGGC GGCCGCGCCC GCAGCCCGTG CCAGTGCCTT CCATGTGAAT
GGAGACACGC TTTACTACAC CGGCAATGTG CTGGCGACCG ACCCCGGTAC GCTGGAGGCC
TATGTCCTTG CCGCCAAGGC ACGCGGCACG CCACTGCGCC AGGTGGTGTT CCGCAACTCA
CCCGGTGGCG CCGCCTCCGG TGGCGTGGGA ATGGGCGCCA TCATCCGCCA GCACGGGCTG
GATACCGTGT TGGACGGCGG TTGCTACTCG GCATGCGCGG ATGCTTTCGT CGCAGGCGTG
AACCGGAAGG TGACGCAGTT CGGCCTGCTG CCAGCACATG GCAACTACAC CCAGACCGTG
CTGGGGATCC ATGGTGAATC CGGCCCCAAT GGACCCACGC CCTATCCGGG GCAGGACAAG
TACATCAAGT ATTACCGGGA CATGTTCGGC CCCACCGATT TCGCAGTCAT CGAAGCACGC
ATCATCCAGG CCCACTACGA ACTGACCCAG CAATCGGGCT TCCTGCGCTA TTTCGACCCC
ACCACCGTGG CCGTGGCCAC GCGCTTCTGC CCGACCCGTG ACCAGAGCGC TGCAGGCAAC
TGCACTGAAT ATCCAGGTGT CACGATGTAC AGCGATCGCG TGGTGACCGA GGCCGGCTAC
GTGATGCCGG GCGATGTCCT CTCCGTCGAG AAGGAGGTGA GTGGGGATCT CAATCCGCAT
CTGGCGGTTC GCGGTCGCTA CACCAACGCG GTCGCCAACC TGCGCTACTT CCAGCAGTCG
GCGTCGGACA TCCGGCTGGA CGATGCCTTC GGGGTCATCC GCATCGGCAA GGGCGGCGTC
TGGTCATTGG ATACTCCTTC CGCTGCCCAG TACGTGGTCG TGGACGGTGG CACCCTGCGG
CTGCAGCAGA ATGGCTGGAT ACATCACGCC GAAGCCATCG GTGCCCGTAA TGGTGGCCGG
ATCGAACTAC AGGGCGGCGC GCTGCGCAAC CTGACCCTGG TCGAACGTGG CGGTGTACTT
GCCGGCCACG GCTCGTTGAA TGCCACGATC GACATCAACG AAGGCGGCAC GCTGGCACCG
TCCGGCATCG TCATGCGCCC CTACTCGCTG GAAGCGCTGG CACCAGGCGG CGACGACGTT
TTCACCAGCG AACGCTTCCG GATCGGAAAG GGCCGCTACA TCAACCTCAA TGACGGGGCG
AACCTGGCCT TCCAGGTCGA CAGCCAGCGC ACCGCACCGT CACTGACGCT GGAAGAGGCC
CGTTACTTCC TGGTCGAGGA GGACAGGCGA TCGAGCGACT ACAACCTGAT CGGAAACATC
AACCGCGCAG TACTGGGCAT TGCGCCCACT GCCAGGCTGA CGCTGGACGT GAAGCAGGCA
TTCTTCCCTG CAGGGCAGGA GAACCATCTG GTGGGAACCC AGGTGGATTC CGCCGCCCTG
CAATTGCCTG TCGCCCCCTG CAGCCCATCC AATGGCGAGT ACGCAGGCCT GTGGTGCCGA
ACGGCTACGC CCGCCACCCT GACCCGCCCG GCTCCGGTAT CGGACTTCAT CGAAGGACGC
TTCCAGTACG TTGAACGCAG TGGCGAAGCG GGGAGCGCGG TTGATCTCAC GGCGCCGGGC
GCCGTGTTCC GTCCCCGCCA CAACTCGCTG CTCAGCTTCA CTGTCAACCA GACCGGCAGC
GGCCTGTGGC TGACGGCCAA TCCTTCCTTC GATGACACCG GCCTGTTCGC CAATGCTCGA
TCCGGTGATG GGTTGGGCGT TGCTCTGCAA CAGGCAGCGG CACAGCGCAC GGCATCGATG
AGTAGCCTGC TGGGCGCTCT GCAGTTCGCC GACCGCGACG TCATCGCGCG CCAGGCGGGG
GCGCTGCGCG GCGATGCGCA TGCCAGCCTG CGCCTGGCCG ACAACGCACT GGTCGGCAGC
ATCGGCAACG TGGTGCAGCA GCATCAGGCC GCATCGCGCA GCGGTGGCGA TGCCGATGGC
CTGGCGGCAC AGGCTGCACA GGCGGCGTCG GTGCAATCGG CGATGACCGG CAATCGGCTG
TTCAACCAAC TGGCGATGCA TCTGGTCGCA CCATCGGATG CAGGCAACGA TACGGACGAT
ACGGGCCGCA ACCGTGGTCT CTGGGCGCGG GGTTTCGCCA GCCATGGCCG TATCGAGGGT
GACGGCGGTG TCGCCGGCCT GAGCCACACC ATCGGTGGCA TTGCGCTCGG TGCCGATACC
CGCCTCGCAG ACGACCGCGT AACACTGGGC GTAAGCCTTG CCGCGGCCGA CATGTCGACA
AAATCCAGCG GTGGCGCAGG CTTCACTGGC GATGTCCGTG CACTGGATAT TGGCGGCTAC
GTGGATGCGC TCTACTCGCG CGGCTACCTG TCCGCTGCCG TCCGCTACAC CGATCTTCGC
CACGACACCC GCCGCAGCAT CGATGGCATC GACGGCCTGC AGGCTCCCCT GCGCGCGAAA
TACAACAACG ATGCGATCAC CGCACGCGTG GAGCACGCGT TCTCCTTCAC CACGGGCAAG
GGTCTGGTGA TCCAGCCGCT GCTGCCAGTG ATCGACTACA CCCGCACCTC AGCCACGCGC
TTCAACGAGG GCCAGGACGC GGCCGCACTG GTCGGCCGCA GTGGCAGCCT GGAGAGCATC
CGCGTAGGTG CCGGCCTGCA GTTGTTCAAG ACGTTCGAAG GCAACAGCGG CGAGCGCATC
ACCCCGCGTG CCCGCGTGGT CTGGCAGAAG GAACTGGGCG ATACCCAGGC CCGCTACAGC
ACCTCCTTTG CTGCCGCCCC GGACCTGTTG TTCGGCAGCA GCAGCCAGGT GGTGGGCGAA
CAGGTGCTGG CCTGGAACCT GGGCGTGACC AGTCGTGCCA GCGAGCGGCT GTCGATCATG
GTCGACTACG TGGGCGAGCG TCGTGACGGG CAGTCCCAGA ACGGCGTACA GCTGGGGCTG
GGCTACAGGT TCTGA
 
Protein sequence
MHPFPALPAH RKAILGSALL AALLGMAAAP AARASAFHVN GDTLYYTGNV LATDPGTLEA 
YVLAAKARGT PLRQVVFRNS PGGAASGGVG MGAIIRQHGL DTVLDGGCYS ACADAFVAGV
NRKVTQFGLL PAHGNYTQTV LGIHGESGPN GPTPYPGQDK YIKYYRDMFG PTDFAVIEAR
IIQAHYELTQ QSGFLRYFDP TTVAVATRFC PTRDQSAAGN CTEYPGVTMY SDRVVTEAGY
VMPGDVLSVE KEVSGDLNPH LAVRGRYTNA VANLRYFQQS ASDIRLDDAF GVIRIGKGGV
WSLDTPSAAQ YVVVDGGTLR LQQNGWIHHA EAIGARNGGR IELQGGALRN LTLVERGGVL
AGHGSLNATI DINEGGTLAP SGIVMRPYSL EALAPGGDDV FTSERFRIGK GRYINLNDGA
NLAFQVDSQR TAPSLTLEEA RYFLVEEDRR SSDYNLIGNI NRAVLGIAPT ARLTLDVKQA
FFPAGQENHL VGTQVDSAAL QLPVAPCSPS NGEYAGLWCR TATPATLTRP APVSDFIEGR
FQYVERSGEA GSAVDLTAPG AVFRPRHNSL LSFTVNQTGS GLWLTANPSF DDTGLFANAR
SGDGLGVALQ QAAAQRTASM SSLLGALQFA DRDVIARQAG ALRGDAHASL RLADNALVGS
IGNVVQQHQA ASRSGGDADG LAAQAAQAAS VQSAMTGNRL FNQLAMHLVA PSDAGNDTDD
TGRNRGLWAR GFASHGRIEG DGGVAGLSHT IGGIALGADT RLADDRVTLG VSLAAADMST
KSSGGAGFTG DVRALDIGGY VDALYSRGYL SAAVRYTDLR HDTRRSIDGI DGLQAPLRAK
YNNDAITARV EHAFSFTTGK GLVIQPLLPV IDYTRTSATR FNEGQDAAAL VGRSGSLESI
RVGAGLQLFK TFEGNSGERI TPRARVVWQK ELGDTQARYS TSFAAAPDLL FGSSSQVVGE
QVLAWNLGVT SRASERLSIM VDYVGERRDG QSQNGVQLGL GYRF