Gene Smal_3787 details

Gene Information

Locus tag: Smal_3787
Symbol:
ID: 6474669
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stenotrophomonas maltophilia R551-3
Kingdom: Bacteria
Replicon accession: NC_011071
Strand:
Start bp: 4260170
End bp: 4261480
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 66%
IMG OID: 642732988
Product: L-sorbosone dehydrogenase
Protein accession: YP_002030169
Protein GI: 194367559
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
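
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the record's own numbers are consistent with both assumptions. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4260170, 4261480)
print(length, protein_length_aa(length))  # 1311 436
```

Both values match the Gene Length and Protein Length fields listed in the record.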


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.510654
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAGC ACATCCTGCC TACCGTGGGC GCCCTGGCGC TGGTCGTGAT GCTGTCGGCC 
TGTGCCGGCA AGGCCGAATA CCACCCCACC GACCAGTCGG GTGCGAAGCC GCCTCTGCCG
GCACCGAAGA ACTTCCTGAT GCCGCCGATG CAGGTGCCCA AAGGCGTGGG TTGGGCCGAC
GGCCAGTCAC CCACCGTGGC CGAAGGCCTG AAGATCGAAC GCATCGCCGC CAACCTGCAG
CATCCGCGGC GATTGTTGAC CCTGCCCAAC GGCGATGTGC TGGTGGTGGA AGGCAATGGC
CCCGGCGAAG AGCCGGTCAC CACGCCCAAG CAGTGGATCG CCGGCAAGGT GAAGGCGCGC
TCGGGCAAGG CCGGCAAGGG CGGCAACCGG GTCACCCTGC TGCGCCGCAC ACCGGGCACG
AACACCTGGA CCCAGCACGT GTACATCGAA GGCCTGCATT CGCCGTTCGG CATCCAACTG
ATCGGTGACA CGCTGTACGT GGCCAACACC GGCAACATCA TGCAGTACCA CTACGTGCCG
GGCGAAACCC GCATGTCCGA CAAGGGCCGC GAGTTCACCG ACCTGCCCAG CACCATCAAC
CACCACTGGA CCAAGGAACT GCTGGCCAGC CGCGATGGCA GCAAGCTGTA CGTGGGCGTG
GGCTCCAACA GCAACATCAC CGAGAATGGT TTGGCAGTTG AATACCGCCG CGCGGTGGTG
CTGGAAGTGG ACGTGGCCAC CCGTGGCAGC CGCATCTTTG CGTCGGGCAT CCGCAACCCG
ACCGGACTGG ACTGGGAGCC GAGCACCGGC ACGCTGTGGG CCGTGGCGAA CGAACGCGAT
GAAATCGGTG CGGATCTGGT GCCCGATTAC CTCACCTCGG TGAAGGAAGA TGGCTTCTAC
GGCTGGCCCT ACAGCTACTA CGGTCAGCAC GTGGACGAGC GCGTGCAGCC GCAGCGGCCG
GACCTGGTGG CGAAAGCCAT CACGCCGGAC TACGCCATCG GCTCGCACGT GGCACCGCTG
GGCCTGCTGT TCTACACCGG CCAGGCACTG CCGGCGCAGT ACCACGGCGG CGCCTTCATC
GGCGAGCACG GCAGCTGGGA TCGCTCACCA TTGAGCGGCT ACGAAGTGGT CTACGTACCG
TTCAAGGACG GCAAGCCGAC CGGGCGACCG CAGACCGTGG TCAGCGGCTT CGCCTCCAAG
GACGAGAAGA CCCTGATGGG CGCACCGGTA GGCATGGCGA TGGATGCCGA AGGCGCACTG
CTGGTCGCCG ACGACGTGGG CGACGTGGTG TGGCGGGTGT CGGCCAAGTA G
 
Protein sequence
MAKHILPTVG ALALVVMLSA CAGKAEYHPT DQSGAKPPLP APKNFLMPPM QVPKGVGWAD 
GQSPTVAEGL KIERIAANLQ HPRRLLTLPN GDVLVVEGNG PGEEPVTTPK QWIAGKVKAR
SGKAGKGGNR VTLLRRTPGT NTWTQHVYIE GLHSPFGIQL IGDTLYVANT GNIMQYHYVP
GETRMSDKGR EFTDLPSTIN HHWTKELLAS RDGSKLYVGV GSNSNITENG LAVEYRRAVV
LEVDVATRGS RIFASGIRNP TGLDWEPSTG TLWAVANERD EIGADLVPDY LTSVKEDGFY
GWPYSYYGQH VDERVQPQRP DLVAKAITPD YAIGSHVAPL GLLFYTGQAL PAQYHGGAFI
GEHGSWDRSP LSGYEVVYVP FKDGKPTGRP QTVVSGFASK DEKTLMGAPV GMAMDAEGAL
LVADDVGDVV WRVSAK
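
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. A minimal sketch follows, assuming plain Python with no external libraries; `join_blocks` and `gc_fraction` are hypothetical helper names, and the GC calculation is the ordinary (G + C) / length ratio, not necessarily the exact method the database used for its GC content field.

```python
def join_blocks(text: str) -> str:
    """Remove the whitespace between display blocks and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in a DNA string that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First display line of the gene sequence, copied from the record.
first_line = "ATGGCTAAGC ACATCCTGCC TACCGTGGGC GCCCTGGCGC TGGTCGTGAT GCTGTCGGCC"
dna = join_blocks(first_line)
print(len(dna))  # 60
print(round(gc_fraction(dna), 2))
```

Passing the full gene sequence through `join_blocks` should yield a string whose length equals the Gene Length field, which is a quick way to confirm that nothing was lost in copying.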