Gene EcSMS35_3871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3871 
SymbolbisC 
ID6145520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3940209 
End bp3942542 
Gene Length2334 bp 
Protein Length777 aa 
Translation table11 
GC content55% 
IMG OID641618698 
Productbiotin sulfoxide reductase 
Protein accessionYP_001745837 
Protein GI170681133 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR00509] molybdopterin guanine dinucleotide-containing S/N-oxide reductases 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCAACT CATCCTCACG ATATTCCGTT CTGACTGCCG CCCACTGGGG GCCTATGCTG 
GTTGAAACCG ACGGCGAAAC CGTGTTTAGC TCGCGTGGCG CGTTAGCCAC AGGAATGGAA
AACTCCTTGC AGAGCGCGGT TCGCGACCAG GTTCACAGCA ATACGCGGGT GCGATTTCCA
ATGGTGCGCA AAGGCTTTCT TGCGTCACCG GAAAATCCGC AAGGCATTCG TGGGCAGGAT
GAATTTGTTC GCGTGAGTTG GGATGAGGCG CTGGAGCTTA TTCACCATCA ACATAAACGC
ATTCGTGAGG CTTATGGTCC GGCATCGATT TTTGCGGGTT CCTACGGCTG GCGTTCAAAC
GGCGTGCTGC ATAAGGCCTC GACATTATTA CAACGCTATA TGGCGCTGGC AGGCGGTTAT
ACTGGGCATC TGGGGGATTA TTCGACCGGC GCGGCACAGG CGATCATGCC GTATGTCGTG
GGGGGCAGCG AAGTTTATCA ACAGCAAACC AGTTGGCCGC TGGTGCTGGA ACATAGCGAT
GTCGTGGTGC TGTGGAGTGC TAACCCACTC AATACGCTGA AAATTGCGTG GAATGCATCC
GATGAGCAGG GGCTTTCTTA CTTTTCCGCA CTGCGTGACA GCGGGAAAAA GCTGATCTGC
ATTGATCCAA TGCGATCGGA AACCGTCGAT TTCTTTGGCG ATAAAATGGA ATGGGTGGCA
CCGCACATGG GCACCGATGT TGCGCTGATG CTGGGGATCG CCTATACGCT GGTGGAAAAT
GGTTGGCACG ACGAAGCGTT TCTGGCGCGT TGCACCACAG GTTATGCCGT CTTCGCCTCT
TATTTGCTGG GCGAGAGTGA CGGAATAGCG AAAAACGCCG AATGGGCGGC AGAGATTTGT
GGTGTTGGCG CAGCGAAAAT CCGCGAGCTG GCGGCTCTTT TCCACCAAAA TACCACCATG
CTGATGGCTG GCTGGGGAAT GCAACGTCAA CAGTTTGGCG AGCAAAAGCA CTGGATGATC
GTCACGCTGG CGGCAATGTT GGGGCAAATC GGCACACCCG GCGGCGGTTT TGGTCTTTCT
TACCATTTTG CCAATGGTGG TAACCCCACG CGCCGCGCTG CGGTGCTCTC TTCCATGCAG
GGTAGCTTGC CTGGCGGCAC CGATGCGGTG GATAAAATCC CTGTTGCCCG CATTGTTGAA
GCACTGGAAA ACCCCGGTGG CGCATATCAA CATAACGGTA TGGACCGACA TTTCCCGGAT
ATTCGTTTTA TCTGGTGGGC GGGCGGTGCC AACTTTACTC ATCATCAGGA TACCAATCGC
CTGATCCGTG CCTGGCAAAA ACCGGAGCTG GTGGTGATCT CTGAATGCTT CTGGACGGCT
GCGGCGAAAC ACGCGGATAT CGTTCTGCCT GCGACCACTT CGTTTGAGCG TAATGATCTC
ACCATGACCG GCGATTACAG TAATCAGCAT CTGGTGCCGA TGAAGCAAGT GGTGCCTCCA
CGTTATGAAG CGCGTAATGA TTTTGATGTC TTTGCCGAGT TGAGTGAACG TTGGGAGAAG
GGCGGCTATG CGCGGTTTAC GGAAGGAAAA AGTGAGCTGC AATGGCTGGA AACGTTTTAT
AACGTTGCCC GGCAGCGCGG GGCAAGCCAG CAGGTTGAAT TGCCGCCATT TGCTGAGTTC
TGGGAAGCCA ACCAGTTAAT TGAGATGCCG GAAAACCCGG ACAGCGAGCG GTTTATTCGC
TTCGCCGATT TTCGCCGCGA TCCGCAGGCG CATCCATTAA AAACCGCCAG CGGTAAGATT
GAAATCTTCT CGCAGCGTAT TGCCGATTAC GCTTATCCGG ATTGCCCTGG GCATCCAATG
TGGCTGGAGC CGGACGAATG GCAGGGCAAT GCCGAACCGG AACAGTTGCA GGTACTTTCT
GCTCATCCGG CACATCGTCT GCACAGCCAG CTGAATTACA GTTCTCTGCG CGAATTGTAC
GCGGTGGCAA ATCGTGAGCC TGTCACCATT CATCCTGACG ATGCCCAGGC GCGCGGCATA
CAAGATGGCG ATATTGTTCG GTTGTGGAAC GCACGCGGGC AAATTCTTGC CGGAGCGGTC
ATTAGCGAGG GAATTAAACC TGGCGTGATT TGCATTCATG AAGGGGCATG GCCGGATCTG
GATTTAACCG CTGACGGTAT TTGTAAAAAC GGCGCGGTGA ACGTTCTGAC CAAAGATCTC
CGCAGCTCGC GGCTGGGGAA TGGCTGTGCG GGTAATACGG CGCTGGCATG GCTGGAAAAA
TACAACGGTC CGGAACTGAC ACTTACAGCG TTTGAACCAC CGGCCAGCTC ATAA
 
Protein sequence
MANSSSRYSV LTAAHWGPML VETDGETVFS SRGALATGME NSLQSAVRDQ VHSNTRVRFP 
MVRKGFLASP ENPQGIRGQD EFVRVSWDEA LELIHHQHKR IREAYGPASI FAGSYGWRSN
GVLHKASTLL QRYMALAGGY TGHLGDYSTG AAQAIMPYVV GGSEVYQQQT SWPLVLEHSD
VVVLWSANPL NTLKIAWNAS DEQGLSYFSA LRDSGKKLIC IDPMRSETVD FFGDKMEWVA
PHMGTDVALM LGIAYTLVEN GWHDEAFLAR CTTGYAVFAS YLLGESDGIA KNAEWAAEIC
GVGAAKIREL AALFHQNTTM LMAGWGMQRQ QFGEQKHWMI VTLAAMLGQI GTPGGGFGLS
YHFANGGNPT RRAAVLSSMQ GSLPGGTDAV DKIPVARIVE ALENPGGAYQ HNGMDRHFPD
IRFIWWAGGA NFTHHQDTNR LIRAWQKPEL VVISECFWTA AAKHADIVLP ATTSFERNDL
TMTGDYSNQH LVPMKQVVPP RYEARNDFDV FAELSERWEK GGYARFTEGK SELQWLETFY
NVARQRGASQ QVELPPFAEF WEANQLIEMP ENPDSERFIR FADFRRDPQA HPLKTASGKI
EIFSQRIADY AYPDCPGHPM WLEPDEWQGN AEPEQLQVLS AHPAHRLHSQ LNYSSLRELY
AVANREPVTI HPDDAQARGI QDGDIVRLWN ARGQILAGAV ISEGIKPGVI CIHEGAWPDL
DLTADGICKN GAVNVLTKDL RSSRLGNGCA GNTALAWLEK YNGPELTLTA FEPPASS