Gene SbBS512_E3929 details


Gene Information

Locus tag: SbBS512_E3929
Symbol:
ID: 6268969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3657131
End bp: 3659191
Gene Length: 2061 bp
Protein Length: 686 aa
Translation table: 11
GC content: 53%
IMG OID: 641727780
Product: AsmA family protein
Protein accession: YP_001882213
Protein GI: 187731640
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2982] Uncharacterized protein involved in outer membrane biogenesis
TIGRFAM ID:
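
The coordinate and length fields in the record are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the numbers agree with that convention) and that the protein length excludes the stop codon. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3657131, 3659191)
print(length)                     # 2061
print(protein_length_aa(length))  # 686
```

Both results match the Gene Length and Protein Length fields.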


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAAGG CAGGCAAAAT AACCGCTGCG ATTTCAGGGG CTTTCTTGTT GTTGATTGTC 
GTGGCGATCA TTTTGATTGC AACTTTTGAC TGGAATCGAC TCAAACCGAC CATCAACCAG
AAAGTCTCTG CGGAGTTGAA TCGTCCGTTC GCTATCCGTG GCGATCTGGG CGTGGTGTGG
GAGCGGCAAA AACAAGAAAC TGGCTGGCGC AGCTGGGTGC CGTGGCCTCA TGTACATGCA
GAAGACATCA TTCTTGGCAA TCCACCGGAT ATTCCCGAAG TCACGATGGT GCATTTGCCG
CGAGTAGAGG CAACGCTGGC ACCGCTGGCG CTGCTGACCA AAACGGTCTG GCTGCCGTGG
ATCAAGCTCG AAAAGCCCGA CGCGCGCCTG ATTCGCCTCT CCGAAAAGAA CAATAACTGG
ACGTTTAATC TCGCCAACGA TGATAACAAA GACGCGAATG CAAAGCCGTC GGCATGGTCG
TTTCGGCTGG ATAATATTCT TTTCGATCAA GGGCGGATCG CCATTGATGA CAAAGTAAGC
AAAGCGGATC TGGAAATTTT TGTTGATCCG TTAGGCAAGC CGCTGCCGTT CAGCGAAGTT
ACTGGATCGA AAGGTAAAGC GGATAAAGAA AAGGTGGGCG ATTACGTTTT TGGCCTGAAG
GCGCAGGGAC GTTATAACGG TGAACCGCTC ACGGGTACGG GAAAAATAGG CGGTATGCTG
GCGCTGCGTG GCGAAGGGAC GCCGTTTCCG GTACAGGCTG ATTTCCGTTC AGGTAATACC
CGTGTTGCTT TTGATGGCGT CGTGAATGAC CCAATGAAGA TGGGCGGTGT CGATTTACGG
CTTAAATTTT CTGGCGATTC ACTGGGTGAT CTCTATGAAC TGACGGGCGT TCTGCTGCCC
GATACCCCGC CGTTTGAAAC GGATGGTCGG CTGGTAGCGA AAATCGACAC TGAAAAATCG
TCGGTCTTTG ATTATCGCGG TTTTAATGGA CGAATTGGCG ATAGCGATAT CCACGGTTCT
CTGGTCTACA CCACCGGAAA GCCACGACCA AAACTGGAAG GTGATGTCGA GTCGCGGCAA
TTGCGGCTGG CGGACCTGGG ACCGTTGATT GGCGTTGATT CGGGGAAAGG GGCAGAAAAG
TCGAAACGGT CTGAACAGAA GAAGGGCGAA AAAAGCGTTC AGCCTGCGGG CAAAGTGCTG
CCTTATGACC GCTTCGAAAC CGATAAATGG GACGTTATGG ATGCCGATGT TCGCTTCAAA
GGCCGACGTA TCGAGCATGG CAGTAGCCTG CCGATTAGCG ATCTTTCTAC TCATATCATC
CTCAAAAATG CTGACCTGCG CCTGCAACCG CTGAAATTTG GCATGGCGGG CGGCAGCATT
GCGGCGAATA TCCATCTGGA AGGCGATAAA AAGCCGATGC AGGGGCGGGC AGATATTCAG
GCTCGTCGAC TGAAACTGAA AGAACTGATG CCCGATGTGG AACTGATGCA GAAGACGCTG
GGGGAAATGA ACGGTGACGC GGAACTACGC GGTAGCGGTA ACTCGGTGGC GGCGCTTTTA
GGCAACAGTA ACGGCAACCT GAAACTGTTG ATGAATGACG GGCTGGTGAG CCGCAACCTG
ATGGAGATTG TTGGGCTGAA TGTCGGCAAC TACATTGTCG GTGCGATATT TGGTGACGAT
GAGGTGCGGG TGAACTGCGC GGCGGCGAAT CTGAATATTG CCAACGGCGT AGCGCGCCCG
CAGATTTTTG CTTTCGATAC TGAGAACGCG TTGATTAATG TTACCGGCAC GGCAAGTTTT
GCTTCGGAAC AGCTGGATTT GACTATTGAT CCGGAGAGTA AAGGGATTCG GATTATCACA
CTGCGTTCGC CGCTGTATGT GCGTGGGACG TTTAAAAATC CGCAGGCTGG GGTGAAAGCC
GGACCGCTGA TTGCCCGTGG TGCCGTTGCG GCGGCACTGG CAACGCTGGT AACACCGGCG
GCGGCGTTAT TGGCACTGAT CTCACCTTCC GAAGGGGAGG CTAATCAGTG TCGGACGATA
TTGTCGCAGA TGAAGAAGTG A
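
The GC content field can in principle be recomputed from the gene sequence. A minimal sketch follows, assuming the sequence is pasted as plain text in the 10-base blocks shown above; `gc_fraction` is a hypothetical helper name, and the example runs on the first block only rather than the full gene.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First 10-base block of the gene sequence: 4 G + 1 C out of 10 bases.
print(gc_fraction("ATGAGCAAGG"))  # 0.5
```

Running the same function on the complete 2061 bp sequence should give a value comparable to the GC content field; that result has not been verified here.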
 
Protein sequence
MSKAGKITAA ISGAFLLLIV VAIILIATFD WNRLKPTINQ KVSAELNRPF AIRGDLGVVW 
ERQKQETGWR SWVPWPHVHA EDIILGNPPD IPEVTMVHLP RVEATLAPLA LLTKTVWLPW
IKLEKPDARL IRLSEKNNNW TFNLANDDNK DANAKPSAWS FRLDNILFDQ GRIAIDDKVS
KADLEIFVDP LGKPLPFSEV TGSKGKADKE KVGDYVFGLK AQGRYNGEPL TGTGKIGGML
ALRGEGTPFP VQADFRSGNT RVAFDGVVND PMKMGGVDLR LKFSGDSLGD LYELTGVLLP
DTPPFETDGR LVAKIDTEKS SVFDYRGFNG RIGDSDIHGS LVYTTGKPRP KLEGDVESRQ
LRLADLGPLI GVDSGKGAEK SKRSEQKKGE KSVQPAGKVL PYDRFETDKW DVMDADVRFK
GRRIEHGSSL PISDLSTHII LKNADLRLQP LKFGMAGGSI AANIHLEGDK KPMQGRADIQ
ARRLKLKELM PDVELMQKTL GEMNGDAELR GSGNSVAALL GNSNGNLKLL MNDGLVSRNL
MEIVGLNVGN YIVGAIFGDD EVRVNCAAAN LNIANGVARP QIFAFDTENA LINVTGTASF
ASEQLDLTID PESKGIRIIT LRSPLYVRGT FKNPQAGVKA GPLIARGAVA AALATLVTPA
AALLALISPS EGEANQCRTI LSQMKK
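
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a rough check, the sketch below translates the first three blocks and the final line of the gene sequence using only the Python standard library. It assumes the codon-to-amino-acid assignments of the standard code, which table 11 shares (the two differ in permitted start codons, which does not matter here because the gene begins with ATG). `translate` is an illustrative helper, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGCAAGG CAGGCAAAAT AACCGCTGCG"))  # MSKAGKITAA
# Final line of the gene sequence (in frame, ends with the TGA stop codon).
print(translate("TTGTCGCAGA TGAAGAAGTG A"))           # LSQMKK
```

Both fragments agree with the start and end of the protein sequence listed above.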