Gene SbBS512_E0880 details


Gene Information

Locus tag: SbBS512_E0880
Symbol:
ID: 6273276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 820365
End bp: 823838
Gene Length: 3474 bp
Protein Length: 1157 aa
Translation table: 11
GC content: 56%
IMG OID: 641725043
Product: host specificity protein
Protein accession: YP_001879570
Protein GI: 187732605
COG category: [S] Function unknown
COG ID: [COG4733] Phage-related protein, tail component
TIGRFAM ID:
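
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(820365, 823838)
print(length)                     # 3474
print(protein_length_aa(length))  # 1157
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.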


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAAAAG GTGGCGGTAA GGCACACACG CCTCGTGAGG CGAAGGATAA TCTCAAATCC 
ACGCAGATGA TGAGTGTGAT TGATGCGATT GGTGAGGGAC CGATAGAAGG TCCGGTGAAG
GGACTGCAGA GTATTCTGGT GAACAAAACC CCACTGACGG ACACGGACGG CAATCCCGTG
ATACACGGTG TGACCGCGGT CTGGCGTGCC GGGGAGCAGG AGCAGACACC ACCGGAAGGC
TTTGAGTCCT CCGGAGCTGA AACCGGACTG GGCGTGGAAG TGACGAAGGC AAAACCGGTG
ACGCGCACCA TTACGTCCGC GAACATTGAC CGCCTGCGGG TTACCTTCGG GGTGCAGTCA
CTGGTGCAGA CCACGTCAAA GGGCGACCGT AATCCTTCCT CTGTCCGGAT TCTGATTCAG
TTACAGCGTA ATGGCCGCTG GGTGACGGAA AAGGACGTCA CCATTAACGG CAAGACCACC
TCACAGTTCC TGGCCTCGGT GATTCTGGAT AATCTGCCTC CCCGCCCCTT TAACATCCGG
ATGGTCAGGG AGACGGCGGA CAGCACCACG GACCAGCTGC AGAACAGAAC GCTGTGGTCG
TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCCAT TGTGGGGATG
CAGGTGGATG CGGAGCAGTT TGGTGGTCAG CAGATGACGG TGAACTACCA TATCCGCGGT
CGCATCATCC AGGTGCCGTC AAACTATGAC CCGGAAAAAC GCACGTACAG TGGTATCTGG
GACGGCAGTC TGAAACCGGC ATACAGCAAC AACCCGGCCT GGTGCCTGTG GGACATGCTG
ACTCACCCGC GCTACGGCAT GGGAAAACGT CTGGGGGCGG CGGATGTGGA CAAGTGGGCG
CTGTATGCCA TCGGGCAGTA CTGCGACCAG ACGGTCCCGG ATGGTTTCGG GGGGACCGAG
CCGCGGATGA CCTTTAATGC GTACCTGGCA CAACAGCGTA AGGCGTGGGA CGTTCTCAGT
GATTTCTGCT CTGCGATGCG CTGTATGCCG GTATGGAACG GCCAGATGCT GACGTTTGTT
CAGGACCGCC TGTCGGATGT GGTGTGGCCG TACACCAACA GCGATGTGGT GGTGGATGAT
AACGGCGTGG GGTTCCGCTA CAGCTTCAGT GCCCTGAAGG ACCGGCACAC GGCGGTGGAG
GTGAATTACA CCGACCCGCA GAACGGCTGG CAGACCTCCA CGGAACTGGT GGAAGACCCG
GAAGCCATAC TGCGCTACGG GCGCAATCTG CTGAAGATGG ACGCGTTCGG CTGTACCAGC
CGCGGTCAGG CCCACCGTGC CGGACTGTGG GTGATAAAGA CCGAACTGCT GGAAACGCAG
ACGGTGGACT TCACGCTCGG GTCTCAGGGG CTGCGGCACA CACCCGGTGA CATTATTGAA
ATCTGTGATA ATGACTATGC CGGGACCCTG ACCGGTGGAC GTGTCCTGTC CATTGATGCT
GCCACCCGCA CCCTGACGCT GGACCGTGAG GTTACCCTGC CGGAGACAGG TACATCGGCG
GTGAACCTGA TTAACGGCAG CGGTAAGCCG GTGAGTGTGG ACATCACCGC ACACCCCGCG
CCGGACCGGA TACAGGTCAG TACCCTGCCT GATGGTGTGG AGACATACGG GGTGTGGGGA
CTCTCCCTGC CGTCACTGCG CCGTCGCCTG TTCCGCTGTG TCTCCGTCCG GGAAAACACG
GACGGCACCT TTGCCATCAC GGCGGTGCAG CACGTACCGG AAAAAGAAGC CATCGTGGAT
AACGGTGCCC GCTTTGAGCC GCAGTCAGGC TCCCTGAACA GCGTCATCCC ACCGGCAGTG
CAGCACCTGA CGGTGGAGGT GAGCGCAGCT GACGGCCAGT ATCTGGCACA GGCGAAATGG
GACACGCCGC GGGTGGTGAA GGGGGTGCGC TTCAGTCTGC GACTGACCAG CGGAAGCGGA
GAAGACAGCC GTCTGGTGAC CACCGCTATC ACTGCGGATA CAGAGCATCG TTTCAGTGGT
CTGCCGCTCG GGGAATACAC CCTGACAGTC AGGGCAATTA ACAGTTATGG CCAGCAGGGC
GAACCGGCCA CCACCACCTT CCGGATTGCC GCACCGGCAG CACCGTCGCG GATTGAGCTG
ACGCCGGGCT ATTTTCAGAT AACCGCAACG CCACATCTTG CCGTTTATGA CCCGACGGTA
CAGTTTGAGT TCTGGTTCTC GGAAAAGCGG ATTGCGGATA TCAGGCAGGT TGAAACCGCA
GCCCGCTATC TTGGCTCGGC GCTGTACTGG ATAGCTGCCA GTATCAATAT CAAACCGGGC
CATGATTATT ATTTTTATAT CCGCAGTGTG AATACTGTTG GCAAATCGGC ATTCGTGGAG
GCTGTCGGTC GGGCGAGCGA TGATGTGGAA GGTTACCTGG ATTTTTTCAA AGGAGAAATC
GGGAAAACAC ATCTGGCCCA GGAGCTGTGG ACGCAGATTG ATAACGGTCA GCTTGCACCG
GACCTGGCTG AAATCAGGAC GTCCATTACG AATGTCAGCA ATGAAATCAC GCAGACCGTC
AATAAAAAAC TGGAAAATCA GAGTGCGGCA ATCCAGCAGA TACAGAAAGT TCAGGTTGAT
ACAAATAATA ACCTGAACAG CATGTGGGCC GTGAAACTGC AGCAGATGCA GGACGGACGC
CTTTATATTG CGGGTATCGG TGCCGGTATT GAGAATACGC CAGCAGGAAT GCAGAGTCAG
GTGCTGCTGG CGGCAGACAG GATTGCGATG ATTAATCCTG CGAATGGCAA CACAAAGCCG
ATGTTTGTTG GTCAGGGCGA TCAGATATTT ATGAATGAAG TGTTCCTGAA ATATCTGACG
GCTCCCACCA TTACCAGCGG CGGTAATCCT CCGGCATTTT CCCTGACACC GGACGGGCGG
CTGACGGCGA AAAATGCCGA TATCAGCGGT AACGTGAATG CGAACTCCGG GACGCTCAAC
AACGTCACTA TTAACGAGAA CTGTCGGGTT CTGGGAAAAT TGTCCGCGAA CCAGATTGAA
GGCGATCTCG TTAAAACAGT GGGCAAAGCT TTCCCCCGGG ACTCCCGTGC ACCAGAGCGG
TGGCCATCAG GAACCATTAC CGTCAGGGTT TATGACGATC AGCCGTTTGA CCGGCAGATT
GTTATTCCGG CGGTGGCATT CAGCGGCGCT AAACATGAGA AAGAGCATAC TGATATTTAC
TCCTCATGCC GTCTGATAGT GCGGAAAAAC GGTGCTGAAA TTTATAACCG TACCGCGCTG
GATAATACGC TGATTTACAG TGGTGTTATT GATATGCCTG CCGGTCACGG TCACATGACA
CTGGAGTTTT CGGTGTCAGC ATGGCTGGTA AATAACTGGT ATCCCACAGC AAGTATCAGC
GATTTGCTGG TTGTGGTGAT GAAGAAAGCC ACTGCAGGCA TCACGATTAG CTGA
 
Protein sequence
MGKGGGKAHT PREAKDNLKS TQMMSVIDAI GEGPIEGPVK GLQSILVNKT PLTDTDGNPV 
IHGVTAVWRA GEQEQTPPEG FESSGAETGL GVEVTKAKPV TRTITSANID RLRVTFGVQS
LVQTTSKGDR NPSSVRILIQ LQRNGRWVTE KDVTINGKTT SQFLASVILD NLPPRPFNIR
MVRETADSTT DQLQNRTLWS SYTEIIDVKQ CYPNTAIVGM QVDAEQFGGQ QMTVNYHIRG
RIIQVPSNYD PEKRTYSGIW DGSLKPAYSN NPAWCLWDML THPRYGMGKR LGAADVDKWA
LYAIGQYCDQ TVPDGFGGTE PRMTFNAYLA QQRKAWDVLS DFCSAMRCMP VWNGQMLTFV
QDRLSDVVWP YTNSDVVVDD NGVGFRYSFS ALKDRHTAVE VNYTDPQNGW QTSTELVEDP
EAILRYGRNL LKMDAFGCTS RGQAHRAGLW VIKTELLETQ TVDFTLGSQG LRHTPGDIIE
ICDNDYAGTL TGGRVLSIDA ATRTLTLDRE VTLPETGTSA VNLINGSGKP VSVDITAHPA
PDRIQVSTLP DGVETYGVWG LSLPSLRRRL FRCVSVRENT DGTFAITAVQ HVPEKEAIVD
NGARFEPQSG SLNSVIPPAV QHLTVEVSAA DGQYLAQAKW DTPRVVKGVR FSLRLTSGSG
EDSRLVTTAI TADTEHRFSG LPLGEYTLTV RAINSYGQQG EPATTTFRIA APAAPSRIEL
TPGYFQITAT PHLAVYDPTV QFEFWFSEKR IADIRQVETA ARYLGSALYW IAASINIKPG
HDYYFYIRSV NTVGKSAFVE AVGRASDDVE GYLDFFKGEI GKTHLAQELW TQIDNGQLAP
DLAEIRTSIT NVSNEITQTV NKKLENQSAA IQQIQKVQVD TNNNLNSMWA VKLQQMQDGR
LYIAGIGAGI ENTPAGMQSQ VLLAADRIAM INPANGNTKP MFVGQGDQIF MNEVFLKYLT
APTITSGGNP PAFSLTPDGR LTAKNADISG NVNANSGTLN NVTINENCRV LGKLSANQIE
GDLVKTVGKA FPRDSRAPER WPSGTITVRV YDDQPFDRQI VIPAVAFSGA KHEKEHTDIY
SSCRLIVRKN GAEIYNRTAL DNTLIYSGVI DMPAGHGHMT LEFSVSAWLV NNWYPTASIS
DLLVVVMKKA TAGITIS
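
The sequences above are printed in space-separated blocks of ten characters. As a hedged illustration of how such a listing could be processed, the sketch below strips the whitespace, computes the GC fraction, and translates the DNA using the amino-acid assignments of translation table 11, which match those of the standard genetic code. All function names are hypothetical, and only the first 30 bases of the gene sequence are used as sample input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses the same assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0; stop codons are rendered as '*'."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
fragment = clean_sequence("ATGGGAAAAG GTGGCGGTAA GGCACACACG")
print(translate(fragment))  # MGKGGGKAHT
```

The translated fragment matches the first ten residues of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.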