Gene SbBS512_E4026 details

Gene Information

Locus tag: SbBS512_E4026
Symbol:
ID: 6270270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3757241
End bp: 3759502
Gene Length: 2262 bp
Protein Length: 753 aa
Translation table: 11
GC content: 51%
IMG OID: 641727866
Product: haemagglutinin family
Protein accession: YP_001882298
Protein GI: 187731863
COG category: [U] Intracellular trafficking, secretion, and vesicular transport; [W] Extracellular structures
COG ID: [COG5295] Autotransporter adhesin
TIGRFAM ID:
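
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


length = gene_length(3757241, 3759502)
print(length)                  # 2262
print(protein_length(length))  # 753
```

Both values agree with the Gene Length and Protein Length fields of this record.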


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCGACGA ATACCACCAA TATCGCCAAT AACACTTCCA ATATTGCCAC TAACACCACC 
AACATCTCTA ATCTGACTGA GACGGTGACT AATCTTGGTG AGGATGCGCT GAAATGGGAT
AAGGACAATG GTGTATTCAC GGCAGCTCAT GGCACCGAGA CCACCAGCAA AATCACCAAC
GTTAAAGATG GCGACCTGAC GACTGGCAGC ACCGATGCCG TTAACGGCTC TCAGCTGAAA
ACCACCAACG ATGCCGTGGC GACGAATACC ACCAATATCG CCACTAACAC CACCAACATC
TCTAATCTGA CTGAGACGGT GACTAATCTT GGTGAGGATG CGCTGAAATG GGATAAGGAC
AATGGTGTCT TCACTGCAGC TCATGGCAAC AATACCGCCA GCAAAATCAC CAATATCCTG
GACGGCACAG TCACTGCAAC CAGTTCCGAT GCCATTAACG GTAGCCAGCT TTATGACTTA
AGCAGCAATA TCGCCACCTA CTTCGGCGGC AATGCTTCTG TGAATACTGA CGGTGTGTTT
ACCGGTCCAA CCTACAAAAT CGGTGAAACA AATTATTATA ACGTCGGCGA TGCACTGGCT
GCGATTAACT CCTCATTTAG CACGTCTCTC GGCGATGCTC TGCTTTGGGA TGCCACCGCA
GGTAAATTCA GTGCCAAACA CGGTACTAAT GGTGACGCAA GCGTGATCAC TGATGTCGCA
GATGGTGAAA TTTCAGACTC CAGTTCTGAC GCAGTAAACG GCTCACAACT CCACGGCGTG
AGCAGTTATG TTGTTGATGC GCTGGGGGGT GGTGCCGAAG TCAATGCAGA CGGCACCATC
ACTGCGCCGA CGTACACCAT TGCTAATGCT GATTACGATA ATGTCGGTGA TGCCCTGAAT
GCTATCGATA CCACTCTTGA CGACGCTCTG CTCTGGGATG CGGACGCCGG TGAAAATGGT
GCATTTAGCG CCGCTCATGG AAAAGATAAA ACTGCCAGTG TAATCACTAA CGTCGCTAAC
GGTGCAATCT CTGCTGCCAG CAGCGACGCG ATTAACGGCT CACAACTCTA TACCACCAAT
AAGTACATCG CTGATGCGCT GGATGGTGAC GCAGAAGTCA ACGCTGACGG CACCATCACC
GCACCGACTT ACACCATTGC GAACGCCGAG TACAACAACG TCGGTGACGC CCTGGATGCG
CTTGATGATA ACGCCCTGCT GTGGGATGAG ACTGCCAATG GCGGTGCTGG AGCCTACAAT
GCCAGCCATG ACGGTAAAGC CAGCATCATC ACTAATGTCG CTAATGGCAG TATTAGTGAG
GACAGTACCG ATGCAGTGAA CGGTTCTCAG TTGAATGCGA CGAATATGAT GATTGAGCAG
AACACCCAAA TTATCAATCA GCTCGCTGGT AACACCGACG CAACCTATAT CCAAGAAAAC
GGTGCGGGTA TTAACTATGT GCGTACTAAC GACGACGGCT TAGCGTTCAA CGACGCCAGC
GCACAGGGTG TTGGCGCTAC AGCTATAGGT TATAACTCTG TCGCCAAAGG CGATAGCAGC
GTAGCTATTG GTCAGGGCAG CTACAGCGAC GTTGATACGG GTATCGCCCT AGGTAGCAGC
TCTGTTTCCA GCCGAGTGAT TGCCAAAGGC TCCCGTGACA CCAGCATAAC GGAAAATGGC
GTTGTTATTG GTTACGACAC CACGGATGGC GAACTGCTCG GTGCATTGTC TATCGGTGAT
GACGGTAAAT ATCGTCAAAT CATCAACGTA GCCGATGGTT CCGAAGCCCA TGACGCCGTT
ACGGTTCGTC AATTGCAGAA TGCGATTGGT GCGGTCGCAA CCACGCCGAC TAAATACTTC
CACGCTAATT CAACGGAAGA AGATTCACTG GCAGTGGGAA CTGACTCGCT GGCAATGGGT
GCGAAAACCA TCGTGAATGG CGATAAAGGT ATTGGTATCG GTTATGGTGC CTACGTGGAC
GCGAATGCAC TTAACGGCAT TGCCATTGGT AGCAATGCGC AAGTCATTCA TGTCAACAGT
ATTGCGATAG GTAATGGTTC TACGACCACT CGTGGCGCTC AAACCAATTA TACCGCCTAC
AACATGGACG CACCGCAGAA CTCTGTCGGT GAATTCTCAG TCGGTAGTGC GGATGGTCAA
CGTCAGATCA CAAACGTCGC AGCTGGTTCA GCGGATACCG ATGCGGTTAA CGTGGGTCAG
TTGAAAGTCA CTGATGAGCG CGTAGCGCAA AATACCCAGT AG
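
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any analysis. The sketch below shows one way to strip the layout and compute a GC fraction like the one reported in the GC content field; the helper names are hypothetical, and the example runs on the first block only rather than on the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


first_block = clean_sequence("GTGGCGACGA ")
print(len(first_block))          # 10
print(gc_fraction(first_block))  # 0.7
```

Pasting the complete gene sequence into `clean_sequence` and passing the result to `gc_fraction` gives the whole-gene value for comparison with the record.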
 
Protein sequence
MATNTTNIAN NTSNIATNTT NISNLTETVT NLGEDALKWD KDNGVFTAAH GTETTSKITN 
VKDGDLTTGS TDAVNGSQLK TTNDAVATNT TNIATNTTNI SNLTETVTNL GEDALKWDKD
NGVFTAAHGN NTASKITNIL DGTVTATSSD AINGSQLYDL SSNIATYFGG NASVNTDGVF
TGPTYKIGET NYYNVGDALA AINSSFSTSL GDALLWDATA GKFSAKHGTN GDASVITDVA
DGEISDSSSD AVNGSQLHGV SSYVVDALGG GAEVNADGTI TAPTYTIANA DYDNVGDALN
AIDTTLDDAL LWDADAGENG AFSAAHGKDK TASVITNVAN GAISAASSDA INGSQLYTTN
KYIADALDGD AEVNADGTIT APTYTIANAE YNNVGDALDA LDDNALLWDE TANGGAGAYN
ASHDGKASII TNVANGSISE DSTDAVNGSQ LNATNMMIEQ NTQIINQLAG NTDATYIQEN
GAGINYVRTN DDGLAFNDAS AQGVGATAIG YNSVAKGDSS VAIGQGSYSD VDTGIALGSS
SVSSRVIAKG SRDTSITENG VVIGYDTTDG ELLGALSIGD DGKYRQIINV ADGSEAHDAV
TVRQLQNAIG AVATTPTKYF HANSTEEDSL AVGTDSLAMG AKTIVNGDKG IGIGYGAYVD
ANALNGIAIG SNAQVIHVNS IAIGNGSTTT RGAQTNYTAY NMDAPQNSVG EFSVGSADGQ
RQITNVAAGS ADTDAVNVGQ LKVTDERVAQ NTQ
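
The gene sequence begins with GTG, yet the protein begins with M. This follows from translation table 11 (the bacterial, archaeal and plant plastid code), in which GTG is one of several alternative start codons and an initiating codon is read as methionine. The sketch below illustrates this with a hand-built codon table in plain Python; `translate_cds` is a hypothetical helper, and the start-codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Standard NCBI codon ordering: bases T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a valid first codon as Met."""
    sequence = "".join(dna.split()).upper()
    if len(sequence) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for position in range(0, len(sequence), 3):
        codon = sequence[position:position + 3]
        if position == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon, whatever it normally encodes
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGGCGACGA ATACCACCAA TATCGCCAAT"))  # MATNTTNIAN
```

The output matches the first 10 residues of the protein sequence above. Outside the initiator position GTG encodes valine, which is why the table lookup is bypassed only for the first codon.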