Gene SbBS512_E0873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0873 
Symbol 
ID6268763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp815599 
End bp816924 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content51% 
IMG OID641725039 
Productside tail fiber protein 
Protein accessionYP_001879566 
Protein GI187734209 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCAGCAT CACCCAATTA TGCGACGAGT GTAAATATCC ATTGGGATTG TACTGCAAAT 
GCGTCAGTTT CTGTTTATAC CTCGCCAACA TATTCAGCGA GTAAGCCTTC CAGCGTTACC
TATGGTGTTG TTTATACGAT GTATAGCTCA CATCAGAAAC CTACACCATC AGATATTGGA
GCGCTGCCAA CGACTGGAGG GACTATTTCA GGTCCGTTGT TTGTTACTGA TGGGATCACC
GGGGCACTGA AGGGGAACGC CGATACCGCG ACGAAACTTG CGGCAGCCCC AAAAATTAAC
GGTGTTAAGT TTGATGGCTC GGCGGATATT AACCTCACGC CGGAAAATAT TGGTGCATTT
GCCCGACGTT CGACGGGGGC TTATGCGGAT TCGGATGGAG CCGTTCCCTG GAATGCCGAA
TCAGGCGCTT ACAATGTCAC CCGCTCTGGC GACAGCTATA TTCTGGTTAA CTTCTATACC
GGAGTCGGAA GTTGCCGGAC CCTGCAGATG AAGGCGCATT ACAGAAATGG TGGTCTGTTC
TACCGTTCTT CAAGAGACGG TTATGGTTTT GAGGAAGGCT GGGCAGAAGT TTATACCTCG
AAAAATCTTC CACCAGAAAG CTACCCAGTC GGCGCACCAA TCCCGTGGCC ATCAGATACC
GTTCCGTCTG GTTATGCCCT GATGCAGGGG CAGACTTTTG ACAAATCTGC TTACCCGAAA
CTTGCAGCCG CTTATCCGTC AGGCGTGATC CCTGATATGC GTGGCTGGAC GATTAAGGGC
AAACCTGCCA GTGGTCGGGC CGTATTGTCT CAGGAACAGG ACGGCATTAA ATCGCACACC
CACAGCGCCA GCGCATCCAG TACGGATTTG GGGACGAAAA CCACATCGTC GTTTGATTAC
GGCACTAAAT CCACGAATAA CACCGGGGCG CATACGCACA GTCTGAGTGG CTCTACGGGG
TCTGCCGGTG TTCATACTCA TGGTAATGGT ATTCGTTGGC CAAGAGGCGG CGGTTCTGCG
TTAGCATTTT ATGATGGCGG TGGGTTCACT TATGTCCAGA ATTCACAGTA TCAAGTAAGC
CCGGGGACTT CTTCCCGTAG ATCGTATTAT CAACGTATTC AGACACAGTC AGCAGGTGCT
CATACCCACT CGCTGTCTGG TACTGCAGCA AGTTCTGGCG CACATGCACA TACTGTAGGT
ATTGGTGCGC ATACGCACTC CGTTGCGATT GGTTCACATG GACACACCAT CACCGTTAAC
GCTGCGGGTA ACGCGGAAAA CACCGTCAAA AACATCGCAT TTAACTATAT TGTGAGGCTT
GCATAA
 
Protein sequence
MAASPNYATS VNIHWDCTAN ASVSVYTSPT YSASKPSSVT YGVVYTMYSS HQKPTPSDIG 
ALPTTGGTIS GPLFVTDGIT GALKGNADTA TKLAAAPKIN GVKFDGSADI NLTPENIGAF
ARRSTGAYAD SDGAVPWNAE SGAYNVTRSG DSYILVNFYT GVGSCRTLQM KAHYRNGGLF
YRSSRDGYGF EEGWAEVYTS KNLPPESYPV GAPIPWPSDT VPSGYALMQG QTFDKSAYPK
LAAAYPSGVI PDMRGWTIKG KPASGRAVLS QEQDGIKSHT HSASASSTDL GTKTTSSFDY
GTKSTNNTGA HTHSLSGSTG SAGVHTHGNG IRWPRGGGSA LAFYDGGGFT YVQNSQYQVS
PGTSSRRSYY QRIQTQSAGA HTHSLSGTAA SSGAHAHTVG IGAHTHSVAI GSHGHTITVN
AAGNAENTVK NIAFNYIVRL A