Gene SbBS512_E1014 details

Gene Information

Locus tag: SbBS512_E1014
Symbol:
ID: 6268463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 936922
End bp: 937980
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 52%
IMG OID: 641725158
Product: putative prophage tail fiber protein
Protein accession: YP_001879680
Protein GI: 187730611
COG category:
COG ID:
TIGRFAM ID:
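
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. All variable names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 936922
end_bp = 937980
reported_gene_length = 1059     # bp
reported_protein_length = 352   # aa

# Assumption: Start/End are 1-based, inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1059 352
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.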


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTCTGTAG TGATATCAGG TGCGCTGACT GATGGCGCAG GCATCCCCAT GTCCGGATGC 
CAGATTATTC TGAAATCCCG TGTAAACACC TCAGAAGTGG TGATGCGTAC CAGGGCTGAT
GTAGTGACCG GAAATAACGG TGAATATTCG TTTGAGGCAC AGGTCGGAAA ATATTGCGTG
TATCTGAAAC GGGACTGGCG CGACGAGTAC TGTGTTGGCG ACATTGCTGT ATACGACGAC
TCAAAGCCCG GCACTCTGAA AACGGCTAAC CATCTGTCTG AAATCGCAGC AGCAGGCGAA
AAGGCACAAC AGAAGTCCCG GGATAATCTG GGGCTGAAAA GTGCGGCCAC GATGGAAGCA
CAGAGCGACA TTTACGACCG GACAAAAGGC CGTCTGGCGA TACCCGGCGC ATTCGGCTTT
GGGTGTGCTT TTCTGCCTGA AGATGTTATC CGTTTTGACA CTAAGAGTGA TTTCCAGGCC
TGGGTAAGGA ATGCGCTGCC AGGTGAATAT TCCGTTGCTG GCCCCTACGA CATCATCATA
CCCGACACAC GGTTTGAAGG GGTGCTCAGC ATCCGGTGGA CTGATGCACG CCCTGAGACA
ACAGAACCGC GGTACAGAGC CAAATCCCTT ACTTTTTACG GCATTAACGG CCCCATTTAT
CACACCCGCT ACTGCTACTG GCCCATATCC AGACTGACTG ACTGGGTGAA AATAAATATA
ACCACAGAAG ATATTATTTA CAGAATCGTG GCGAGCTCTG TCCGCAACAG ATGGGGAGAC
CCTGACATTG GCGGGCTGAT TATTGCTGCG TACCAGGGAG AAGCTGACGG TGATAAAGTC
ATCAGACTTG TCAGGGGGCA GTCATACAGA GGCTCACGAC TGGGACCGGT GGGGATTTCA
GTGCCCAGTA CTCCCACCGG AACGTATATA GCATCCCCAC AATTTTTCAT TACGGGATGT
TCAGAGCATT CATTACCGGG GTCATATTGC GCCCTGTCCG GGGTGCCGGA TGCTCATGTC
TCTGGCGCAA TGCCCGGGCT TTTTATTCGC ACATCGTGA
 
Protein sequence
MSVVISGALT DGAGIPMSGC QIILKSRVNT SEVVMRTRAD VVTGNNGEYS FEAQVGKYCV 
YLKRDWRDEY CVGDIAVYDD SKPGTLKTAN HLSEIAAAGE KAQQKSRDNL GLKSAATMEA
QSDIYDRTKG RLAIPGAFGF GCAFLPEDVI RFDTKSDFQA WVRNALPGEY SVAGPYDIII
PDTRFEGVLS IRWTDARPET TEPRYRAKSL TFYGINGPIY HTRYCYWPIS RLTDWVKINI
TTEDIIYRIV ASSVRNRWGD PDIGGLIIAA YQGEADGDKV IRLVRGQSYR GSRLGPVGIS
VPSTPTGTYI ASPQFFITGC SEHSLPGSYC ALSGVPDAHV SGAMPGLFIR TS
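
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the tool that produced this record. It assumes that translation table 11 assigns amino acids to codons exactly as the standard genetic code does, which is the case apart from the set of permitted start codons, and it translates only the first three 10-base blocks and the final block of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Amino acids for all 64 codons, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


first_blocks = "ATGTCTGTAG TGATATCAGG TGCGCTGACT"  # start of the gene sequence
last_block = "ACATCGTGA"                           # end of the gene sequence

print(translate(first_blocks))  # prints: MSVVISGALT
print(translate(last_block))    # prints: TS
```

The two results match the first ten and the last two residues of the protein sequence listed above. The closing TGA codon is read as a stop.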