Gene SbBS512_E4042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4042 
Symbol 
ID6272059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3775056 
End bp3776090 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content46% 
IMG OID641727883 
Productputative glycosyl transferase 
Protein accessionYP_001882315 
Protein GI187730435 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.24261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAACA GCACCAATAA ACTTAGTGTT ATTATTCCGT TATATAATGC GGGCGATGAT 
TTCCGCACTT GTATGGAATC TTTAATTACG CAAACCTGGA CTGCTCTGGA AATCATTATT
ATTAACGATG GTTCAACGGA TAATTCTGTT GAAATAGCAA AGCATTACGC AGAAAACTAT
CCGCACGTTC GTTTGTTGCA TCAGGCGAAT GCTGGCGCAT CGGTAGCGCG TAATCGTGGG
ATCGAGGTGG CGACGGGCAA ATATGTCGCT TTTGTCGATG CTGACGATGA GGTCTATCCC
ACCATGTACG AAACGCTGAT GACCATGGCG TTAGAGGACG ACCTCGACGT GGCGCAGTGC
AACGCTGACT GGTCTTTTCG TGAAACGGGA GAAACCTGGC AATCCATCCC CAGCGATCGC
CTTCGCTCAA CCGGCGTATT AACCGGCCCG GACTGGCTGC GGATGGGGCT TTCTTCGCGC
CGTTGGACTC ACGTTGTCTG GATGGGGGTT TATCGCCGTG ATGTTATTGT TAAAAATAAC
ATTAAATTTA TTGCCGGATT ACATCATCAG GATATTGTCT GGACAACAGA ATTCATGTTT
AACGCGCTGC GTGCGCGATA TACCGAGCAA TCATTATATA AATATTATCT GCATAATACG
TCAGTGAGTC GGTTGCATAG ACAAGGGAAT AAAAACCTTA ATTATCAACG TCACTATATT
AAGATTACCC GCCTGCTGGA GAAATTAAAT CGAAATTATG CTGACAAAAT TACGATTTAT
CCGGAATTTC ATCAGCAAAT AACTTACGAA GCATTGCGTG TTTGCCATGC GGTGCGCAAA
GAGCCGGATA TTCTTACCCG CCAACGGATG ATTGCCGAGA TATTTACTTC CGGTATGTAT
AAGCGCCTGA TTACCAATGT GCGCAGCGTG AAGGTCGGTT ACCAGGCGTT ACTGTGGTCT
TTCCGCTTAT GGCAATGGCG CGACAAAACG CGGTCGCACC ATCGCATTAC GCGTAGCGCC
TTTAATTTGC GCTAG
 
Protein sequence
MMNSTNKLSV IIPLYNAGDD FRTCMESLIT QTWTALEIII INDGSTDNSV EIAKHYAENY 
PHVRLLHQAN AGASVARNRG IEVATGKYVA FVDADDEVYP TMYETLMTMA LEDDLDVAQC
NADWSFRETG ETWQSIPSDR LRSTGVLTGP DWLRMGLSSR RWTHVVWMGV YRRDVIVKNN
IKFIAGLHHQ DIVWTTEFMF NALRARYTEQ SLYKYYLHNT SVSRLHRQGN KNLNYQRHYI
KITRLLEKLN RNYADKITIY PEFHQQITYE ALRVCHAVRK EPDILTRQRM IAEIFTSGMY
KRLITNVRSV KVGYQALLWS FRLWQWRDKT RSHHRITRSA FNLR