Gene SbBS512_E0110 details


Gene Information

Locus tag: SbBS512_E0110
Symbol:
ID: 6273265
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 121033
End bp: 122886
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 54%
IMG OID: 641724366
Product: hypothetical protein
Protein accession: YP_001878925
Protein GI: 187731189
COG category:
COG ID:
TIGRFAM ID:
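
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a complete CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(121033, 122886)
print(length)                     # 1854
print(protein_length_aa(length))  # 617
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.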


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0222206
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATGA CTTTGCCGTT TAAACCCCAT GTGCTGGCAC TAATTTGCAG TGCCGGGCTT 
TGTGCCGCCT CTGCCGGGCT ATATATAAAA AGCCGCACAG TGGAAGCGGC TGTGGAAACG
CAATCGACAC AACTGGCTGT GTCTGACGCT GCCGCGGTTA CGCTTCCTGC AACGGTTTCC
GCGCCTCCCG TAACACCCGC CGTCGTTAAA TCCGCATTCA GCACTGCACA AATAGATCAA
TGGGTCGCGC CCGTCGCGCT GTATCCCGAC GCCCTACTTT CGCAGGTGCT GATGGCATCA
ACCTATCCGG CAAACGTTGC TCAAGCAGTG CAATGGTCGC ACGATAATCC ACTTAAACAA
GGCGATGCTG CTATTCAGGC GGTATCTGAC CAGCCGTGGG ACGCCAGCGT TAAATCACTG
GTGGCCTTTC CACAATTGAT GGCATTGATG GGCGAAAACC CGCAATGGGT GCAAAACCTG
GGCGATGCTT TTCTGGCGCA GCCGCAGGAC GTGATGGACT CGGTACAGCG ATTGCGGCAA
CTGGCGCAAC AAACCGGTTC GCTGAAGTCA TCAAACGAAC AAAAAGTTAT TACCACAACG
AAGAAAGCTG TACCGGTAAA ACAGACAGTC ACGGCACCCG TCATACCATC CAATACCGTT
TTAACTGCCA GCCCCGTCAT TACAGAGCCT GCAACAACCG TCATTTCCAT TGAGCCCGCC
AATCCTGATG TGGTCTATAT TCCCAACTAC AACCCAACCG TGGTTTACGG GAACTGGGCC
AATACTGCGT ATCCGCCGGT TTATCTGCCA CCACCAGCCG GAGAACCGTT TATTGACAGC
TTTGTGCGCG GATTCGGCTA TAGCATGGGT GTTGCTACCA CGTACGCACT ATTCAGCAGC
ATCGACTGGG ATGACGACGA TCATGACCAT CATCATCATG ACGATGATGA TTATCATCAC
CACGATGGCG GTCATCGTGA CGGTAATGGC TGGCAACACA ACGGCGACAA CATCAATATC
GACGTCAACA ATTTCAACCG TATCACCGGT GAGCATCTTA CTGATAAGAA TATGGCATGG
CGGCACAATC CAAACTACCG TAATGGTGTG CCCTATCATG ATCAGGATAT GGCAAAGCGG
TTTCATCAAA CCGATGTCAA CGGCGGAATG AGTGCCACGC AGCTACCTGC TCCAACACGC
GACAGCCAGC GTCAGGCGGC AGCAAGTCAG TTTCAGCAAC GAACACACGC CGCCCCCGTC
ATTACACGAG ATACCCAACG TCAGGCAGCG GCACAGCGGT TTAATGAAGC TGAACACTAT
GGGAGCTATG ACGACTTCCG CGACTTCAGC CGTCGCCAAC CCCTGACCCA GCAACAAAAG
GACGCCGCTC GTCAGCGTTA TCAGTCAGCT TCTCCTGAGC AGCGCCAGGC AGTTCGCGAG
AGAATGCAGA CTAACCCGCA GATCCAGCAG CGAAGAGAGG CAGCGCGTGA GCGCATTCAG
CCCGCCTCGC CTGAGCAGCG CCAGGCAGTC CGCGAGAAAA TGCAGACTAA CCCACAGATC
CAGCAGCGAA GAGACGCAGC GCGTGAGCGT ATTCAGTCAG CCTCGCCTGA GCAGCGCCAG
GTGTTTAAGG AAAAAGTACA GCAGCGCCCA CTGAACCAAC AGCAACGTGA TAACGCCCGC
CAGCGTGTTC AATCAGCATC ACCTGAACAA CGTCAGGTTT TTCGGGAGAG AGTTCAGGAG
AGCCGCCCAC AACGTCTAAA CGACAGTAAC CGTACTGCCA GATTGAATAA CGATCAACGG
TCAGCAGTAC GCGAACGTCT CTCTGAGCGC GGAGCAAGGC GACTGGAAAG GTAA
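
The gene sequence above is printed in space-separated blocks of ten bases, so it needs light cleaning before any analysis. As a rough sketch, the helpers below (hypothetical names, not taken from the source database) strip the whitespace and compute a GC percentage; how the database rounds its reported GC content is not stated, so the result is only expected to be comparable to the 54% field, not necessarily identical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean_sequence(raw)
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Small illustrative input, not the full gene:
print(clean_sequence("ATGC GCAT"))  # ATGCGCAT
print(gc_content("ATGC GCAT"))      # 50.0
```

To apply it to this record, paste the full gene sequence as a multi-line string and pass it to `gc_content`.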
 
Protein sequence
MKMTLPFKPH VLALICSAGL CAASAGLYIK SRTVEAAVET QSTQLAVSDA AAVTLPATVS 
APPVTPAVVK SAFSTAQIDQ WVAPVALYPD ALLSQVLMAS TYPANVAQAV QWSHDNPLKQ
GDAAIQAVSD QPWDASVKSL VAFPQLMALM GENPQWVQNL GDAFLAQPQD VMDSVQRLRQ
LAQQTGSLKS SNEQKVITTT KKAVPVKQTV TAPVIPSNTV LTASPVITEP ATTVISIEPA
NPDVVYIPNY NPTVVYGNWA NTAYPPVYLP PPAGEPFIDS FVRGFGYSMG VATTYALFSS
IDWDDDDHDH HHHDDDDYHH HDGGHRDGNG WQHNGDNINI DVNNFNRITG EHLTDKNMAW
RHNPNYRNGV PYHDQDMAKR FHQTDVNGGM SATQLPAPTR DSQRQAAASQ FQQRTHAAPV
ITRDTQRQAA AQRFNEAEHY GSYDDFRDFS RRQPLTQQQK DAARQRYQSA SPEQRQAVRE
RMQTNPQIQQ RREAARERIQ PASPEQRQAV REKMQTNPQI QQRRDAARER IQSASPEQRQ
VFKEKVQQRP LNQQQRDNAR QRVQSASPEQ RQVFRERVQE SRPQRLNDSN RTARLNNDQR
SAVRERLSER GARRLER
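
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the database's own pipeline: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation (table 11 differs mainly in which codons may act as start codons), and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    dna = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record:
print(translate("ATGAAAATGA CTTTGCCGTT TAAACCCCAT"))  # MKMTLPFKPH
```

The output matches the first ten residues of the protein sequence listed above; translating the full gene sequence the same way should yield the 617-residue protein if the record is internally consistent.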