Gene SbBS512_E2423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2423 
Symbol 
ID6271897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2219827 
End bp2221587 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content51% 
IMG OID641726418 
Producthypothetical protein 
Protein accessionYP_001880900 
Protein GI187730165 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00238381 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAAA CATTTATCCC CGGCAAAGAT GCCGCTCTGG AAGATTCCAT CGCTCGCTTC 
CAGCAAAAAA TTTCAGACCT CGGCTTTCAG ATTGAAGAGG CCTCCTGGCT GAATCCTGTG
CCTAACGTCT GGTCTGTACA TATTCGCGAC AAAGAGTGCG CACTGTGTTT TACCAACGGT
AAAGGCGCAA CCAAAAAAGC GGCGCTGGCT TCTGCACTTG GTGAATATTT CGAGCGTCTC
TCAACCAACT ACTTTTTTGC TGATTTCTGG CTGGGCGAAA CCATCGCCAA CGGTCCGTTC
GTGCATTATC CCAACGAAAA ATGGTTCCCA CTGACCGAAA ATGACGATGT GCCAGAAGGA
CTACTCGATG ACCGTCTGCG CGCGTTTTAT GATCCGGAGA ATGAACTGAC CGGCAGCATG
CTGATTGACC TACAATCCGG TAACGAAGAT CGTGGTATTT GCGGCCTGCC GTTTACGCGT
CAGTCCGACA ATCAGACCGT TTATATTCCG ATGAATATCA TTGGTAACCT GTACGTCTCC
AACGGTATGT CCGCAGGTAA TACCCGCAAC GAAGCACGCG TTCAGGGATT GTCTGAAGTT
TTCGAACGCT ACGTGAAAAA CCGCATTATT GCTGAAAGCA TCAGCCTGCC AGAGATCCCG
GCAGACGTGC TGGCGCGTTA CCCAGCGGTG GTTGAAGCGA TCGAAACACT GGAAGCAGAA
GGTTTCCCGA TCTTCGCATA TGATGGTTCC CTTGGCGGCC AGTATCCGGT GATTTGCGTG
GTACTGTTCA ATCCTGCTAA CGGTACCTGC TTTGCCTCTT TCGGTGCGCA TCCTGATTTT
GGCGTAGCAC TGGAACGTAC CGTGACCGAG CTGCTGCAAG GTCGTGGCCT GAAAGATTTG
GATGTGTTTA CTCCGCCAAC CTTCGATGAT GAAGAAGTCG CTGAACATAC CAACCTCGAA
ACGCACTTTA TCGATTCCAG CGGTTTAATC TCCTGGGACC TGTTCAAGCA GGATGCCGAT
TATCCGTTTG TGGACTGGAA TTTCTCCGGC ACCACGGAAG AAGAGTTTGC TACGCTGATG
GCTATCTTCA ACAAAGAAGA TAAAGAAGTT TATATTGCCG ATTACGAGCA TCTGGGCGTT
TATGCTTGCC GTATTATCGT GCCTGGCATG TCCGATATTT ATCCGGCTGA AGATCTGTGG
CTCGCGAATA ACAGTATGGG CAGCCATTTA CGTGAAACGA TTCTTTCGCT ACCAGGCAGC
GAGTGGGAAA AAGAAGATTA CCTGAACCTC ATCGAGCAAC TGGATGAAGA AGGTTTTGAT
GACTTTACCC GCGTGCGTGA GCTGTTGGGT CTGGCGACCG GGTCGGATAA CGGTTGGTAC
ACCCTGCGTA TCGGTGAATT AAAAGCCATG CTGGCGCTGG CTGGTGGCGA TCTGGAACAG
GCTCTGGTCT GGACCGAATG GACGATGGAG TTTAACTCAT CAGTATTTAG TCCGGAACGC
GCCAACTATT ATCGCTGCCT GCAAACGTTG TTATTACTGG CACAGGAAGA AGATCGCCAG
CCGCTGCAAT ATCTGAATGC GTTTGTTCGC ATGTACGGCG CAGATGCCGT GGAAGCCGCC
AGTGCGGCAA TGAGCGGCGA AGCGGCGTTT TACGGCCTGC AACCGGTAGA TAGCGATCTG
CACGCGTTTG CTGCACATCA GTCGTTGTTG AAGGCCTACG AAAAGCTGCA GCGCGCCAAA
GCAGCATTCT GGGCAAAATA A
 
Protein sequence
MTQTFIPGKD AALEDSIARF QQKISDLGFQ IEEASWLNPV PNVWSVHIRD KECALCFTNG 
KGATKKAALA SALGEYFERL STNYFFADFW LGETIANGPF VHYPNEKWFP LTENDDVPEG
LLDDRLRAFY DPENELTGSM LIDLQSGNED RGICGLPFTR QSDNQTVYIP MNIIGNLYVS
NGMSAGNTRN EARVQGLSEV FERYVKNRII AESISLPEIP ADVLARYPAV VEAIETLEAE
GFPIFAYDGS LGGQYPVICV VLFNPANGTC FASFGAHPDF GVALERTVTE LLQGRGLKDL
DVFTPPTFDD EEVAEHTNLE THFIDSSGLI SWDLFKQDAD YPFVDWNFSG TTEEEFATLM
AIFNKEDKEV YIADYEHLGV YACRIIVPGM SDIYPAEDLW LANNSMGSHL RETILSLPGS
EWEKEDYLNL IEQLDEEGFD DFTRVRELLG LATGSDNGWY TLRIGELKAM LALAGGDLEQ
ALVWTEWTME FNSSVFSPER ANYYRCLQTL LLLAQEEDRQ PLQYLNAFVR MYGADAVEAA
SAAMSGEAAF YGLQPVDSDL HAFAAHQSLL KAYEKLQRAK AAFWAK